How to add string values of a column with a specific condition in a new column

Question

I have a DataFrame with a few columns and many rows.

I want to create a new column (C) that joins the values of another column (A) into a single string for all rows that share the same value in a third column (B).

In the end, each group (rows with an identical value in B) should have its own string in column C, different from the strings of the other groups.

A	B	New Column C
First	1	First_Third
Second	22	Second_Fourth
Third	1	First_Third
Fourth	22	Second_Fourth

Something like this pseudocode:

for x in df[B]:
    if (x "is identical to" x "of another row"):
        df[C] = df[C].cat(df[A])

How do I write code that does this?

Answer 1

Try this:

df['C'] = df.groupby('B')['A'].transform(lambda x: '_'.join(x))
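For reference, here is a minimal self-contained sketch of the same approach. The DataFrame construction is an assumption reconstructed from the table in the question, not the asker's actual data.

```python
import pandas as pd

# Sample data rebuilt from the question's table (assumed, for illustration)
df = pd.DataFrame({
    "A": ["First", "Second", "Third", "Fourth"],
    "B": [1, 22, 1, 22],
})

# Group rows by B, join each group's A values with "_",
# and let transform broadcast the joined string back to every row of the group
df["C"] = df.groupby("B")["A"].transform(lambda x: "_".join(x))

print(df)
```

Because `transform` returns a result aligned with the original index, the assignment works without any merge step. Note that `str.join` expects strings, so if column A held non-string values it would probably need something like `.astype(str)` first.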

Answer 2

You can use:

df['C'] = df.groupby('B')['A'].transform('_'.join)

Or, if you want to keep only unique values:

df['C'] = df.groupby('B')['A'].transform(lambda x: '_'.join(x.unique()))

Output:

        A   B              C
0   First   1    First_Third
1  Second  22  Second_Fourth
2   Third   1    First_Third
3  Fourth  22  Second_Fourth
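The two variants only differ when a group contains repeated values in A. The sketch below illustrates that on hypothetical data with a duplicate; the column names `C_all` and `C_unique` are made up for this example.

```python
import pandas as pd

# Hypothetical data: "First" appears twice within group B == 1
df = pd.DataFrame({
    "A": ["First", "First", "Third", "Second"],
    "B": [1, 1, 1, 22],
})

grouped = df.groupby("B")["A"]

# Plain join keeps every value, including repeats
df["C_all"] = grouped.transform("_".join)

# unique() drops repeats while keeping order of first appearance
df["C_unique"] = grouped.transform(lambda x: "_".join(x.unique()))

print(df)
```

With this data, group 1 gets `First_First_Third` in `C_all` but `First_Third` in `C_unique`, while the single-row group 22 gets `Second` in both.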
