以另外两列pandas为条件创建一个新列

Question

I have a dataframe with two columns.我有一个包含两列的数据框。 I want to create a new column and input whichever column has the longest string.我想创建一个新列并输入具有最长字符串的列。 so所以

        column_a        column_b             column_c

   0  'dog is fast'   'dog is faster'      'dog is faster' (desired output)

I tried this code but got an error saying that int is not iterable, I was thinking in merging the series after to the df.我试过这段代码，但得到一个错误，说 int 不可迭代，我正在考虑将系列后合并到 df。 I wasn't sure how to implement it right away into a column of the df.我不确定如何立即将它实施到 df 的列中。

column_c = pd.Series()

 for i in len(df.column_a):
  if len(df.column_a.iloc[i]) >= len(df.column_b.iloc[0]):
    column_c.append(df.column_a.iloc[i])
  else:
    column_c.append(df.column_b.iloc[i])

any help is apreciated.任何帮助都值得赞赏。

Answer 1

Use pandas.DataFrame.apply :使用pandas.DataFrame.apply ：

Given sample data给定样本数据

import pandas as pd

df = pd.DataFrame([['fast', 'faster'], ['slower', 'slow']])
        0       1
0    fast  faster
1  slower    slow

df['column_c'] = df.apply(lambda x:max(x, key=len), 1)

Output:输出：

        0       1 column_c
0    fast  faster   faster
1  slower    slow   slower

Answer 2

Using np.where with str.len使用np.where和str.len

df['column_c']=np.where(df.column_a.str.len()>df.column_b.str.len(),df.column_a,df.column_b)
df
Out[301]: 
        column_a         column_b         column_c
0  'dog is fast'  'dog is faster'  'dog is faster'

Answer 3

可以使用 df.apply()

df['column_c'] = df.apply(lambda x: x[0] if len(x[0]) > len(x[1]) else x[1], axis=1)

Answer 4

You can use DataFrame.apply .您可以使用DataFrame.apply 。 You need to apply on specific columns if you have more than two columns in your dataframe如果数据框中有两列以上，则需要对特定列进行应用

df['column_c'] = df.apply(lambda x: x[0] if len(x[0]) > len(x[1]) else x[1], axis = 1)

     column_a        column_b        column_c
0   'dog is fast'   'dog is faster' 'dog is faster'

以另外两列pandas为条件创建一个新列

问题描述

4 个解决方案

解决方案1
3 已采纳 2019-03-29 02:30:25

解决方案2
2 2019-03-29 02:33:26

解决方案3
2 2019-03-29 05:36:20

解决方案4
0 2019-03-29 02:29:33

以另外两列pandas为条件创建一个新列

问题描述

4 个解决方案

解决方案1 3 已采纳 2019-03-29 02:30:25

解决方案2 2 2019-03-29 02:33:26

解决方案3 2 2019-03-29 05:36:20

解决方案4 0 2019-03-29 02:29:33

解决方案1
3 已采纳 2019-03-29 02:30:25

解决方案2
2 2019-03-29 02:33:26

解决方案3
2 2019-03-29 05:36:20

解决方案4
0 2019-03-29 02:29:33