简体   繁体   English

Python:使用其他列将Pandas中的新列的值分配为列表

[英]Python: Assign value to a new column in Pandas as list using other columns

I have below pandas dataframe: 我有以下pandas数据帧:

Name1   Name2   Score1   Score2   
Bruce   Jacob    3        4
Aida    Stephan  0        1 

I want to create a new column in the dataframe "list_score" which is a list of score 1 and 2 我想在数据框“list_score”中创建一个新列,它是得分1和2的列表

Expected result: 预期结果:

Name1   Name2   Score1   Score2  list_score 
Bruce   Jacob    3        4        [3,4]
Aida    Stephan  0        1        [0,1]

Use zip with convert tuples to lists: 使用包含转换元组的zip到列表:

df['list_score'] = [list(x) for x in zip(df['Score1'], df['Score2'])]

Or: 要么:

df['list_score'] = list(map(list, zip(df['Score1'], df['Score2'])))
print (df)
   Name1    Name2  Score1  Score2 list_score
0  Bruce    Jacob       3       4     [3, 4]
1   Aida  Stephan       0       1     [0, 1]

Performance: 性能:

df = pd.concat([df] * 1000, ignore_index=True)

In [105]: %timeit df['list_score'] = [list(x) for x in zip(df['Score1'], df['Score2'])]
851 µs ± 36.1 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

In [106]: %timeit df['list_score'] = list(map(list, zip(df['Score1'], df['Score2'])))
745 µs ± 35.1 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

In [107]: %timeit df['list_score'] = df[['Score1', 'Score2']].apply(tuple, axis=1).apply(list)
35.5 ms ± 295 µs per loop (mean ± std. dev. of 7 runs, 1 loop each)

In [108]: %timeit df['list_score'] = df[['Score1', 'Score2']].values.tolist()
949 µs ± 105 µs per loop (mean ± std. dev. of 7 runs, 1000 loops each)

plot1

This was the setup used to generate the perfplot above: 这是用于生成上面的perfplot的设置:

def list_comp(df):
    df['list_score'] = [list(x) for x in zip(df['Score1'], df['Score2'])]
    return df

def map_list(df):
    df['list_score'] = list(map(list, zip(df['Score1'], df['Score2'])))
    return df

def apply(df):
    df['list_score'] = df[['Score1', 'Score2']].apply(tuple, axis=1).apply(list)
    return df

def values(df):
    df['list_score'] = df[['Score1', 'Score2']].values.tolist()
    return df


def make_df(n):
    df = pd.DataFrame(np.random.randint(10, size=(n, 2)), columns=['Score1','Score2'])
    return df

perfplot.show(
    setup=make_df,
    kernels=[list_comp, map_list, apply, values],
    n_range=[2**k for k in range(2, 15)],
    logx=True,
    logy=True,
    equality_check=False,  # rows may appear in different order
    xlabel='len(df)')

One way is to use pd.DataFrame.apply to convert to tuple and then list . 一种方法是使用pd.DataFrame.apply转换为tuple然后list If tuple is sufficient, the second part may be omitted. 如果tuple足够,则可以省略第二部分。

df['list_score'] = df[['Score1', 'Score2']].apply(tuple, axis=1).apply(list)

print(df)

   Name1    Name2  Score1  Score2 list_score
0  Bruce    Jacob       3       4     [3, 4]
1   Aida  Stephan       0       1     [0, 1]
df['list_score'] = df[['score1', 'score2']].values.tolist()

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 将值分配给新列[Python pandas] - assign value to new column [Python pandas] 在python中使用pandas根据其他列中给出的值选择列 - Selecting columns based on value given in other column using pandas in python 熊猫-python-使用列将值添加到新列 - pandas - python - using columns to add value to new column 根据同一pandas数据框中的其他列为列分配值 - Assign value to a column based of other columns from the same pandas dataframe 将列值拆分为 2 个新列 - Python Pandas - Splitting column value into 2 new columns - Python Pandas 根据多个条件将现有列的值分配给 Pandas 中的新列 - Assign value of existing column to new columns in pandas based on multiple conditions 使用 python pandas 将行值转换为列以将日期字段分配给每个新列 - Convert row values to columns to assign date field to each new column using python pandas Pandas:如何使用多个条件分配列值,包括比较两列列表对象? - Pandas: How do I assign a column value using multiple conditions, including comparing two columns of list objects? Python Pandas:从value不为null的其他列中创建新列 - Python Pandas: Create new column out of other columns where value is not null 根据其他 pandas 列中列表中的值数创建新列? - Create new columns based on number of values in list in other pandas column?
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM