从另一列创建列表列并仅显示 pandas dataframe 中的唯一值

Question

I am new to pandas, I am trying to use group by and create a list of in a new column.我是 pandas 的新手，我正在尝试使用 group by 并在新列中创建列表。 I have 3 columns in my Dataframe and I created a 4th column(New_List) to create a list from another column like below: using the below code:我的 Dataframe 中有 3 列，我创建了第 4 列（New_List）以从另一列创建列表，如下所示：使用以下代码：

new_df = df.join(pd.Series(df.groupby(by='NO_ACCOUNTS').apply(lambda x: list(x.Bucket)), name="list_of_b"), on='NO_ACCOUNTS') new_df = df.join(pd.Series(df.groupby(by='NO_ACCOUNTS').apply(lambda x: list(x.Bucket)), name="list_of_b"), on='NO_ACCOUNTS')

Account_Number   Bucket  Number_Transactions     New_List
   ABA            APP          155                 [APP]
   ABC            APP          1352                [APP]
   AAA            APP          90                  [API,APP]
   AAA            API          5                   [API,APP]

I am looking to get the desired output with 3 columns:我正在寻找具有 3 列的所需 output：

Account_Number     Number_Transactions     New_List
   ABA                      155                 [APP]
   ABC                      1352                [APP]
   AAA                      95                  [API,APP]

Answer 1

You can agg regate the two columns:您可以agg这两列：

out = (df.groupby("Account_Number", sort=False, as_index=False)
         .agg(Number_Transactions=("Number_Transactions", "sum"),
              New_List=("Bucket", list)))

which first groups by Account_Number while keeping their order with sort=False and not making it index with as_index=False , and then aggregates the Number_Transactions column with summation and appoints it to the same name columns and similarly, aggs the Bucket column with list and assigns it to New_List column in the output,首先按Account_Number分组，同时使用sort=False保持其顺序，而不是使用as_index=False使其索引，然后将Number_Transactions列与 summation 聚合并将其指定给相同名称的列，同样，将Bucket列与list聚合并分配它到New_List中的 New_List 列，

to get要得到

>>> out

  Account_Number  Number_Transactions    New_List
0            ABA                  155       [APP]
1            ABC                 1352       [APP]
2            AAA                   95  [APP, API]

从另一列创建列表列并仅显示 pandas dataframe 中的唯一值

问题描述

1 个解决方案

解决方案1
0 已采纳 2021-06-10 08:23:00

从另一列创建列表列并仅显示 pandas dataframe 中的唯一值

问题描述

1 个解决方案

解决方案1 0 已采纳 2021-06-10 08:23:00

解决方案1
0 已采纳 2021-06-10 08:23:00