如何将结果从一个变量传输到excel中的列？

Question

I want to add the values inside duplicates to column Name so that print(data["Name"]) can return all the values contained by the duplicates . 我想将duplicates内的值添加到列Name以便print(data["Name"])可以返回duplicates包含的所有值。 How can I achieve this? 我怎样才能做到这一点？

Quick story: I'm importing a csv file and I need to split the column Name to get rid of meaningless information and then I'm using list comprehension to find the duplicates. 快速故事：我正在导入一个csv文件，我需要拆分列Name以删除无意义的信息，然后我使用列表Name来查找重复项。

data = pd.read_csv(next(iglob('*.csv')))
data["Name"]= data["Name"].str.split("(", n = 1, expand = True) 
duplicates = [x for x in data["Name"]  if x in data["Name"] 
[data["Name"].duplicated()].values]

Answer 1

Edit: 编辑：

df['dupicates'] = df['Name'].where(df['Name'].duplicated(keep=False), '')

    Name duplicates
0  NameC           
1  NameA      NameA
2  NameB      NameB
3  NameA      NameA
4  NameA      NameA
5  NameB      NameB

Or if you only want to label those duplicate values...(remove keep=False ) 或者，如果您只想标记这些重复值...（remove keep=False ）

df['duplicates'] = df['Name'].where(df['Name'].duplicated(), '')

    Name duplicates
0  NameC           
1  NameA           
2  NameB           
3  NameA      NameA
4  NameA      NameA
5  NameB      NameB

IIUC, you can try something like this: IIUC，您可以尝试这样的事情：

df = pd.DataFrame({'Name':['NameC', 'NameA', 'NameB', 'NameA', 'NameA', 'NameB']})
duplicates = df.loc[df['Name'].duplicated(), 'Name'].unique().tolist()
duplicates

Output: 输出：

['NameA', 'NameB']

Explanation: Use duplicates to create a boolean series, then filter the dataframe by the boolean series and column 'Name' then use unique to get the unique values of all the duplicates. 说明：使用duplicates创建布尔系列，然后通过布尔系列和“名称”列过滤数据框，然后使用唯一来获取所有重复项的唯一值。

如何将结果从一个变量传输到excel中的列？

问题描述

1 个解决方案

解决方案1
0 2019-06-05 13:10:05

如何将结果从一个变量传输到excel中的列？

问题描述

1 个解决方案

解决方案1 0 2019-06-05 13:10:05

解决方案1
0 2019-06-05 13:10:05