How to merge columns with similar names in a pandas dataframe?

Question

I have a dataframe in which the Due Date column header is written in several different ways, although they all mean the same thing. The problem is that in my master data (an xls file) the header sometimes has an extra space and sometimes doesn't, and I can't change that. All I can change is my final output.

Sr no Due Date    Due Date   DueDate
1     1/2/22      
2                  1/5/22    
3
4                         
5                             ASAP

I just want the second and third Due Date columns combined under the first one, with each value kept in the same row it was in:

Sr No.  Due Date
1        1/2/22
2        1/5/22
3        
4
5        ASAP

Answer 1

Try with bfill:

out = df.bfill(axis=1)[['Sr no', 'Due Date']]
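Note that bfill only fills missing values (NaN/None), so blank strings coming out of the spreadsheet would have to be converted first. Here is a minimal sketch of that, assuming the blanks arrive as empty strings and the three headers look like the ones in the question; the sample frame and the `due_cols` list are made up for illustration, and the back-fill is restricted to the due date columns rather than the whole frame:

```python
import numpy as np
import pandas as pd

# Hypothetical sample mirroring the question's table; blanks are empty strings.
df = pd.DataFrame({
    "Sr no": [1, 2, 3, 4, 5],
    "Due Date": ["1/2/22", "", "", "", ""],
    "Due Date ": ["", "1/5/22", "", "", ""],
    "DueDate": ["", "", "", "", "ASAP"],
})

# Assumed header variants; adjust to whatever the real file contains.
due_cols = ["Due Date", "Due Date ", "DueDate"]

# bfill skips only NaN/None, so turn blank or whitespace-only cells into NaN first.
dates = df[due_cols].replace(r"^\s*$", np.nan, regex=True)

# Back-fill across each row so the first column picks up the first non-missing value.
out = df[["Sr no"]].assign(**{"Due Date": dates.bfill(axis=1).iloc[:, 0]})
print(out)
```

Rows where every variant is blank stay as NaN in the result.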

Answer 2

You can use filter with a regex to select the similarly named columns, then bfill across them and take the first column. Finally, join the result back onto the original frame with the matched columns dropped:

d = df.filter(regex=r'(?i)due\s*date')
df2 = (df
 .drop(columns=list(d.columns))
 .join(d.bfill(axis=1).iloc[:,0])
 )

Output:

   Sr no Due Date
0      1   1/2/22
1      2   1/5/22
2      3     None
3      4     None
4      5     ASAP
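To see which headers the regex in this answer would pick up, it can be tried on a few names with plain `re`. This is only a sketch: the list of names is invented for illustration, and it relies on `DataFrame.filter(regex=...)` matching with `re.search` semantics, so the pattern may match anywhere in the column name.

```python
import re

# Same pattern as in the answer: case-insensitive "due", any amount of whitespace, "date".
pattern = re.compile(r"(?i)due\s*date")

# Hypothetical header variants, including two that should not match.
names = ["Sr no", "Due Date", "Due Date ", "DueDate", " due  date", "Date Due"]

matched = [name for name in names if pattern.search(name)]
print(matched)
```

Leading or trailing spaces do not matter because the pattern is not anchored, while `\s*` covers both the missing space in "DueDate" and doubled spaces.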

Answer 3

A possible solution is the following:

import pandas as pd

# set test data
data = {"Sr no": [1,2,3,4,5],
        "Due Date": ["1/2/22", "", "", "", ""], 
        "Due Date ": ["", "1/5/22", "", "", ""],
        " Due Date": ["", "", "", "", "ASAP"]
       }

# create pandas dataframe
df = pd.DataFrame(data)

# clean up column names 
df.columns = [col.strip() for col in df.columns]

# group same-named columns and join the non-missing values in each row
# (transposed, because groupby(axis=1) is deprecated in recent pandas)
df = df.T.groupby(level=0).agg(lambda s: ''.join(str(v) for v in s if str(v) != "nan")).T

# reorder column
df = df[['Sr no', 'Due Date']]

df
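One caveat: `strip()` only handles leading and trailing spaces, so a header such as "DueDate" from the question would still end up in its own group. A rough sketch of one way around that, assuming it is acceptable to compare headers with all whitespace removed and case ignored; `normalize` and `first_value` are hypothetical helpers, not pandas functions:

```python
import pandas as pd

# Hypothetical sample using the header variants shown in the question.
df = pd.DataFrame({
    "Sr no": [1, 2, 3, 4, 5],
    "Due Date": ["1/2/22", "", "", "", ""],
    "Due Date ": ["", "1/5/22", "", "", ""],
    "DueDate": ["", "", "", "", "ASAP"],
})

def normalize(name):
    # Drop all whitespace and lowercase, so "Due Date " and "DueDate" compare equal.
    return "".join(str(name).split()).lower()

def first_value(row):
    # Return the first cell in the row that is neither missing nor blank.
    for value in row:
        if pd.notna(value) and str(value).strip():
            return value
    return ""

# Map each normalized key to the original column names that share it.
groups = {}
for col in df.columns:
    groups.setdefault(normalize(col), []).append(col)

# Keep the first spelling of each header as the output column name.
result = pd.DataFrame(
    {cols[0]: df[cols].apply(first_value, axis=1) for cols in groups.values()}
)
print(result)
```

Unlike the string join above, this keeps `Sr no` numeric and takes only the first non-blank value if a row happens to have more than one.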


3 answers

Answer 1: score 1, 2022-03-17 19:12:54
Answer 2: score 1, accepted, 2022-03-17 19:35:06
Answer 3: score 0, 2022-03-17 20:02:33
