在 Python 中將 csv 文件合並為一個（按列）

Question

我有很多這樣的.csv 文件（只有一列）：圖片

我想將它們合並到一個.csv 文件中，這樣每一列都將包含 csv 文件數據之一。 標題應該是這樣的（轉換為電子表格時）：圖片（第一個數字是從文件名中提取的分鍾數，第二個是名稱中“export_”后面的文件名中的第一個單詞，第三個是文件的全名）。

我想在 Python 工作。 你能請人幫我嗎？ 我是 Python 的新手。

非常感謝。

我試圖只加入 2 個文件，但我不知道如何在不手動寫下所有文件的情況下使用更多文件。 另外，我不知道如何從文件名中提取標題：

import pandas as pd 

file_list = ['export_Control 37C 4h_Single Cells_Single Cells_Single Cells.csv', 'export_Control 37C 0 min_Single Cells_Single Cells_Single Cells.csv']
df = pd.DataFrame()
for file in file_list:
    temp_df = pd.read_csv(file)
    df = pd.concat([df, temp_df], axis=1)
    
print(df)


df.to_csv('output2.csv', index=False)

Answer 1

Assuming that your .csv files they all have a header and the same number of rows, you can use the code below to put all the .csv (single-columned) one besides the other in a single Excel worksheet.

import os
import pandas as pd
              
csv_path = r'path_to_the_folder_containing_the_csvs'

csv_files = [file for file in os.listdir(csv_path)]

list_of_dfs=[]
for file in csv_files :
    temp=pd.read_csv(csv_path + '\\' + file, header=0, names=['Header'])
    time_number = pd.DataFrame([[file.split('_')[1].split()[2]]], columns=['Header'])
    file_title = pd.DataFrame([[file.split('_')[1].split()[0]]], columns=['Header'])
    file_name = pd.DataFrame([[file]], columns=['Header'])
    out = pd.concat([time_number, file_title, file_name, temp]).reset_index(drop=True)
    list_of_dfs.append(out)

final= pd.concat(list_of_dfs, axis=1, ignore_index=True)
final.columns = ['Column' + str(col+1) for col in final.columns]
final.to_csv(csv_path + '\output.csv', index=False)
final

例如，考慮三個.csv文件，運行上面的代碼會產生：

在 Python 中將 csv 文件合並為一個（按列）

問題描述

1 個解決方案

解決方案1
0 2022-09-09 16:38:38

`>>> Output (in Jupyter)`

`>>> Output (in Excel)`

在 Python 中將 csv 文件合並為一個（按列）

問題描述

1 個解決方案

解決方案1 0 2022-09-09 16:38:38

>>> Output (in Jupyter)

>>> Output (in Excel)

解決方案1
0 2022-09-09 16:38:38

`>>> Output (in Jupyter)`

`>>> Output (in Excel)`