如何使用Python合并多个CSV文件中的列

Question

may be the answer of this question is available but I could not get proper solution and thus I am looking for the perfect solution. 可能是这个问题的答案可用，但我无法获得适当的解决方案，因此我正在寻找理想的解决方案。 Suppose I have multiple CSV files (around 1500) having single column with some time series data (10,000 times or rows). 假设我有多个CSV文件（大约1500个），其中包含具有一些时间序列数据（10,000次或行）的单列。 The column header name is same in all CSV files. 所有CSV文件中的列标题名称均相同。 Suppose I have CSV files like: 假设我有CSV文件，例如：

aa1.csv      aa2.csv:      aa3.csv:............aa1500.csv:
datavalue   datavalue      datavalue           datavalue
    4            1             1                  2
    2            3             6                  4
    3            3             3                  8                
    4            4             8                  9


I want the output like this:


datavalue,datavalue,datavalue,datavalue,.....datavalue
4,1,1,..2
2,3,6,..4
3,3,3,..8
4,4,8,..9

My codes are not working and giving something else: 我的代码无法正常工作，并给出了其他提示：

import pandas as pd
import csv
import glob
import os
path 'F:/Work/'
files_in_dir = [f for f in os.listdir(path) if f.endswith('csv')]
for filenames in files_in_dir:
    df = pd.read_csv(filenames)
    df.to_csv('out.csv', mode='a')

If someone can help in this? 如果有人可以帮助您？

Answer 1

You can try it the following way with a little help from numpy 您可以在numpy的一些帮助下以以下方式尝试它

import pandas as pd
import numpy as np
import os
path 'F:/Work/'
files_in_dir = [f for f in os.listdir(path) if f.endswith('csv')]
temp_data = []
for filenames in files_in_dir:
    temp_data.append(np.loadtxt(filenames,dtype='str'))

temp_data = np.array(temp_data)
np.savetxt('out.csv',temp_data.transpose(),fmt='%s',delimiter=',')

Answer 2

Use pandas concat function 使用pandas concat函数

import pandas as pd
dfs = []
for filenum in range(1,1501):
    dfs.append( pd.read_csv('aa{}.csv'.format(filenum)) )
print(pd.concat(dfs,axis=1).to_csv(index=False))

Answer 3

One of the ways to achieve this is by creating another CSV file by merging data from existing CSV files (assuming you have the CSV files in the format aa##.csv )... 实现此目的的方法之一是通过合并现有CSV文件中的数据来创建另一个CSV文件（假设您拥有aa##.csv格式的CSV文件）...

contents = []

for filenum in range(2):
    f = open('aa{}.csv'.format(filenum + 1), 'r')
    lines = f.readlines()
    print(lines)
    f.close()

    if contents == []:
        contents = [[] for a in range(len(lines))]

    for row in range(len(lines)):
        contents[row].append(lines[row].rstrip('\n'))
        print(lines[row])

print(contents)
f = open('aa_new.csv', 'w')

for row in range(len(contents)):
    line = str(contents[row])
    line = line.strip('[]')
    f.write(line + '\n')

f.close()

You can then open & display this file as you wish using pandas. 然后，您可以使用熊猫打开并显示此文件。

如何使用Python合并多个CSV文件中的列

问题描述

3 个解决方案

解决方案1
2 2018-06-20 06:52:14

解决方案2
1 2018-06-20 06:56:40

解决方案3
0 2018-06-20 06:33:26

如何使用Python合并多个CSV文件中的列

问题描述

3 个解决方案

解决方案1 2 2018-06-20 06:52:14

解决方案2 1 2018-06-20 06:56:40

解决方案3 0 2018-06-20 06:33:26

解决方案1
2 2018-06-20 06:52:14

解决方案2
1 2018-06-20 06:56:40

解决方案3
0 2018-06-20 06:33:26