在循環熊貓中串聯數據框的列

Question

我有兩列的csv文件數據集：波長和吸光度值。

我想在一個循環中進行一些統計分析，該循環包含一組文件，例如具有標准偏差的平均吸光度文件等。

myfiles = sorted(glob.glob('blanks/Day01/Batch02/*.csv'))
mypath = 'blanks/Day01/Batch02/'
files         = [f for f in listdir(mypath) if isfile(join(mypath, f))]   # all files in a folder

for m in range(len(files)):
    df = pd.read_csv(mypath + files[m], skiprows=1, delimiter= ',')
    wl = df['Wavelength (nm)']
    A  = df['Abs']

我的問題是如何將每個文件中的“ Abs”列“連接”，然后對其進行操作以創建一個單一的，均值/中值的列？

Answer 1

首先創建所有DataFrame的列表-在read_csv按參數usecols過濾列，也可以省略delimiter=','因為默認參數為：

dfs = []
for m in range(len(files)):
    df = pd.read_csv(mypath + files[m], 
                     skiprows=1, 
                     usecols = ['Wavelength (nm)', 'Abs']) # usecols = ['Abs'] for filter Abs
    dfs.append(df)

替代方案：

dfs = [pd.read_csv(mypath + files[m], skiprows=1, usecols = ['Wavelength (nm)', 'Abs']) for m in range(len(files))]

而在去年concat在一起：

df = pd.concat(dfs, ignore_index=True)

在循環熊貓中串聯數據框的列

問題描述

1 個解決方案

解決方案1
2 已采納 2018-05-14 14:13:59

在循環熊貓中串聯數據框的列

問題描述

1 個解決方案

解決方案1 2 已采納 2018-05-14 14:13:59

解決方案1
2 已采納 2018-05-14 14:13:59