如何根據 pandas 中的條件使用 to_csv 后僅使用一次標頭？

Question

我正在嘗試根據條件從另一個 txt 文件制作新的 txt 文件。 兩個 txt 文件具有相同的標題。 但是在使用“to_csv”之后，我在 output 中看到我們有超過 1 個 header。 我只需要一次 header。

代碼：

import pandas as pd

import glob 

big_files = glob.glob('*.txt')

for small_file in big_files:
    
    df = pd.read_csv(small_file, sep= '\t')
    
    df[df['grade'].isin(['Good']) & df['area'].str.contains('Texas')].to_csv('out.txt',sep= '\t',index=False, mode = 'a')
    print('ok')

Output：

grade   area
Good    Texas
Good    Texas
Good    Texas
grade   area
Good    Texas
Good    Texas
Good    Texas

預期 Output：

grade   area
Good    Texas
Good    Texas
Good    Texas
Good    Texas
Good    Texas
Good    Texas

Answer 1

您可以將header參數用於to_csv方法：

import pandas as pd
import glob 

big_files = glob.glob('*.txt')

header = True
for small_file in big_files:
    df = pd.read_csv(small_file, sep= '\t')
    
    (df[df['grade'].isin(['Good']) & df['area'].str.contains('Texas')]
          .to_csv('out.txt', sep= '\t', 
                  index=False, mode = 'a', 
                  header=header))
    header = False
    print('ok')

Answer 2

解決這個問題的另一種方法是連接單獨的數據幀並且只寫出一次：

import pandas as pd

import glob 

big_files = glob.glob('*.txt')

dfs = [pd.read_csv(file, sep= '\t') for file in big_files]

df = pd.concat(dfs)
    
df[df['grade'].isin(['Good']) & df['area'].str.contains('Texas')].to_csv('out.txt',sep= '\t',index=False)
print('ok')

如何根據 pandas 中的條件使用 to_csv 后僅使用一次標頭？

問題描述

2 個解決方案

解決方案1
1 2020-06-23 14:37:53

解決方案2
0 2020-06-23 14:41:48

如何根據 pandas 中的條件使用 to_csv 后僅使用一次標頭？

問題描述

2 個解決方案

解決方案1 1 2020-06-23 14:37:53

解決方案2 0 2020-06-23 14:41:48

解決方案1
1 2020-06-23 14:37:53

解決方案2
0 2020-06-23 14:41:48