使用熊貓讀取以行作為列名的文本文件

Question

我正在一個項目中讀取由用戶生成的可變長度的文本文件。 文本文件的開頭有幾個注釋，其中之一需要用作列名。 我知道可以使用genfromtxt（）來做到這一點，但是我必須使用pandas。 這是一個示例文本文件的開頭：

#GeneratedFile
#This file will be generated by a user
#a b c d f g h i j k l m n p q r s t v w x y z
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23

我需要＃a，b，c，...作為列名。 我嘗試了以下代碼行以讀取數據並將其更改為數組，但是它僅返回行，而忽略了列名。

import pandas as pd    
data = pd.read_table('example.txt',header=2)    
d = pd.DataFrame.as_matrix(data)

有沒有不使用genfromtxt（）的方法？

Answer 1

一種方法是嘗試以下操作：

df = pd.read_csv('example.txt', sep='\s+', engine='python', header=2)

# the first column name become #a so, replacing the column name
df.rename(columns={'#a':'a'}, inplace=True)

# alternatively, other way is to replace # from all the column names
#df.columns = [column_name.replace('#', '') for column_name in df.columns]
print(df)

結果：

   a  b  c  d  f  g  h  i  j   k ...   p   q   r   s   t   v   w   x   y   z
0  0  1  2  3  4  5  6  7  8   9 ...  13  14  15  16  17  18  19  20  21  22
1  1  2  3  4  5  6  7  8  9  10 ...  14  15  16  17  18  19  20  21  22  23

[2 rows x 23 columns]

使用熊貓讀取以行作為列名的文本文件

問題描述

1 個解決方案

解決方案1
0 已采納 2017-09-24 02:01:40

使用熊貓讀取以行作為列名的文本文件

問題描述

1 個解決方案

解決方案1 0 已采納 2017-09-24 02:01:40

解決方案1
0 已采納 2017-09-24 02:01:40