如何跳過xls末尾的pandas數據幀中的行

Question

我有一個數據幀：

                                                        Energy Supply Energy Supply per Capita  % Renewable
    Country                                                                                                
    Afghanistan                                          3.210000e+08                       10    78.669280
    Albania                                              1.020000e+08                       35   100.000000
    British Virgin Islands                               2.000000e+06                       85     0.000000
    ...      
    Aruba                                                1.200000e+07                      120    14.870690                                                     ...                      
    Excludes the overseas territories.                            NaN                      NaN          NaN
    Data exclude Hong Kong and Macao Special Admini...            NaN                      NaN          NaN
    Data on kerosene-type jet fuel include aviation...            NaN                      NaN          NaN
    For confidentiality reasons, data on coal and c...            NaN                      NaN          NaN
    Data exclude Greenland and the Danish Faroes.                 NaN                      NaN          NaN

我曾經使用df = pd.read_excel(filelink, skiprows=16)在文件的最開頭刪除不需要的信息，但是如何在df結束時刪除“noize”信息呢？

我試圖將一個列表傳遞給skiprows，但它搞砸了結果。

Answer 1

看來你需要的參數skip_footer = 5在read_excel ：

skip_footer ：int，默認值為0

最后的行跳過（0索引）

樣品：

df = pd.read_excel('myfile.xlsx', skip_footer = 5)
print (df)
                  Country  Energy Supply  Energy Supply per Capita  \
0             Afghanistan    321000000.0                        10   
1                 Albania    102000000.0                        35   
2  British Virgin Islands      2000000.0                        85   
3                   Aruba     12000000.0                       120   

   % Renewable   
0      78.66928  
1     100.00000  
2       0.00000  
3      14.87069

另一種解決方案是使用dropna刪除某些列中所有NaN所有行：

df = pd.read_excel('myfile.xlsx')

cols = ['Energy Supply','Energy Supply per Capita','% Renewable']
df = df.dropna(subset=cols, how='all')
print (df)
                  Country  Energy Supply  Energy Supply per Capita  \
0             Afghanistan    321000000.0                      10.0   
1                 Albania    102000000.0                      35.0   
2  British Virgin Islands      2000000.0                      85.0   
3                   Aruba     12000000.0                     120.0   

   % Renewable  
0     78.66928  
1    100.00000  
2      0.00000  
3     14.87069

如何跳過xls末尾的pandas數據幀中的行

問題描述

1 個解決方案

解決方案1
3 已采納 2017-05-11 06:56:09

如何跳過xls末尾的pandas數據幀中的行

問題描述

1 個解決方案

解決方案1 3 已采納 2017-05-11 06:56:09

解決方案1
3 已采納 2017-05-11 06:56:09