Pandas dropping columns and rows from a dataframe that came from Excel

Question

I am trying to drop some unneeded columns from a DataFrame, but I get the error "too many indices for array".

Here is my code:

import pandas as pd
def answer_one():
    energy = pd.read_excel("Energy Indicators.xls")
    energy.drop(energy.index[0,1], axis = 1)
answer_one()

Answer 1

Option 1
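Two things are wrong. To pick several positions you have to pass a list, so it is [[0, 1]] rather than [0, 1]. And since you are dropping columns (axis=1), you should be indexing energy.columns, not energy.index.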

import pandas as pd

energy = pd.read_excel("Energy Indicators.xls")
# drop returns a new DataFrame, so assign the result back
energy = energy.drop(energy.columns[[0, 1]], axis=1)
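
To see the difference without the spreadsheet, here is a small sketch on a made-up DataFrame. The column names and values are placeholders I invented for illustration, not the contents of "Energy Indicators.xls".

```python
import pandas as pd

# Hypothetical stand-in for the spreadsheet: two junk columns, then real data.
energy = pd.DataFrame({
    "junk_a": [None, None, None],
    "junk_b": [None, None, None],
    "Country": ["Canada", "Japan", "Brazil"],
    "Energy Supply": [10.0, 20.0, 30.0],
})

# energy.columns[0, 1] passes a tuple, which is treated as a 2-D lookup
# on a 1-D Index and fails.
try:
    energy.columns[0, 1]
    tuple_index_failed = False
except IndexError:
    tuple_index_failed = True

# A list of positions selects several labels at once.
to_drop = energy.columns[[0, 1]]

# drop returns a new DataFrame; the original is left unchanged.
trimmed = energy.drop(to_drop, axis=1)
print(list(trimmed.columns))
```

The same tuple-versus-list distinction applies to energy.index if you ever need to drop rows by position.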

Option 2
I'd select everything from the third column onward by position instead:

import pandas as pd

energy = pd.read_excel("Energy Indicators.xls")
energy = energy.iloc[:, 2:]  # keep every column from the third onward
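
As a quick check that positional slicing and dropping the first two columns give the same result, here is a sketch on a made-up DataFrame (the names and values are placeholders, not the real file).

```python
import pandas as pd

# Hypothetical data standing in for the Excel sheet.
energy = pd.DataFrame({
    "junk_a": [0, 0, 0],
    "junk_b": [0, 0, 0],
    "Country": ["Canada", "Japan", "Brazil"],
    "Energy Supply": [10.0, 20.0, 30.0],
})

# Keep all rows (:) and every column from position 2 onward (2:).
by_position = energy.iloc[:, 2:]

# Drop the first two columns by looking up their labels.
by_drop = energy.drop(energy.columns[[0, 1]], axis=1)

# Both approaches should yield identical frames.
same = by_position.equals(by_drop)
print(same)
```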

Answer 2

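I think it is better to skip the unneeded columns when parsing/reading the Excel file. Older pandas releases called this argument parse_cols; it has since been replaced by usecols. Some pandas versions may reject a range that extends past the sheet's last column, in which case end the range at the last data column instead of ZZ:

energy = pd.read_excel("Energy Indicators.xls", usecols='C:ZZ')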

Answer 3

If you're trying to drop columns, you need to change the syntax. You can refer to columns either by header name or by position. Here is how you would refer to them by name:

import pandas as pd

energy = pd.read_excel("Energy Indicators.xls")
# Replace the placeholder names with the actual column headers.
energy.drop(['first_column', 'second_column'], axis=1, inplace=True)
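
A few variations on dropping by label, sketched on a made-up DataFrame. The column names are the same placeholders as above and the values are invented.

```python
import pandas as pd

# Hypothetical data; column names are placeholders.
energy = pd.DataFrame({
    "first_column": [1, 2, 3],
    "second_column": [4, 5, 6],
    "Country": ["Canada", "Japan", "Brazil"],
})

# columns= names the axis explicitly, so axis=1 is not needed.
without_cols = energy.drop(columns=["first_column", "second_column"])

# errors="ignore" skips labels that are not present instead of raising KeyError.
tolerant = energy.drop(columns=["first_column", "no_such_column"], errors="ignore")

# index= drops rows by label; here the default integer labels 0 and 1.
without_rows = energy.drop(index=[0, 1])

print(list(without_cols.columns), list(tolerant.columns), list(without_rows.index))
```

None of these calls use inplace=True, so energy itself is left untouched and each result is a new DataFrame.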

Another solution would be to exclude them in the first place:

# "C:F" is an Excel column range; change "F" to the sheet's last data column.
energy = pd.read_excel("Energy Indicators.xls", usecols="C:F")

This will help speed up the import as well.

3 answers

solution1
2 ACCEPTED 2017-09-15 19:51:36

solution2
1 2017-09-15 19:56:51

solution3
0 2017-09-15 19:57:30