[英]Pythonic way to make row as column index and column as row index
我想变异数据框对象。 我想使第一行作为列索引。 并将第一列作为行索引。
import pandas as pd
wiki = "https://en.wikipedia.org/wiki/List_of_state_and_union_territory_capitals_in_India"
df = pd.read_html(wiki)[1]
df2 = df.copy()
df2.head()
目前,我正在这样做(我失去了行索引名称):
df2.columns = df.iloc[0]
df2.drop(0, inplace=True)
df2.drop('No.', axis=1, inplace=True)
df2.head()
如何以更加Pythonic的方式保存行索引名称?
您可以直接在read_html
指定read_html
的内容, header
指定将哪一行用作列,而index_col
哪一列用作索引:
In [16]: df = pd.read_html(wiki,header=0,index_col=0)[1]
In [17]: df.head()
Out[17]:
State or union territory Administrative capitals Legislative capitals \
No.
1 Andaman and Nicobar Islands Port Blair Port Blair
2 Andhra Pradesh Hyderabad[a] Hyderabad
3 Arunachal Pradesh Itanagar Itanagar
4 Assam Dispur Guwahati
5 Bihar Patna Patna
Judiciary capitals Year capital was established The Former capital
No.
1 Kolkata 1955 Calcutta (1945–1956)
2 Hyderabad 1959 Kurnool (1953-1956)
3 Guwahati 1986 NaN
4 Guwahati 1975 Shillong[b] (1874–1972)
5 Patna 1912 NaN
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.