pandas, how to access multiIndex dataframe?

Question

Show my code

>>> df = pd.DataFrame({'key1': ['a', 'a', 'b', 'b', 'a'], \
                   'key2': ['one', 'two', 'one', 'two', 'one'], \
                   'data1': np.random.randn(5), \
                   'data2': np.random.randn(5)})

>>> new_df = df.groupby(['key1', 'key2']).mean().unstack()
>>> print new_df
         data1               data2
key2       one       two       one       two
key1
a    -0.070742 -0.598649 -0.349283 -1.272043
b    -0.109347 -0.097627 -0.641455  1.135560 
>>> print new_df.columns
MultiIndex(levels=[[u'data1', u'data2'], [u'one', u'two']],
       labels=[[0, 0, 1, 1], [0, 1, 0, 1]],
       names=[None, u'key2'])

As you can see, the MultiIndex dataframe is different with normal dataframes, so how to access the data in the MultiIndex dataframe.

Answer 1

Accessing data in multiindex dataframe is similar to the way on a general dataframe. For example, if you want to read data at (a, data1.two), you can simply do: new_df['data1']['two']['a'] or new_df.loc['a', ('data1', 'two')]

Please read the official docs for more details.

Answer 2

This might helps you to know and visualize

unstacked = multi_indexDataFrame.unstack().dropna()
unstacked.plot(kind="bar")

pandas, how to access multiIndex dataframe?

Question

2 answers

solution1
6 ACCPTED 2016-04-23 05:53:36

solution2
0 2021-10-20 04:16:58

pandas, how to access multiIndex dataframe?

Question

2 answers

solution1 6 ACCPTED 2016-04-23 05:53:36

solution2 0 2021-10-20 04:16:58

solution1
6 ACCPTED 2016-04-23 05:53:36

solution2
0 2021-10-20 04:16:58