简体繁体中英

How does one determine which columns to set as an index in a Pandas DataFrame?

原文 2016-12-20 17:08:37 0 2 python/ pandas/ indexing/ dataframe

Let's say I have a DataFrame of financial securities, which often have multiple identifiers:

Should I choose only one column to set as the index? Should I set all potential identifiers as the index? Should I set all text data as an index, and leave all numeric data as columns? What is the best practice?

2 answers

This is more about database design than pandas.

The decision should be based on the business meaning of the dataframe (table in relational database) and its columns. Eg, if 'Internal Security ID' is used to identify this kind of data in its business, then it should be set as the index.

However, if you are not sure, just stick with the default integer index.

I tend to stick with the default index unless you have a need to have one of your columns as an index. If you do, I strongly recommend using a column with unique values. If there exists duplicates, this will cause you a lot of headache.

How to set the index of a pandas Dataframe to that of the length of the Columns?

How to apply to one set of columns in a dataframe with multi-index columns

Pandas: How do I set index on the columns of an existing DataFrame?

Index a pandas dataframe with a value in one of the columns

Pandas convert columns of one dataframe to index in another dataframe

how to replace index of columns and rows in Pandas DataFRame

How to populate Pandas dataframe as function of index and columns

How to get the first index of a pandas DataFrame for which several undefined columns are not null?

How to get the first index of a pandas dataframe for which two columns are both not null?

How to set index values in a MultiIndex pandas DataFrame?

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question How to set the index of a pandas Dataframe to that of the length of the Columns? How to apply to one set of columns in a dataframe with multi-index columns Pandas: How do I set index on the columns of an existing DataFrame? Index a pandas dataframe with a value in one of the columns Pandas convert columns of one dataframe to index in another dataframe how to replace index of columns and rows in Pandas DataFRame How to populate Pandas dataframe as function of index and columns How to get the first index of a pandas DataFrame for which several undefined columns are not null? How to get the first index of a pandas dataframe for which two columns are both not null? How to set index values in a MultiIndex pandas DataFrame?

Related Tags

How does one determine which columns to set as an index in a Pandas DataFrame?

Question

2 answers

solution1
0 2017-03-06 16:38:21

solution2
0 2018-10-02 15:15:01

How does one determine which columns to set as an index in a Pandas DataFrame?

Question

2 answers

solution1 0 2017-03-06 16:38:21

solution2 0 2018-10-02 15:15:01

solution1
0 2017-03-06 16:38:21

solution2
0 2018-10-02 15:15:01