简体繁体中英

Identifying multiple columns by name in Pandas

原文 2014-04-11 14:22:06 6 2 python/ pandas

Is there a way to select a subset of columns using text matching or regular expressions?

In R it would be like this:

attach(iris) #Load the 'Stairway to Heaven' of R's built-in data sets
iris[grep(names(iris),pattern="Length")] #Prints only columns containing the word "Length"

2 answers

You can use the filter method for this (use axis=1 to filter on the column names). This function has different possibilities:

Equivalent to if 'Length' in col :
```
 df.filter(like='Length', axis=1) 
```
Using a regex (however, it is using re.search and not re.match , so you have possibly to adjust the regex):
```
 df.filter(regex=r'\\.Length$', axis=1) 
```

Using Python's in statement, it would work like this:

#Assuming iris is already loaded as a df called 'iris' and has a proper header
iris = iris[[col for col in iris.columns if 'Length' in col]]
print iris.head()

Or, using regular expressions,

import re
iris = iris[[col for col in iris.columns if re.match(r'\.Length$',col)]]
print iris.head()

The first will run faster but the second will be more accurate.

Multiple columns with the same name in Pandas

pandas - multiple columns to “column name - value” columns

Is there any function in Pandas to unstack column values into separate columns with multiple identifying columns?

Pandas merge by name and date (multiple columns)

Drop columns by name that appear in multiple pandas dataframes

Unpivot multiple columns with same name in pandas dataframe

Force Pandas to keep multiple columns with the same name

How to groupby multiple columns in pandas based on name?

Pandas— aggregating multiple columns with the same name?

Identifying statistical outliers with pandas: groupby and individual columns

暂无

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Multiple columns with the same name in Pandas pandas - multiple columns to “column name - value” columns Is there any function in Pandas to unstack column values into separate columns with multiple identifying columns? Pandas merge by name and date (multiple columns) Drop columns by name that appear in multiple pandas dataframes Unpivot multiple columns with same name in pandas dataframe Force Pandas to keep multiple columns with the same name How to groupby multiple columns in pandas based on name? Pandas— aggregating multiple columns with the same name? Identifying statistical outliers with pandas: groupby and individual columns

Related Tags

粤ICP备18138465号 © 2020-2024 STACKOOM.COM