简体   繁体   中英

Is there a more efficient way to apply a mapping to a pandas series?

I have a column in my pandas DataFrame called "State". It contains US state abbreviations.I have hard coded regions and I want to create a new column with the region for each state.

I used pd.Series.apply(), but I am wondering if there is a better practice for this type of mapping. Any suggestions on how I could improve my code?

This is my current code that works, but I'm just open for suggestions on best practices.

def get_region(s, *regions):
    if s in regions[0]:
        return 'west'
    elif s in regions[1]:
        return 'midwest'
    elif s in regions[2]:
        return 'south'
    elif s in regions[3]:
        return 'northeast'
    else:
        return None

west = ['WA','OR','CA','ID','NV','MT','WY','UT','AZ','CO','NM']
midwest = ['ND','MN','WI','MI','SD','NE','KS','IA','MO','IL','IN','OH']
south = ['TX','OK','AR','LA','MS','TN','KY','AL','GA','FL','SC','NC','VA','WV','MD','DE']
northeast = ['PA','NJ','NY','CT','MA','RI','VT','NH','ME']

regions = [west,midwest,south,northeast]

full_df['Region'] = full_df['State'].apply(get_region, args=regions)
full_df['Region'].head(15)

Out:
0          west
1       midwest
2         south
3         south
4       midwest
5          west
6         south
7         south
8          west
9       midwest
10        south
11    northeast
12    northeast
13         west
14         west
Name: Region, dtype: object

Check with map

s=pd.DataFrame([west,midwest,south,northeast],index=['west','midwest','south','northeast'])
s=s.reset_index().melt('index')
full_df['Region'] = full_df['State'].map(dict(zip(s['value'],s['index'])))

You could try creating a dict and mapping it to the column:

west_dict = {i:"west" for i in west}
midwest_dict = {i:"midwest" for i in midwest}
south_dict = {i:"south" for i in south}
northeast_dict = {i:"northeast" for i in northeast}
d = {**west_dict, **midwest_dict, **south_dict, **northeast_dict}
full_df['Region'] = full_df['State'].map(d)

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM