
Combine 3 columns into one column in pandas

I have the following code:

import pandas as pd

# Named df rather than input, so the built-in input() is not shadowed
df = pd.DataFrame({
    'Police District Name': ['WHEATON', 'SILVER SPRING', 'BETHESDA', 'GERMANTOWN', 'WHEATON', 'MONTGOMERY VILLAGE'],
    'cn1': ['Crime Against Person', 'Crime Against Person', 'Crime Against Person', 'other', 'other', 'other'],
    'cn2': ['Aggravated Assault', 'bla', 'bla', 'blaa', 'bla', 'one more  bla'],
    'cn3': ['Aggravated Assault', 'bla', 'bla', 'blaa', 'bla', 'one more  bla'],
})
df

Desired output:

output = pd.DataFrame({
    'Police District Name': ['WHEATON', 'SILVER SPRING', 'BETHESDA', 'GERMANTOWN', 'WHEATON', 'MONTGOMERY VILLAGE'],
    'total crime number': [6, 3, 3, 3, 6, 3],
})
output

How can I get this? Thank you!

If every cell in the cn1, cn2 and cn3 columns holds a crime, you can rely on the number of columns. The idea is to count the rows per district with value_counts, multiply that count by the number of cnx columns, and then map the result back onto the dataframe (called df here).

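# Rows per district, times the number of cnx columns (every column except the district name)
counts = df['Police District Name'].value_counts() * (len(df.columns) - 1)
# Look up each row's district in counts to get its total
df['total crime number'] = df['Police District Name'].map(counts)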

print(df[['Police District Name', 'total crime number']])

  Police District Name  total crime number
0              WHEATON                   6
1        SILVER SPRING                   3
2             BETHESDA                   3
3           GERMANTOWN                   3
4              WHEATON                   6
5   MONTGOMERY VILLAGE                   3
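
If some of the cnx cells can be empty, multiplying by the number of columns will overcount. A minimal sketch for that case, assuming an empty cell is stored as None/NaN (the missing value in cn2 below is made up for illustration and is not in the question's data): count the non-null cells in each row, then sum those counts per district.

```python
import pandas as pd

df = pd.DataFrame({
    'Police District Name': ['WHEATON', 'SILVER SPRING', 'BETHESDA',
                             'GERMANTOWN', 'WHEATON', 'MONTGOMERY VILLAGE'],
    'cn1': ['Crime Against Person', 'Crime Against Person', 'Crime Against Person',
            'other', 'other', 'other'],
    # Hypothetical: the second WHEATON row has no crime recorded in cn2
    'cn2': ['Aggravated Assault', 'bla', 'bla', 'blaa', None, 'one more  bla'],
    'cn3': ['Aggravated Assault', 'bla', 'bla', 'blaa', 'bla', 'one more  bla'],
})

crime_cols = ['cn1', 'cn2', 'cn3']

# Number of filled-in crime cells in each row
per_row = df[crime_cols].notna().sum(axis=1)

# Sum the per-row counts within each district and broadcast back to every row
df['total crime number'] = per_row.groupby(df['Police District Name']).transform('sum')

print(df[['Police District Name', 'total crime number']])
```

With this data both WHEATON rows get 5 instead of 6, and the other districts still get 3. Listing the crime columns explicitly also means the result does not change if more columns are added to df later.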

