Pandas groupby slice of a string

Question

I have a dataframe where I want to group by the first part of an ID field. For example, say I have the following:

>>> import pandas as pd
>>> df=pd.DataFrame(data=[['AA',1],['AB',4],['AC',5],['BA',11],['BB',2],['CA',9]], columns=['ID','Value'])
>>> df
   ID  Value
0  AA      1
1  AB      4
2  AC      5
3  BA     11
4  BB      2
5  CA      9
>>>

How can I group by the first letter of the ID field?

I can currently do this by creating a new column and then grouping on that, but I imagine there is a more efficient way:

>>> df['GID']=df['ID'].str[:1]
>>> df.groupby('GID')['Value'].sum()
GID
A    10
B    13
C     9
Name: Value, dtype: int64
>>>

Answer 1

您将需要以某种方式创建分组键，而不必在DataFrame本身上创建，例如：

df.groupby(df.ID.str[:1])['Value'].sum()

Pandas groupby slice of a string

Question

1 answers

solution1
5 ACCPTED 2015-12-30 18:36:12

Pandas groupby slice of a string

Question

1 answers

solution1 5 ACCPTED 2015-12-30 18:36:12

solution1
5 ACCPTED 2015-12-30 18:36:12