Counting values in a column. Python pandas DataFrame

Question

1. I want to count how many males and how many females there are in the "sex" column of my Excel file.

I tried sex_value = df.groupby("sex").size(), but some of the values contain a trailing space, e.g. "F " and "F" (likewise "M" and "M ").

If every value were exactly "M" or "F", I would use:

sex_value = df.groupby("sex").size() 

Output:
sex
F           37
F           27
M           40
M           31
dtype: int64

In my case, I thought it should look like this:

sex_value_female = df[(df['sex']=='F') & (df['sex'] == 'F ')].sum()
sex_value_male = df[(df['sex']=='M') & (df['sex'] == 'M ')].sum()

But it does not work.

2. The same problem occurs with the mean values.

#mean value of brainweight and bodyweight for males and females
mean = df.groupby('sex').agg({'bodywt': 'mean', 'brainwt': 'mean'})

Output:
             bodywt     brainwt
sex                            
F         19.696216  410.059459
F         21.262963  440.122222
M         21.669750  410.030000
M         22.870968  433.709677

Answer 1

Use str.strip to get rid of the surrounding whitespace, then group as before:

df.sex = df.sex.str.strip()
sex_value = df.groupby("sex").size()
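As a rough end-to-end sketch of the approach above, the snippet below builds a small made-up DataFrame (the rows and numbers are hypothetical, standing in for the Excel sheet, which is not shown in the question) and applies the same strip-then-group step to both the count and the mean. It also illustrates why the attempt in the question returns nothing: a single cell can never equal both 'F' and 'F ' at once, so the two conditions would have to be combined with | or isin rather than &.

```python
import pandas as pd

# Hypothetical sample data; only the column names come from the question.
df = pd.DataFrame({
    "sex": ["F", "F ", "M", "M ", "F"],
    "bodywt": [20.0, 22.0, 21.0, 23.0, 18.0],
    "brainwt": [400.0, 440.0, 410.0, 430.0, 420.0],
})

# Without cleaning: match either spelling with isin (equivalent to using |).
female_count_raw = int(df["sex"].isin(["F", "F "]).sum())

# Cleaner option: remove leading/trailing whitespace once, then group.
df["sex"] = df["sex"].str.strip()

# Number of rows per sex.
sex_value = df.groupby("sex").size()

# Mean body weight and brain weight per sex.
mean = df.groupby("sex").agg({"bodywt": "mean", "brainwt": "mean"})

print(female_count_raw)
print(sex_value)
print(mean)
```

After stripping, the groupby result should contain only the two groups F and M, so the same cleaned column serves both parts of the question.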
