Counting the corresponding value for each distinct elements iteratively

Question

Suppose I have the following dataframe:

d = {'col1':['a','b','c','a','a','b','c','c','c','c'],'col2':[0,1,1,0,1,1,1,1,0,1]}
df = pd.DataFrame(d)

for each distinct value in col1 I wish to calculate the percentage, count and length to do this I made a new dataframe and do the operations I mentioned:

df1 = df[df['col1'].isin(["c"])]

Find the percentage of 0/1 for c :

df1['col2'].value_counts(normalize=True)*100

Find the count of 0/1 for c :

df1['col2'].value_counts()

and the length for c :

len(df1)

I wonder how can I do this iteratively for all distinct values a , b and c and make a new dataframe to show all the results instead of each time making a new dataframe as I did in df1 ? I know what I am doing now is not the best way to approach this.

Answer 1

Do a groupby:

grouped = df.groupby(['col1'])['col2']

# percentage
grouped.value_counts(normalize=True)

# counts
grouped.value_counts()

# total count
grouped.size()

Answer 2

Try with crosstab

out = pd.crosstab(df['col1'], df['col2'], normalize='index')*100
Out[89]: 
col2          0           1
col1                       
a     66.666667   33.333333
b      0.000000  100.000000
c     20.000000   80.000000

Counting the corresponding value for each distinct elements iteratively

Question

2 answers

solution1
2 2020-11-16 18:59:41

solution2
1 ACCPTED 2020-11-16 19:00:08

Counting the corresponding value for each distinct elements iteratively

Question

2 answers

solution1 2 2020-11-16 18:59:41

solution2 1 ACCPTED 2020-11-16 19:00:08

solution1
2 2020-11-16 18:59:41

solution2
1 ACCPTED 2020-11-16 19:00:08