Pandas DataFrame: group by column

Question

I have a DataFrame df which I need to group by the Department Name column.

Input

Employee Name	Department Name	Subjects	Billable	Hours	Date
Anu	CS	Java	Yes	8	01-03-2021
Anu	CS	Python	Yes	9	02-03-2021
Anu	CS	SQL	No	6	03-03-2021
Anu	CS	React	Yes	5	03-03-2021
Anu	CS	.Net	No	8	04-03-2021
Bala	CS	SQL	No	5	01-03-2021
Bala	CS	Python	Yes	4	01-03-2021
Bala	CS	Java	Yes	2	02-03-2021
Bala	CS	.Net	No	8	03-03-2021
Bala	CS	React	Yes	7	04-03-2021
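
To make the question reproducible, the input table can be built directly in pandas. This is only a sketch that copies the rows shown above; the dates are kept as plain strings because the post does not say whether they are day-first or month-first.

```python
import pandas as pd

# Rows copied from the input table in the question.
rows = [
    ("Anu", "CS", "Java", "Yes", 8, "01-03-2021"),
    ("Anu", "CS", "Python", "Yes", 9, "02-03-2021"),
    ("Anu", "CS", "SQL", "No", 6, "03-03-2021"),
    ("Anu", "CS", "React", "Yes", 5, "03-03-2021"),
    ("Anu", "CS", ".Net", "No", 8, "04-03-2021"),
    ("Bala", "CS", "SQL", "No", 5, "01-03-2021"),
    ("Bala", "CS", "Python", "Yes", 4, "01-03-2021"),
    ("Bala", "CS", "Java", "Yes", 2, "02-03-2021"),
    ("Bala", "CS", ".Net", "No", 8, "03-03-2021"),
    ("Bala", "CS", "React", "Yes", 7, "04-03-2021"),
]
columns = ["Employee Name", "Department Name", "Subjects", "Billable", "Hours", "Date"]

# Dates stay as strings; parsing them would require assuming a date format.
df = pd.DataFrame(rows, columns=columns)
print(df)
```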

Code

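import numpy as np
import pandas as pd

# Pivot the hours so that each subject becomes a column
df = pd.pivot_table(df, index=['Department Name', 'Employee Name', 'Billable'], columns=['Subjects'], values='Hours', aggfunc={'Hours': np.sum})

# Resetting index
df = df.reset_index()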
list_column = df.columns

# Adding new columns and calculation
total = df.sum(axis=1)
df.insert(len(df.columns), column='Total', value=total)

available_col = len(df.columns)
Utilization_col = len(df.columns)
utilization_row = len(df.columns)

# Adding Available column
available = 168
df.insert(len(df.columns), column='Available', value=available)

# Adding Utilization column
utilization = (total / available)
df.insert(len(df.columns), column='Utilization', value=utilization)

# Filter dataframe using groupby
df1 = df.groupby(['Department Name','Employee Name'], sort=False ).sum(min_count=1)
df1['Available'] = available

# Adding Billable Utilization column and Non-billable Utilization column
df['Billable'] = np.where(df['Billable'] == 'Billable', 'Billable Utilization','Non Billable Utilization')

df2 = (df.groupby(['Employee Name', 'Billable Status'])[list_column].sum().sum(axis=1).unstack().div(available).mul(100)).round(2)

df = df1.join(df2).reset_index()
df.index = df.index

# Round the column value
df['Total'] = df['Total'].round(2)

df = df.groupby(['Department Name','Employee Name'], as_index=False).sum(min_count=1)

My Output

Expected Output

Note:

I tried to use reset_index, but then the groupby function does not work.
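
One thing that stands out in the code above: it compares Billable to the string 'Billable', while the input holds Yes/No, and it groups by a 'Billable Status' column that the input does not contain. The following is a minimal sketch of how the per-employee totals and utilization percentages could be computed directly from the raw rows. It is not necessarily the expected output, which the post does not show. The names status, utilization, totals and summary are hypothetical, and the 'Billable Status' column is created here for illustration.

```python
import pandas as pd

# Sample rows copied from the question's input table (Date omitted, it is not used here).
df = pd.DataFrame({
    "Employee Name": ["Anu"] * 5 + ["Bala"] * 5,
    "Department Name": ["CS"] * 10,
    "Subjects": ["Java", "Python", "SQL", "React", ".Net",
                 "SQL", "Python", "Java", ".Net", "React"],
    "Billable": ["Yes", "Yes", "No", "Yes", "No",
                 "No", "Yes", "Yes", "No", "Yes"],
    "Hours": [8, 9, 6, 5, 8, 5, 4, 2, 8, 7],
})
available = 168  # same constant as in the question

# Map the Yes/No flag to the labels used in the question's code.
status = df["Billable"].map({"Yes": "Billable Utilization",
                             "No": "Non Billable Utilization"})

# Hours per employee and status, as a percentage of the available hours.
utilization = (
    df.assign(**{"Billable Status": status})  # assumed helper column
      .groupby(["Department Name", "Employee Name", "Billable Status"], sort=False)["Hours"]
      .sum()
      .unstack("Billable Status")
      .div(available)
      .mul(100)
      .round(2)
)

# Total hours per employee, joined with the utilization columns.
totals = (
    df.groupby(["Department Name", "Employee Name"], sort=False)["Hours"]
      .sum()
      .rename("Total")
)
summary = totals.to_frame().join(utilization).reset_index()
print(summary)
```

Grouping on the raw rows before any reshaping keeps 'Department Name' and 'Employee Name' available as ordinary columns after reset_index, so a later groupby on them still works.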

Answer 1

I wrote the following function and was able to get your desired output.

def func(x):
    # Keep the department name on the first row of each group and blank out the rest
    x = x.copy()
    x.iloc[1:, x.columns.get_loc('Department Name')] = ''
    return x

df['Department Name'] = df['Department Name'].astype(str)
df = df.groupby('Department Name').apply(func).set_index('Department Name')
df.head()
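
The same effect (department name shown only on the first row of each group) can probably also be reached without groupby.apply, by blanking every repeated value with Series.duplicated and Series.mask. This is a sketch under the assumption that the frame already holds one row per employee; the summary and display frames below are hypothetical, with totals taken from the hours in the question.

```python
import pandas as pd

# Hypothetical per-employee summary; totals are the summed hours from the question.
summary = pd.DataFrame({
    "Department Name": ["CS", "CS"],
    "Employee Name": ["Anu", "Bala"],
    "Total": [36, 26],
})

display = summary.copy()
dept = display["Department Name"].astype(str)

# duplicated() is False for the first occurrence of each department and True for
# the repeats; mask() replaces the True positions with an empty string.
display["Department Name"] = dept.mask(dept.duplicated(), "")
display = display.set_index("Department Name")
print(display)
```

Because this does not pass the grouping column through groupby.apply, it avoids depending on how a given pandas version treats grouping columns inside apply.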

Proof

Answer 1 posted: 2021-12-07 10:50:23
