根据 Pandas Dataframe 中的时间戳列过滤给定的列（计数）

Question

I have a Pandas Dataframe which looks like below我有一个 Pandas Dataframe 如下所示

I want my Output or Visualization Plots which tell:我想要我的 Output 或可视化图告诉：
During which Hour, how many Jobs have failed,completed (count)在哪个小时内，有多少作业失败，完成（计数）

Answer 1

First filter by boolean indexing only rows filled by Failed and then use crosstab with DataFrame.plot.bar :首先按boolean indexing由Failed填充的行，然后使用带有DataFrame.plot.bar的crosstab ：

df1 = df[df['Status'].eq('Failed')]
out = pd.crosstab(df1['Hour'], df1['Job'])

out.plot.bar()

Answer 2

import pandas as pd

df = pd.read_csv('./data.csv')

# status
status = set(df['Status'])
dfStatus = {s: df[df['Status'] == s] for s in status}

# hours
hours = set(df['Hour'])
dfStatusPerHour = {}

# calculate them explicitly
for s in status:
    dfStatusPerHour[s] = {h: dfStatus[s][dfStatus[s]['Hour'] == h].shape[0] for h in hours}

# show results
for s in status:
    print(f"{s} : {dfStatusPerHour[s]}")

根据 Pandas Dataframe 中的时间戳列过滤给定的列（计数）

问题描述

2 个解决方案

解决方案1
1 已采纳 2020-08-17 12:03:47

解决方案2
1 2020-08-17 12:27:16

根据 Pandas Dataframe 中的时间戳列过滤给定的列（计数）

问题描述

2 个解决方案

解决方案1 1 已采纳 2020-08-17 12:03:47

解决方案2 1 2020-08-17 12:27:16

解决方案1
1 已采纳 2020-08-17 12:03:47

解决方案2
1 2020-08-17 12:27:16