簡體   English   中英

基於時間戳合並數據框中的行

[英]Merge rows in dataframe based on Timestamp

我想根據類似的時間戳合並數據幀的行。

 import pandas as pd 

 df = pd.DataFrame([VEST,False,0.6993550658226013,2019-11-27 18:56:12.616425+05:30],
 [HELMET,True,0.8506404161453247 ,2019-11-27 18:56:12.616425+05:30],
 [HELMET,True,0.5948962569236755 ,2019-11-27 18:56:13.617801+05:30],
 [VEST,False,0.6576083898544312 ,2019-11-27 18:56:14.595118+05:30],
 [HELMET,True,0.8451269865036011 ,2019-11-27 18:56:14.595118+05:30],
 [VEST,True,0.7157155275344849 ,2019-11-27 18:56:15.625841+05:30],
 [HELMET,True,0.80693519115448 ,2019-11-27 18:56:15.625841+05:30],
 [HELMET,True,0.5428823232650757 ,2019-11-27 18:56:41.639505+05:30],
 [VEST,False,0.6302998661994934 ,2019-11-27 18:56:42.582407+05:30],
 [HELMET,True,0.8790003657341003 ,2019-11-27 18:56:42.582407+05:30],
 [VEST,False,0.44062405824661255 ,2019-11-27 18:56:44.590130+05:30],
 [HELMET,True,0.9355553388595581, 2019-11-27 18:56:44.590130+05:30 ],columns = ['Type', 'voilation', 'score', 'timestamp']) 

有沒有辦法合並具有相似類型和時間戳(2-3 秒)的行並根據最高分分配違規類型。

 df.groupby(['Type', 'timestamp'])

Groupby 僅生成 3 幀。 無法弄清楚該怎么做。 任何幫助表示贊賞。

您可以使用pandas.Series.dt.round將時間戳舍入到最接近的三秒,然后分組,

df['rounded_timestamp'] = pd.to_datetime(df['timestamp']).dt.round('3s') 
df1 = df.groupby(['Type', 'rounded_timestamp']).agg({'score': 'max'}).reset_index()

>>>df1
    Type    rounded_timestamp   score
0   HELMET  2019-11-27 13:26:12 0.850640
1   HELMET  2019-11-27 13:26:15 0.845127
2   HELMET  2019-11-27 13:26:42 0.879000
3   HELMET  2019-11-27 13:26:45 0.935555
4   VEST    2019-11-27 13:26:12 0.699355
5   VEST    2019-11-27 13:26:15 0.715716
6   VEST    2019-11-27 13:26:42 0.630300
7   VEST    2019-11-27 13:26:45 0.440624

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM