如何根據python中的時間變化對數據集進行分類或重組

Question

我需要為每小時不同時間之間的值分配數字。 然后我如何向其中添加一個新列，我可以在其中指定要每小時分組的每個單元格。 比如00:00:00到00:59:59的交易都填1，01:00:00到01:59:59的交易填2，以此類推到23:00 :00 到 23:59:59 填充 24

Time_duration = df['period']

print (Time_duration)

0        23:59:56
1        23:59:56
2        23:59:55
3        23:59:53
4        23:59:52
           ...
74187    00:00:18
74188    00:00:09
74189    00:00:08
74190    00:00:03
74191    00:00:02 ```


# this is the result I desire.... How can I then add a new column to this where I can specify each cell to be grouped hourly. for instance, all the transactions within 00:00:00 to 00:59:59 to be filled with 1, transactions within 01:00:00 to 01:59:59 to be filled with 2, and so on till 23:00:00 to 23:59:59 to be filled with 24.

0        23:59:56        24
1        23:59:56        24
2        23:59:55        24
3        23:59:53        24
4        23:59:52        24
           ...
74187    00:00:18         1
74188    00:00:09         1
74189    00:00:08         1
74190    00:00:03         1
74191    00:00:02         1

Answer 1

您可以使用正則表達式和str.extract

import pandas as pd
pattern= r'^(\d{1,2}):' #capture the digits of the hour
df['hour']=df['period'].str.extract(pattern).astype('int') + 1 # cast it as int so that you can add 1

Answer 2

df.sort_values(by=["period"])
timeStamp_list = (pd.to_datetime(list(df['period'])))
df['Hour'] =timeStamp_list.hour

試試這個代碼，這對我有用。

如何根據python中的時間變化對數據集進行分類或重組

問題描述

2 個解決方案

解決方案1
0 2020-02-01 11:14:50

解決方案2
0 已采納 2020-02-01 12:54:25

如何根據python中的時間變化對數據集進行分類或重組

問題描述

2 個解決方案

解決方案1 0 2020-02-01 11:14:50

解決方案2 0 已采納 2020-02-01 12:54:25

解決方案1
0 2020-02-01 11:14:50

解決方案2
0 已采納 2020-02-01 12:54:25