[英]Calculate the specific time intervals between two given date values in a DataFrame
给定 Python 中的以下 pandas 数据帧:
显示 3 个灯泡在不同时间的开启和关闭。 使用日期时间对象。
date code other time
1 2022-02-27 15:30:21+00:00 5 ON NaT
2 2022-02-29 17:05:21+00:00 5 OFF 2 days 01:35:00
3 2022-04-07 17:05:21+00:00 5 OFF NaT
4 2022-04-06 16:10:21+00:00 4 ON NaT
5 2022-04-07 15:30:21+00:00 4 OFF 0 days 23:20:00
6 2022-02-03 22:40:21+00:00 3 ON NaT
7 2022-02-03 23:20:21+00:00 3 OFF 0 days 00:40:00
8 2022-02-04 00:20:21+00:00 3 ON NaT
9 2022-02-04 14:30:21+00:00 3 ON NaT
10 2022-01-31 15:30:21+00:00 3 ON NaT
11 2022-02-04 15:35:21+00:00 3 OFF 4 days 00:05:00
12 2022-02-04 15:40:21+00:00 3 OFF NaT
13 2022-02-04 19:40:21+00:00 3 ON NaT
14 2022-02-06 15:35:21+00:00 3 OFF 1 days 19:55:00
15 2022-02-23 21:10:21+00:00 3 ON NaT
16 2022-02-24 07:10:21+00:00 3 OFF 0 days 10:00:00
我想添加一个名为nights
的新列。 此列将仅包含变量time
与NaT
不同的行。 灯泡亮了多少晚的信息。 夜间时段定义为 22:00:00 至 05:00:00。
生成的 DataFrame 示例:
date code other time nights
1 2022-02-27 15:30:21+00:00 5 ON NaT 0
2 2022-02-29 17:05:21+00:00 5 OFF 2 days 01:35:00 2
3 2022-04-07 17:05:21+00:00 5 OFF NaT 0
4 2022-04-06 16:10:21+00:00 4 ON NaT 0
5 2022-04-07 15:30:21+00:00 4 OFF 0 days 23:20:00 1
6 2022-02-03 22:40:21+00:00 3 ON NaT 0
7 2022-02-03 23:20:21+00:00 3 OFF 0 days 00:40:00 0
8 2022-02-04 00:20:21+00:00 3 ON NaT 0
9 2022-02-04 14:30:21+00:00 3 ON NaT 0
10 2022-01-31 15:30:21+00:00 3 ON NaT 0
11 2022-02-04 15:35:21+00:00 3 OFF 4 days 00:05:00 4
12 2022-02-04 15:40:21+00:00 3 OFF NaT 0
13 2022-02-04 19:40:21+00:00 3 ON NaT 0
14 2022-02-06 15:35:21+00:00 3 OFF 1 days 19:55:00 2
15 2022-02-23 21:10:21+00:00 3 ON NaT 0
16 2022-02-24 07:10:21+00:00 3 OFF 0 days 10:00:00 1
灯泡 ON 寄存器就在time
变量的非 NaT 值之前。
补充信息:如果灯泡在半夜关闭并打开,则不考虑可变的nights
。 它必须花费整个时间间隔打开。
date code other time nights
1 2022-02-27 21:00:00+00:00 1 ON NaT 0
2 2022-02-28 01:00:00+00:00 1 OFF 0 days 04:00:00 0
3 2022-02-28 03:15:00+00:00 1 ON NaT 0
4 2022-02-28 09:30:00+00:00 1 OFF 0 days 06:15:00 0
不知道你想如何处理部分夜晚,你可以尝试这样的事情.... [更新以更好地处理部分白天] [更新以添加导入]
import pandas as pd
import datetime as dt
df['edate'] = df.date + df.time
df['first_night_start'] = pd.to_datetime(df.date.dt.date) + dt.timedelta(hours=22)
df['first_night_end'] = df['first_night_start'] + dt.timedelta(hours=7)
def get_nights(ser):
if pd.isna(ser.time):
return
days = ser.time.days
the_rest = ser.time - dt.timedelta(days)
if ((ser.date + the_rest) > ser.first_night_end) and (ser.date < ser.first_night_start):
return days + 1
else:
return days
df['nights'] = df.apply(get_nights, axis=1)
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.