将熊猫数据框中的初始日期与时间列合并为日期时间

Question

I'm reading text log files that have the year-month-day in the header, and a column with hours:minutes:seconds for each row. 我正在读取文本日志文件，该文件的标题中包含年月日，并且每行的时间为小时：分钟：秒。 Sort of like this: 有点像这样：

Yr=17  Mn= 3 Dy= 3

19:22:59.894       52
19:24:12.130      106
19:24:13.241      107
...

I have the date as a datetime.date object: eg datetime.date(2017, 3, 3) . 我将日期作为datetime.date对象：例如datetime.date(2017, 3, 3) 。 I have the times as a Series: 我有一系列的时代：

df['Time'] = pd.to_datetime(df['Time strings'], format='%H:%M:%S.%f')

How can I add together the scalar date with the array of times? 如何将标量日期与时间数组相加？

Also, some of these logs go past midnight. 此外，其中一些日志会超过午夜。 I'm thinking I need to use something like numpy.unwrap() to continue onto the next date, but I'm not sure how to do that with pandas.datetime . 我在想我需要使用numpy.unwrap()类的东西继续到下一个日期，但是我不确定如何使用pandas.datetime 。

Answer 1

Instead of using pd.to_datetime on the times , you could use pd.to_timedelta . 而不是使用的pd.to_datetime的时候，你可以使用pd.to_timedelta 。 Then, you could simply add the datetime for that file to the entire column and it would be converted to a column of datetimes. 然后，您可以简单地将该文件的日期时间添加到整个列中，并将其转换为日期时间列。

Example 例

times = ['4:23:12.12', '11:25:43.23', '14:29:55.42']

df = pd.DataFrame(dict(times=times))

df.times = pd.to_timedelta(df.times)

df
#             times
# 0 04:23:12.120000
# 1 11:25:43.230000
# 2 14:29:55.420000

file_date = datetime.date(2017, 3, 3)

df.times += file_date

df
#                     times
# 0 2017-03-03 04:23:12.120
# 1 2017-03-03 11:25:43.230
# 2 2017-03-03 14:29:55.420

Alternatively, you could try to read the dates/times upon input with parse_dates . 或者，您可以尝试使用parse_dates输入时读取日期/时间。

将熊猫数据框中的初始日期与时间列合并为日期时间

问题描述

1 个解决方案

解决方案1
3 已采纳 2017-03-05 01:15:30

将熊猫数据框中的初始日期与时间列合并为日期时间

问题描述

1 个解决方案

解决方案1 3 已采纳 2017-03-05 01:15:30

解决方案1
3 已采纳 2017-03-05 01:15:30