简体   繁体   English

如何将不同格式的字符串(YYYY-MM-DD,DD-MM-YYYY)转换为熊猫中一种格式的日期对象?

[英]How to convert string of different format(YYYY-MM-DD, DD-MM-YYYY) to date object of one format in pandas?

I have a column of string object where it contains different format(YYYY-MM-DD, DD-MM-YYYY). 我有一列字符串对象,其中包含不同的格式(YYYY-MM-DD,DD-MM-YYYY)。 How to convert to DD-MM-YYYY of date object. 如何转换为日期对象的DD-MM-YYYY。

I tried with, df['accepted_date'] = pd.to_datetime(df['accepted_date'], format='%d-%m-%Y') 我尝试使用df ['accepted_date'] = pd.to_datetime(df ['accepted_date'],format ='%d-%m-%Y')

I got error as time data '1899-12-31' does not match format '%d-%m-%Y' (match) 我收到错误消息,因为时间数据'1899-12-31'与格式'%d-%m-%Y'不匹配(匹配)

Thanks, 谢谢,

Let pandas to parse dates, but then some days with months should be swapped: 让大熊猫解析日期,但随后应将几个月的月份替换为:

df['accepted_date'] = pd.to_datetime(df['accepted_date'])

So better is use to_datetime with format and parameter errors='coerce' , what return only matched datetimes with NaT for non matched. 更好的方法是使用to_datetime ,其格式和参数errors='coerce' ,对于不匹配的内容,仅返回带有NaT匹配日期时间。 Last use combine_first for join all Series - NaT are replaced by values from another Series : 最后使用combine_first为所有参加Series - NaT被来自另一个值替换Series

df = pd.DataFrame({'accepted_date':['2017-01-02','07-08-2017','20-03-2017','2017-01-04']})

d1 = pd.to_datetime(df['accepted_date'], format='%d-%m-%Y', errors='coerce')
d2 = pd.to_datetime(df['accepted_date'], format='%Y-%m-%d', errors='coerce')

df['accepted_date1'] = d1.combine_first(d2)
df['accepted_date2'] = pd.to_datetime(df['accepted_date'])
print (df)
  accepted_date accepted_date1 accepted_date2
0    2017-01-02     2017-01-02     2017-01-02
1    07-08-2017     2017-08-07     2017-07-08 <-swapped dd-mm
2    20-03-2017     2017-03-20     2017-03-20
3    2017-01-04     2017-01-04     2017-01-04

Detail : 详细说明

print (d1)
0          NaT
1   2017-08-07
2   2017-03-20
3          NaT
Name: accepted_date, dtype: datetime64[ns]

print (d2)
0   2017-01-02
1          NaT
2          NaT
3   2017-01-04
Name: accepted_date, dtype: datetime64[ns]

EDIT: 编辑:

Another solution is use parameter dayfirst=True : 另一个解决方案是使用参数dayfirst=True

df['accepted_date3'] = pd.to_datetime(df['accepted_date'], dayfirst=True)
print (df)
  accepted_date accepted_date3
0    2017-01-02     2017-01-02
1    07-08-2017     2017-08-07
2    20-03-2017     2017-03-20
3    2017-01-04     2017-01-04

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM