[英]How to convert string of different format(YYYY-MM-DD, DD-MM-YYYY) to date object of one format in pandas?
I have a column of string object where it contains different format(YYYY-MM-DD, DD-MM-YYYY). 我有一列字符串对象,其中包含不同的格式(YYYY-MM-DD,DD-MM-YYYY)。 How to convert to DD-MM-YYYY of date object.
如何转换为日期对象的DD-MM-YYYY。
I tried with, df['accepted_date'] = pd.to_datetime(df['accepted_date'], format='%d-%m-%Y') 我尝试使用df ['accepted_date'] = pd.to_datetime(df ['accepted_date'],format ='%d-%m-%Y')
I got error as time data '1899-12-31' does not match format '%d-%m-%Y' (match) 我收到错误消息,因为时间数据'1899-12-31'与格式'%d-%m-%Y'不匹配(匹配)
Thanks, 谢谢,
Let pandas to parse dates, but then some days with months should be swapped: 让大熊猫解析日期,但随后应将几个月的月份替换为:
df['accepted_date'] = pd.to_datetime(df['accepted_date'])
So better is use to_datetime
with format and parameter errors='coerce'
, what return only matched datetimes with NaT
for non matched. 更好的方法是使用
to_datetime
,其格式和参数errors='coerce'
,对于不匹配的内容,仅返回带有NaT
匹配日期时间。 Last use combine_first
for join all Series
- NaT
are replaced by values from another Series
: 最后使用
combine_first
为所有参加Series
- NaT
被来自另一个值替换Series
:
df = pd.DataFrame({'accepted_date':['2017-01-02','07-08-2017','20-03-2017','2017-01-04']})
d1 = pd.to_datetime(df['accepted_date'], format='%d-%m-%Y', errors='coerce')
d2 = pd.to_datetime(df['accepted_date'], format='%Y-%m-%d', errors='coerce')
df['accepted_date1'] = d1.combine_first(d2)
df['accepted_date2'] = pd.to_datetime(df['accepted_date'])
print (df)
accepted_date accepted_date1 accepted_date2
0 2017-01-02 2017-01-02 2017-01-02
1 07-08-2017 2017-08-07 2017-07-08 <-swapped dd-mm
2 20-03-2017 2017-03-20 2017-03-20
3 2017-01-04 2017-01-04 2017-01-04
Detail : 详细说明 :
print (d1)
0 NaT
1 2017-08-07
2 2017-03-20
3 NaT
Name: accepted_date, dtype: datetime64[ns]
print (d2)
0 2017-01-02
1 NaT
2 NaT
3 2017-01-04
Name: accepted_date, dtype: datetime64[ns]
EDIT: 编辑:
Another solution is use parameter dayfirst=True
: 另一个解决方案是使用参数
dayfirst=True
:
df['accepted_date3'] = pd.to_datetime(df['accepted_date'], dayfirst=True)
print (df)
accepted_date accepted_date3
0 2017-01-02 2017-01-02
1 07-08-2017 2017-08-07
2 20-03-2017 2017-03-20
3 2017-01-04 2017-01-04
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.