
Extracting the hour from a time column in pandas

Suppose I have the following dataset:

[image: enter image description here]

How would I create a new column, to be the hour of the time?

For example, the code below works for individual times, but I haven't been able to generalise it for a column in pandas.

from datetime import datetime

t = datetime.strptime('9:33:07', '%H:%M:%S')
print(t.hour)

Use to_datetime to convert the column to datetimes, then take dt.hour:

import pandas as pd

df = pd.DataFrame({'TIME':['9:33:07','9:41:09']})

# should be slower (no explicit format is given)
#df['hour'] = pd.to_datetime(df['TIME']).dt.hour

df['hour'] = pd.to_datetime(df['TIME'], format='%H:%M:%S').dt.hour
print(df)
      TIME  hour
0  9:33:07     9
1  9:41:09     9
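If the column may contain values that do not match the format, to_datetime raises by default. A minimal sketch, assuming a hypothetical malformed entry ('bad value') in TIME, that uses errors='coerce' to turn such rows into missing values instead:

```python
import pandas as pd

# 'bad value' is a made-up entry to illustrate a row that does not parse
df = pd.DataFrame({'TIME': ['9:33:07', '9:41:09', 'bad value']})

# Unparseable strings become NaT instead of raising an error
parsed = pd.to_datetime(df['TIME'], format='%H:%M:%S', errors='coerce')

# dt.hour yields floats when NaT is present; the nullable 'Int64' dtype
# keeps whole-number hours and marks the bad row as <NA>
df['hour'] = parsed.dt.hour.astype('Int64')
print(df)
```

Whether coercing or failing loudly is the better choice depends on how much you trust the input data.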

If you want to work with datetimes in the TIME column, you can assign the converted values back:

df['TIME'] = pd.to_datetime(df['TIME'], format='%H:%M:%S')

df['hour'] = df['TIME'].dt.hour
print(df)
                 TIME  hour
0 1900-01-01 09:33:07     9
1 1900-01-01 09:41:09     9
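Note the placeholder date 1900-01-01 in the output above: a datetime always carries a date, so one is filled in. If that is unwanted, one possible alternative (not part of the original answer) is to treat the values as durations with pd.to_timedelta and read the hour from dt.components. A sketch, assuming every value has the H:MM:SS shape shown in the question:

```python
import pandas as pd

df = pd.DataFrame({'TIME': ['9:33:07', '9:41:09']})

# Parse the strings as elapsed time rather than as timestamps,
# so no dummy date is attached
td = pd.to_timedelta(df['TIME'])

# dt.components is a DataFrame with days, hours, minutes, ... columns
df['hour'] = td.dt.components['hours']
print(df)
```

This only gives the hour of day as long as the values stay below 24 hours; longer durations roll over into the days component.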

My suggestion:

df = pd.DataFrame({'TIME':['9:33:07','9:41:09']})
df['hour'] = df.TIME.str.extract(r"(^\d+):", expand=False)
  • "str.extract(...)" is a vectorized method that extracts the capture group of a regular expression (here r"(^\d+):", which matches the hour part of TIME). With "expand=False" it returns a pandas Series rather than a DataFrame.
  • The result is stored in the "hour" column. The values are strings, not integers.
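Because str.extract returns text, the "hour" column produced this way holds strings such as '9'. If numeric hours are needed (for sorting or arithmetic, say), a cast can be appended. A minimal sketch, assuming every row matches the pattern:

```python
import pandas as pd

df = pd.DataFrame({'TIME': ['9:33:07', '9:41:09']})

# Raw string avoids an invalid-escape warning for \d;
# astype(int) converts the extracted text to integers
df['hour'] = df['TIME'].str.extract(r'(^\d+):', expand=False).astype(int)
print(df.dtypes)
```

Rows that do not match the pattern would come back as missing values, and the plain int cast would then fail, so this is only safe on clean input.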

You can also apply extract() twice to pull out the 'hour' column:

# first grab the leading "9:" part, then keep only the digits
df['hour'] = df.TIME.str.extract(r"(\d+:)", expand=False)
df['hour'] = df.hour.str.extract(r"(\d+)", expand=False)
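As a regex-free variation on the same string-based idea, the value can be split on the colon instead. This is only a sketch, and the column names hour, minute and second are chosen here for illustration:

```python
import pandas as pd

df = pd.DataFrame({'TIME': ['9:33:07', '9:41:09']})

# expand=True returns one column per piece; cast the text to integers
parts = df['TIME'].str.split(':', expand=True).astype(int)
parts.columns = ['hour', 'minute', 'second']  # illustrative names

df['hour'] = parts['hour']
print(df)
```

It assumes every value contains exactly two colons; anything else would change the number of columns or break the integer cast.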
