熊貓時間序列重采樣：KeyError：“ [['year''month''day']不在索引中”

Question

請幫忙

我嘗試對數據進行重新采樣未成功。 它生成上面的錯誤，當應用DatetimeIndex時， 它將截斷時間戳，刪除HH：MM：SS 。 它仍然不能將數據識別為Datetime對象。 提前致謝。

源文件可以在這里找到

import pandas as pd
import numpy as np

df= pd.read_csv('20170713.csv')
df2= df.loc[:,['sen_id', 'pos_id', 'heat_val', 'sat_val', 'timestamp']] 
cols = df2.columns.tolist() 
cols = cols[-1:] + cols[:-1]
df2 = df2[cols]
#print(df2.head())

df3 = df2.set_index(['timestamp'])
df3.index = pd.DatetimeIndex(df3.index)
print(df3.head())

pd.to_datetime(df3[['year', 'month', 'day']])
df3.resample('1H').mean()
print(df3)

Answer 1

問題是pd.to_datetime()使用不正確，其中您提供了df3不存在的三列作為df3[['year','month','day']] 。 相反，您只想提供一個系列。 然后，您要提供參數format='%d/%m/%Y %H:%M' ，它對應於您的日期strptime格式

df= pd.read_csv('20170713.csv')
df2= df.loc[:,['sen_id', 'pos_id', 'heat_val', 'sat_val', 'timestamp']] 
cols = df2.columns.tolist() 
cols = cols[-1:] + cols[:-1]
df2 = df2[cols]
#print(df2.head())

df3 = df2.set_index(['timestamp'])
#df3.index = pd.DatetimeIndex(df3.index)
#print(df3.head())

#pd.to_datetime(df3[['year', 'month', 'day']])
df3.index = pd.to_datetime(df3.index,format='%d/%m/%Y %H:%M')
df3 = df3.resample('1H').mean()
print(df3)

舉個例子，為了提高可讀性，您的代碼實際上也可以被壓縮，

df = pd.read_csv('20170713.csv')

#Preserve desired columns and reorder as df2
df2 = df[['timestamp', 'sen_id', 'pos_id', 'heat_val', 'sat_val']]

#set timestamp as index and convert to datetime
df2.set_index(['timestamp'],drop=True,inplace=True)
df2.index = pd.to_datetime(df2.index,format='%d/%m/%Y %H:%M')

#resample
df3 = df2.resample('1H').mean()

print df3

熊貓時間序列重采樣：KeyError：“ [['year''month''day']不在索引中”

問題描述

1 個解決方案

解決方案1
0 已采納 2017-07-29 22:11:06

熊貓時間序列重采樣：KeyError：“ [[&#39;year&#39;&#39;month&#39;&#39;day&#39;]不在索引中”

問題描述

1 個解決方案

解決方案1 0 已采納 2017-07-29 22:11:06

熊貓時間序列重采樣：KeyError：“ [['year''month''day']不在索引中”

解決方案1
0 已采納 2017-07-29 22:11:06