
Filter out specific dates in a list from a MultiIndex

I have MultiIndex data from which I would like to filter out a list of specific dates, e.g.:

date_list = [Timestamp('2018-05-19 00:00:00'),
             Timestamp('2018-06-24 00:00:00'),
             Timestamp('2014-11-12 00:00:00'),
             Timestamp('2015-11-11 00:00:00'),
             Timestamp('2012-05-28 00:00:00'),
             Timestamp('2012-06-23 00:00:00')]

I tried to filter out these dates with the following, but it does not work:

df.iloc[df.index.get_level_values('Date') != date_list] 

Can anyone please help?

Use Index.isin and invert the resulting boolean mask with ~. iloc should be removed, because this is filtering by boolean indexing.

Notice: check that the Date level is a DatetimeIndex before filtering:

print (df.index.get_level_values('Date'))
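If that check prints a plain Index of strings rather than a DatetimeIndex, the Timestamp values in date_list are not expected to match anything. Below is a minimal sketch of one way to convert the level first. The small frame is hypothetical, and the code assumes Date is the first level of the index:

```python
import pandas as pd

# Hypothetical frame whose 'Date' level was left as plain strings.
df = pd.DataFrame({'Date': ['2018-05-19', '2014-11-10', '2018-06-24', '2014-11-13'],
                   'ID': [1, 1, 2, 2],
                   'Val': list('abcd')}).set_index(['Date', 'ID'])

dates = df.index.get_level_values('Date')
print(isinstance(dates, pd.DatetimeIndex))

# Convert only the 'Date' level (assumed to be level 0); 'ID' is left untouched.
if not isinstance(dates, pd.DatetimeIndex):
    new_levels = pd.to_datetime(df.index.levels[0])
    df.index = df.index.set_levels(new_levels, level='Date')

dates = df.index.get_level_values('Date')
print(dates)
```

After the conversion, the row order and the other level are unchanged; only the type of the Date labels differs.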

df1 = df[~df.index.get_level_values('Date').isin(date_list)]
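To see what the mask looks like, here is a small sketch on a standalone DatetimeIndex (the values are shortened from the question, and the variable names mask and keep are only for illustration). Comparing with != against a list is a position-by-position comparison, not a membership test, which is why isin is the better fit:

```python
import pandas as pd

dates = pd.DatetimeIndex(['2018-05-19', '2014-11-10', '2018-06-24', '2014-11-13'],
                         name='Date')
date_list = [pd.Timestamp('2018-05-19'),
             pd.Timestamp('2018-06-24'),
             pd.Timestamp('2014-11-12')]

mask = dates.isin(date_list)  # True where the date appears anywhere in date_list
keep = ~mask                  # inverted: True for the rows to keep

print(mask)
print(keep)
```

Passing keep to df[...] selects only the rows whose Date is not in the list.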

Another solution uses drop with the parameters level and errors:

df1 = df.drop(date_list, level='Date', errors='ignore')
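The errors='ignore' part matters when some dates in the list are not present in the index. A rough sketch on a hypothetical two-row frame; the try/except is there only to show whether the call without errors='ignore' complains about the missing label, which may depend on the pandas version:

```python
import pandas as pd

df = pd.DataFrame({'Date': pd.to_datetime(['2018-05-19', '2014-11-10']),
                   'ID': [1, 1],
                   'Val': ['a', 'b']}).set_index(['Date', 'ID'])

# The second date does not occur in the index.
date_list = [pd.Timestamp('2018-05-19'), pd.Timestamp('2015-11-11')]

try:
    df.drop(date_list, level='Date')
    raised = False
except KeyError:
    raised = True
print('KeyError without errors="ignore":', raised)

# With errors='ignore', labels that are missing are skipped silently.
df1 = df.drop(date_list, level='Date', errors='ignore')
print(df1)
```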

Sample:

import pandas as pd

df = pd.DataFrame({'Date':['2018-05-19','2014-11-10','2018-06-24','2014-11-13'],
                   'ID':[1,1,2,2],
                   'Val':list('abcd')})

df['Date'] = pd.to_datetime(df['Date'])
df = df.set_index(['Date','ID'])
print (df)
              Val
Date       ID    
2018-05-19 1    a
2014-11-10 1    b
2018-06-24 2    c
2014-11-13 2    d

date_list=[pd.Timestamp('2018-05-19 00:00:00'),
 pd.Timestamp('2018-06-24 00:00:00'),
 pd.Timestamp('2014-11-12 00:00:00'),
 pd.Timestamp('2015-11-11 00:00:00'),
 pd.Timestamp('2012-05-28 00:00:00'),
 pd.Timestamp('2012-06-23 00:00:00')] 

print (df.index.get_level_values('Date'))
DatetimeIndex(['2018-05-19', '2014-11-10', '2018-06-24', '2014-11-13'], 
              dtype='datetime64[ns]', name='Date', freq=None)

df1 = df[~df.index.get_level_values('Date').isin(date_list)]
print (df1)
              Val
Date       ID    
2014-11-10 1    b
2014-11-13 2    d

df1 = df.drop(date_list, level='Date', errors='ignore')
print (df1)
              Val
Date       ID    
2014-11-10 1    b
2014-11-13 2    d
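One caveat not covered by the sample: both approaches match exact timestamps. If the Date level carried a time of day, a midnight Timestamp in date_list would not match it. A possible workaround, sketched on hypothetical data, is to compare on the normalized (midnight) values with DatetimeIndex.normalize:

```python
import pandas as pd

# Hypothetical index whose dates carry a time of day.
idx = pd.MultiIndex.from_arrays(
    [pd.to_datetime(['2018-05-19 09:30:00', '2014-11-10 16:00:00']), [1, 1]],
    names=['Date', 'ID'])
df = pd.DataFrame({'Val': ['a', 'b']}, index=idx)

date_list = [pd.Timestamp('2018-05-19')]

dates = df.index.get_level_values('Date')
exact = df[~dates.isin(date_list)]               # exact match: 09:30 is not midnight
by_day = df[~dates.normalize().isin(date_list)]  # match by calendar day

print(len(exact))
print(by_day)
```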

