如何遍历for循环中的熊猫行

Question

我在R中有两个数据框

 df1 
 code     date          time       prod    price
 123      01-01-2018    06:11:00   MS      12
 123      01-01-2018    06:16:12   HS      13
 123      01-01-2018    06:17:12   HS      13
 123      01-01-2018    06:19:00   MS      12
 123      02-01-2018    06:17:12   HS      13
 123      02-01-2018    06:19:00   MS      12

 df2
 code     date         prod      price
 123      01-01-2018   MS        12
 123      01-01-2018   HS        13
 123      02-01-2018   HS        13

我想用df1中的唯一日期过滤df1和df2中的行。 我的df1有1000多行，而df2只有4行

例如，在df1有2个唯一的dates ，其中有两个唯一的产品MS和HS 。 所以我想比较例如我想过滤日期和产品01-01-2018 & MS行，然后用df2与相同的日期和时间进行比较

我目前也在使用for循环

unique_dates = iss_trans_268559['transaction_date'].unique()   
unique_dates.sort()
unique_products = iss_trans_268559['prodcode'].unique()

for i in range(len(unique_dates)):
   current_date = df1[(df1['date'] == unique_dates[i]) & df1['prod'] == unique_products[i] 
   df2_current = df2[df2['date'] == unique_dates[i]]

我应该在for循环中获得以下数据帧，然后可以比较current_date和df2_current的日期

current_date
 code     date          time       prod    price
 123      01-01-2018    06:11:00   MS      12
 123      01-01-2018    06:19:00   MS      12

 df2_current 
 code     date         prod      price
 123      01-01-2018   MS        12

上面循环中的问题是i将对unique_products和df2超出限制，我该怎么办？

Answer 1

我认为您需要：

dates= df1['date'].unique()
prods = df1['prod'].unique()

import itertools

comb = list(itertools.product(dates, prods))
# [('01-01-2018', 'MS'), ('01-01-2018', 'HS'), ('02-01-2018', 'MS'), ('02-01-2018', 'HS')]

current_date = []
df2_current = []

for i,j in comb:
    current_date.append(df1[(df1['date']==i) & (df1['prod']==j)])
    df2_current.append(df2[(df2['date']==i) & (df2['prod']==j)])

您可以使用current_date[0]和df2_current[0]访问数据df2_current[0]依此类推...

如何遍历for循环中的熊猫行

问题描述

1 个解决方案

解决方案1
2 已采纳 2018-12-07 09:22:50

如何遍历for循环中的熊猫行

问题描述

1 个解决方案

解决方案1 2 已采纳 2018-12-07 09:22:50

解决方案1
2 已采纳 2018-12-07 09:22:50