R中两个数据帧上的条件JOIN

Question

Suppose there are two data frames likes the following (given from this post ): 假设有两个数据帧，如下所示（从本文中得出）：

df1 = data.frame(CustomerId = c(1:6), Product = c(rep("Toaster", 3), rep("Radio", 3)))
df2 = data.frame(CustomerId = c(2, 4, 6), State = c(rep("Alabama", 2), rep("Ohio", 1)))

df1
#  CustomerId Product
#           1 Toaster
#           2 Toaster
#           3 Toaster
#           4   Radio
#           5   Radio
#           6   Radio

df2
#  CustomerId   State
#           2 Alabama
#           4 Alabama
#           6    Ohio

The question is how can I do the following sql query in R: 问题是如何在R中执行以下sql查询：

SELECT * FROM df1 JOIN df2 on df1.CustomerId <= df2.CustomerId

What I have known is that I can do the inner join using merge(df1, df2, by = "CustomerId") . 我所知道的是，我可以使用merge(df1, df2, by = "CustomerId")进行内部merge(df1, df2, by = "CustomerId") 。 But it is not satisfied the condition of the join. 但是不满足加入条件。

Answer 1

This one confusing way to do this. 这是一种令人困惑的方法。 But it works though: 但是它可以工作：

library(tidyverse)
df1 = data.frame(CustomerId = c(1:6), Product = c(rep("Toaster", 3), rep("Radio", 3)))
df2 = data.frame(CustomerId = c(2, 4, 6), State = c(rep("Alabama", 2), rep("Ohio", 1)))

map2_df(
  df1$CustomerId, df1$Product,
  .f = ~ {
    temp <- df2 %>% filter(.x <= CustomerId)
    tibble(CustomerId.x = .x, Product = .y, 
           CustomerId.y = temp$CustomerId, State = temp$State)
  }
)

Answer 2

As I found in comments by dear Grothendieck, one straightforward solution is using sqldf package and get exactly my result in sql format: 正如我在亲爱的Grothendieck的评论中所发现的那样，一个简单的解决方案是使用sqldf软件包，并以sql格式获取我的结果：

library(sqldf)
sqldf("SELECT * FROM df1 JOIN df2 on df1.CustomerId <= df2.CustomerId")

R中两个数据帧上的条件JOIN

问题描述

2 个解决方案

解决方案1
0 2017-10-15 16:01:29

解决方案2
0 已采纳 2017-10-23 19:00:13

R中两个数据帧上的条件JOIN

问题描述

2 个解决方案

解决方案1 0 2017-10-15 16:01:29

解决方案2 0 已采纳 2017-10-23 19:00:13

解决方案1
0 2017-10-15 16:01:29

解决方案2
0 已采纳 2017-10-23 19:00:13