Join 2 dataframes together if two columns match

Question

I have 2 dataframes:

CountryPoints

From.country  To.Country points
Belgium       Finland    4
Belgium       Germany    5
Malta         Italy      12
Malta         UK         1

and another dataframe with neighbouring/bordering countries:

From.country    To.Country 
    Belgium       Finland   
    Belgium       Germany   
    Malta         Italy

I would like to add another column to CountryPoints called Neighbour (Y/N), depending on whether the (From.country, To.Country) pair is found in the neighbouring/bordering countries dataframe. Is this possible? It is a kind of join, but the result should be a boolean column.

The result should be:

From.country  To.Country points  Neighbour
    Belgium       Finland    4    Y
    Belgium       Germany    5    Y
    Malta         Italy      12   Y
    Malta         UK         1    N

The question below shows how to merge, but it doesn't show how to add that extra boolean column.

Answer 1

Two alternative approaches:

1) with base R:

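# Build one key per row so that both columns must match in the same row of df2
# (matching each column separately would also flag pairs such as Belgium/Italy)
key1 <- paste(df1$From.country, df1$To.Country, sep = "|")
key2 <- paste(df2$From.country, df2$To.Country, sep = "|")
df1$Neighbour <- c('N','Y')[1 + (key1 %in% key2)]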
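
A point worth checking with any base R approach here is whether the two columns are tested separately or as one combined key. The sketch below is illustrative only: it adds a hypothetical row (Belgium, Italy) that is not in the original data, and `pts`, `nb`, `colwise` and `pairwise` are made-up names.

```r
# Hypothetical data: the question's rows plus one extra pair (Belgium, Italy)
pts <- data.frame(
  From.country = c("Belgium", "Belgium", "Malta", "Malta", "Belgium"),
  To.Country   = c("Finland", "Germany", "Italy", "UK", "Italy"),
  points       = c(4, 5, 12, 1, 7)
)
nb <- data.frame(
  From.country = c("Belgium", "Belgium", "Malta"),
  To.Country   = c("Finland", "Germany", "Italy")
)

# Column-wise test: each column is looked up independently of the other
colwise <- match(pts$From.country, nb$From.country, nomatch = 0) &
  match(pts$To.Country, nb$To.Country, nomatch = 0)

# Pair-wise test: both columns are combined into one key per row
pairwise <- paste(pts$From.country, pts$To.Country, sep = "|") %in%
  paste(nb$From.country, nb$To.Country, sep = "|")

# Side-by-side comparison of the two flags
data.frame(pts[, 1:2], colwise, pairwise)
```

On the question's four rows the two tests agree. They differ only on the added row, where Belgium and Italy each appear in the neighbour table but never together.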

2) with data.table:

library(data.table)
setDT(df1)
setDT(df2)

df1[, Neighbour := 'N'][df2, on = .(From.country, To.Country), Neighbour := 'Y'][]

which both give (data.table output shown):

   From.country To.Country points Neighbour
1:      Belgium    Finland      4         Y
2:      Belgium    Germany      5         Y
3:        Malta      Italy     12         Y
4:        Malta         UK      1         N
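
Since the question frames this as a kind of join, the same result can also be sketched with base R's `merge()`: add a flag column to the neighbour table, left-join it onto the points table, and fill in the unmatched rows. This is a minimal sketch and not part of the original answer; `pts`, `nb` and `out` are illustrative names.

```r
pts <- data.frame(
  From.country = c("Belgium", "Belgium", "Malta", "Malta"),
  To.Country   = c("Finland", "Germany", "Italy", "UK"),
  points       = c(4, 5, 12, 1)
)
nb <- data.frame(
  From.country = c("Belgium", "Belgium", "Malta"),
  To.Country   = c("Finland", "Germany", "Italy")
)

# Mark every row of the neighbour table, then left-join on both key columns
nb$Neighbour <- "Y"
out <- merge(pts, nb, by = c("From.country", "To.Country"), all.x = TRUE)

# Rows of pts without a partner in nb come back as NA; turn those into "N"
out$Neighbour[is.na(out$Neighbour)] <- "N"
out
```

By default `merge()` sorts the result by the key columns, so the row order of the output may differ from the input.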

Answer 2

Borrowing the idea from this post:

df1$Neighbour  <- duplicated(rbind(df2[, 1:2], df1[, 1:2]))[ -seq_len(nrow(df2)) ]

df1
#   From.country To.Country points Neighbour
# 1      Belgium    Finland      4      TRUE
# 2      Belgium    Germany      5      TRUE
# 3        Malta      Italy     12      TRUE
# 4        Malta         UK      1     FALSE
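
To see why this one-liner works, it can help to unpack it. Stacking the neighbour pairs on top of the points pairs means `duplicated()` flags any later row that repeats an earlier one, and dropping the first `nrow(df2)` results leaves one flag per row of the points table. The sketch below uses the question's data with illustrative names (`pts`, `nb`, `stacked`, `dup`). Note that with this scheme a pair occurring twice in the points table would also be flagged on its second occurrence, whether or not it is a neighbour.

```r
pts <- data.frame(
  From.country = c("Belgium", "Belgium", "Malta", "Malta"),
  To.Country   = c("Finland", "Germany", "Italy", "UK"),
  points       = c(4, 5, 12, 1)
)
nb <- data.frame(
  From.country = c("Belgium", "Belgium", "Malta"),
  To.Country   = c("Finland", "Germany", "Italy")
)

# Stack the key columns: neighbour pairs first, then the pairs to be tested
stacked <- rbind(nb[, 1:2], pts[, 1:2])

# TRUE where a row repeats one that appeared earlier in the stack
dup <- duplicated(stacked)

# The first nrow(nb) entries belong to nb itself; keep only the part for pts
pts$Neighbour <- dup[-seq_len(nrow(nb))]
pts
```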

Answer 3

What about something like this?

# Sort the two country names before pasting, so the key ignores the From/To order
sortpaste <- function(x) paste0(sort(x), collapse = "_")
df1$Neighbour <- apply(df1[, 1:2], 1, sortpaste) %in% apply(df2[, 1:2], 1, sortpaste)
df1
#  From.country To.Country points Neighbour
#1      Belgium    Finland      4      TRUE
#2      Belgium    Germany      5      TRUE
#3        Malta      Italy     12      TRUE
#4        Malta         UK      1     FALSE
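
Because `sortpaste` sorts the two names before pasting them, this check treats a pair as unordered: (Finland, Belgium) produces the same key as (Belgium, Finland). Whether that is desirable depends on whether borders are recorded in one direction only. A minimal sketch, using a hypothetical reversed row that is not in the original data (`rev_pts`, `unordered` and `ordered` are illustrative names):

```r
sortpaste <- function(x) paste0(sort(x), collapse = "_")

nb <- data.frame(
  From.country = c("Belgium", "Belgium", "Malta"),
  To.Country   = c("Finland", "Germany", "Italy")
)
# Hypothetical rows: one reversed neighbour pair and one non-neighbour pair
rev_pts <- data.frame(
  From.country = c("Finland", "Malta"),
  To.Country   = c("Belgium", "UK")
)

# Order-insensitive keys: names are sorted within each row before pasting
unordered <- apply(rev_pts, 1, sortpaste) %in% apply(nb, 1, sortpaste)

# Order-sensitive keys for comparison: From and To are pasted as given
ordered <- paste(rev_pts$From.country, rev_pts$To.Country) %in%
  paste(nb$From.country, nb$To.Country)

data.frame(rev_pts, unordered, ordered)
```

The reversed pair counts as a neighbour only under the sorted key.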

Sample data

df1 <- read.table(text =
    "From.country  To.Country points
Belgium       Finland    4
Belgium       Germany    5
Malta         Italy      12
Malta         UK         1", header = TRUE)

df2 <- read.table(text =
    "From.country    To.Country
    Belgium       Finland
    Belgium       Germany
    Malta         Italy", header = TRUE)

Answer 1: score 3, 2018-05-09 12:23:20
Answer 2: score 2, 2018-05-09 13:47:58
Answer 3: score 0, accepted, 2018-05-09 11:53:24
