dplyr字符串作為列引用

Question

無論如何將字符串作為列引用傳遞給dplyr過程？

下面是一個示例 - 使用分組數據集和一個簡單函數，我嘗試將字符串作為引用傳遞給列。 謝謝！

machines <- data.frame(Date=c("1/31/2014", "1/31/2014", "2/28/2014", "2/28/2014", "3/31/2014", "3/31/2014"), 
            Model.Num=c("123", "456", "123", "456", "123", "456"), 
            Cost=c(200, 300, 250, 350, 300, 400))

my.fun <- function(data, colname){
    mutate(data, position=cumsum(as.name(colname)))
}

machines <- machines %>% group_by(Date, Model.Num)     
machines <- my.fun(machines, "Cost")

Answer 1

這是一個使用lazyeval包中的interp interp()的選項， interp()隨dplyr安裝一起提供。 在您的函數內部，您需要使用dplyr函數的標准評估版本。 在這種情況下，它將是mutate_() 。

請注意，由於您在machines設置分組的方式，新列position將與此處的“ Cost列相同。 對my_fun()的第二次調用顯示它正在處理一組不同的分組變量。

library(dplyr)
library(lazyeval)

my_fun <- function(data, col) {
    mutate_(data, position = interp(~ cumsum(x), x = as.name(col)))
}

my_fun(machines, "Cost")
#        Date Model.Num Cost position
# 1 1/31/2014       123  200      200
# 2 1/31/2014       456  300      300
# 3 2/28/2014       123  250      250
# 4 2/28/2014       456  350      350
# 5 3/31/2014       123  300      300
# 6 3/31/2014       456  400      400

## second example - different grouping
my_fun(group_by(machines, Model.Num), "Cost")
#        Date Model.Num Cost position
# 1 1/31/2014       123  200      200
# 2 1/31/2014       456  300      300
# 3 2/28/2014       123  250      450
# 4 2/28/2014       456  350      650
# 5 3/31/2014       123  300      750
# 6 3/31/2014       456  400     1050

Answer 2

我們可以在不使用lazyeval包的標准評估中進行評估。 我們可以使用setNames將一些字符串設置為變量名。

library(tidyverse)

machines <- data.frame(
  Date = c("1/31/2014", "1/31/2014", "2/28/2014", "2/28/2014", "3/31/2014", "3/31/2014"), 
  Model.Num = c("123", "456", "123", "456", "123", "456"), 
  Cost = c(200, 300, 250, 350, 300, 400)
  )

my_fun <- function(data, col) {
  mutate_(data, .dots = setNames(paste0("cumsum(", col, ")"), "position"))
}

my_fun(machines %>% group_by(Date, Model.Num), "Cost")
# Source: local data frame [6 x 4]
# Groups: Date, Model.Num [6]
# 
# Date Model.Num  Cost position
# <fctr>    <fctr> <dbl>    <dbl>
# 1 1/31/2014       123   200      200
# 2 1/31/2014       456   300      300
# 3 2/28/2014       123   250      250
# 4 2/28/2014       456   350      350
# 5 3/31/2014       123   300      300
# 6 3/31/2014       456   400      400
my_fun(machines %>% group_by(Model.Num), "Cost")
# Source: local data frame [6 x 4]
# Groups: Model.Num [2]
# 
# Date Model.Num  Cost position
# <fctr>    <fctr> <dbl>    <dbl>
# 1 1/31/2014       123   200      200
# 2 1/31/2014       456   300      300
# 3 2/28/2014       123   250      450
# 4 2/28/2014       456   350      650
# 5 3/31/2014       123   300      750
# 6 3/31/2014       456   400     1050

dplyr字符串作為列引用

問題描述

2 個解決方案

解決方案1
7 已采納 2015-01-17 01:12:29

解決方案2
0 2016-10-26 05:37:46

dplyr字符串作為列引用

問題描述

2 個解決方案

解決方案1 7 已采納 2015-01-17 01:12:29

解決方案2 0 2016-10-26 05:37:46

解決方案1
7 已采納 2015-01-17 01:12:29

解決方案2
0 2016-10-26 05:37:46