
How to refer to an argument as character in dplyr filter inside a function

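I am trying to build a function that calculates percentages for certain variables, but I am struggling to refer to the argument as a quoted character value, which I need inside a filter verb. I have the dataset below.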

e1_done <- structure(list(koen_new = c("Kvinde", "Kvinde", "Mand", "Kvinde", 
                                "Mand", "Mand", "Kvinde", "Kvinde", "Mand", "Mand", "Kvinde", 
                                "Kvinde", "Kvinde", "Mand", "Mand", "Mand", "Kvinde", "Kvinde", 
                                "Mand", "Kvinde", "Mand", "Mand", "Kvinde", "Kvinde", "Mand", 
                                "Mand", "Kvinde", "Mand", "Kvinde", "Kvinde", "Mand", "Kvinde", 
                                "Kvinde", "Mand", "Mand", "Kvinde", "Kvinde", "Mand", "Mand", 
                                "Mand", "Mand", "Mand", "Mand", "Mand", "Mand", "Kvinde", "Mand", 
                                "Kvinde", "Kvinde", "Kvinde"), 
frvlg_1 = structure(c(0, 0, 0, 
                                                                                     0, 0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 
                                                                                     0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0, 1, 0, 1, 0, 0, 0, 
                                                                                     0, 0, 0, 0, 0))), row.names = c(NA, -50L), class = c("tbl_df", "tbl", "data.frame"))

    # A tibble: 50 × 2
       koen_new frvlg_1
       <chr>      <dbl>
     1 Kvinde         0
     2 Kvinde         0
     3 Mand           0
     4 Kvinde         0
     5 Mand           0
     6 Mand           0
     7 Kvinde         1
     8 Kvinde         0
     9 Mand           0
    10 Mand           0
    # … with 40 more rows

I have built the following function:

per.gender <- function(x) {
  e1_done %>% 
    group_by(koen_new) %>% 
    mutate(total_n_gender = n()) %>% 
    group_by(koen_new,{{x}}) %>% 
    mutate(n_frvl = n()) %>% 
    dplyr::select(n_frvl, total_n_gender) %>% 
    mutate(procentandel = n_frvl/total_n_gender) %>% 
    distinct(koen_new, {{x}}, procentandel,.keep_all = TRUE) %>% 
    filter({{x}} == 1) %>% 
    ungroup() %>% 
    select(koen_new, procentandel) 
}

Which produces what I want:

per.gender(frvlg_1) 

# A tibble: 2 × 2
  koen_new procentandel
  <chr>           <dbl>
1 Kvinde         0.0417
2 Mand           0.115 

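However, I also want to rename the column procentandel to a specific value for each variable the function is run on. That is, I want to look up the variable in a codebook stored in another tibble, like this: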

codebook_e1 <- structure(list(Label = c("Frvlg: Kultur (Fx Museer, Lokalhistoriske Arkiver, Sangkor, Teater)", 
"Frvlg: Idræt (Fx Sportsklubber, Danseforeninger, Svømmehaller)", 
"Frvlg: Fritid i Øvrigt (Fx Hobbyforeninger, Slægtsforskning, Spejder)"
), Variable = c("frvlg_1", "frvlg_2", "frvlg_3")), row.names = c(NA, 
-3L), class = c("tbl_df", "tbl", "data.frame"))


# A tibble: 3 × 2
  Label                                                                 Variable
  <chr>                                                                 <chr>   
1 Frvlg: Kultur (Fx Museer, Lokalhistoriske Arkiver, Sangkor, Teater)   frvlg_1 
2 Frvlg: Idræt (Fx Sportsklubber, Danseforeninger, Svømmehaller)        frvlg_2 
3 Frvlg: Fritid i Øvrigt (Fx Hobbyforeninger, Slægtsforskning, Spejder) frvlg_3 

I can look up the value with this, which is the character value I want to rename the column procentandel to:

codebook_e1 %>% filter(Variable == "frvlg_1") %>% select(Label) %>% pull()
[1] "Frvlg: Kultur (Fx Museer, Lokalhistoriske Arkiver, Sangkor, Teater)"

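However, I cannot figure out how to refer to x as a character value in the filter verb inside the function in order to look it up in the codebook. I have tried various eval functions and so on, but nothing seems to work for me.

If I add a second argument containing x in quotes, it works, but I only want one argument in the function.

I hope the question is clear enough!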

Use rlang::ensym() to capture x as a symbol, which you can then convert with as.character():

library(tidyverse)

per.gender <- function(x) {
  new_name <- codebook_e1 %>% 
    filter(Variable == as.character(ensym(x))) %>% 
    select(Label) %>% 
    pull()

  e1_done %>% 
    group_by(koen_new) %>% 
    mutate(total_n_gender = n()) %>% 
    group_by(koen_new,{{x}}) %>% 
    mutate(n_frvl = n()) %>% 
    select(n_frvl, total_n_gender) %>% 
    mutate(procentandel = n_frvl/total_n_gender) %>% 
    distinct(koen_new, {{x}}, procentandel,.keep_all = TRUE) %>% 
    filter({{x}} == 1) %>% 
    ungroup() %>% 
    select(koen_new, !!new_name := procentandel) 
}

per.gender(frvlg_1) 

Result:

# A tibble: 2 x 2
  koen_new `Frvlg: Kultur (Fx Museer, Lokalhistoriske Arkiver, Sangkor, Teater)`
  <chr>                                                                    <dbl>
1 Kvinde                                                                  0.0417
2 Mand                                                                    0.115 
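
To see the capture step in isolation, here is a minimal sketch. The helper name arg_name is hypothetical and exists only for this illustration; it shows that ensym() turns a bare column name into a symbol, and that as.character() then yields a plain string that can be compared against the Variable column of the codebook.

```r
library(rlang)

# Hypothetical helper, for illustration only: returns the name of the
# bare (unquoted) argument as a character string.
arg_name <- function(x) {
  as.character(ensym(x))
}

bare   <- arg_name(frvlg_1)    # bare name; frvlg_1 need not exist as an object
quoted <- arg_name("frvlg_1")  # ensym() also accepts a string

print(bare)
print(quoted)
```

Because ensym() only accepts a symbol or a string, a call such as arg_name(a + b) raises an error rather than silently producing an unexpected name.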

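Also note the use of !! and the := operator in the final select() statement to use the value that new_name refers to; otherwise the column would simply be named "new_name".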
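
The difference can be checked on a small example. This is a sketch with made-up data: the tibble df and the label "Share of volunteers" are assumptions for illustration, not part of the original question.

```r
library(dplyr)

# Made-up data for illustration only.
df <- tibble(id = 1:2, value = c(0.25, 0.75))
new_name <- "Share of volunteers"

# Without !!, the left-hand side is taken literally as the new column name.
literal <- df %>% select(id, new_name = value)

# With !! and :=, the column is named after the string stored in new_name.
injected <- df %>% select(id, !!new_name := value)

print(names(literal))
print(names(injected))
```

The := operator is needed because R's syntax does not allow !!new_name on the left-hand side of a regular = inside a function call.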

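If passing the column name as a string is acceptable, a single-argument variant is also possible with dplyr's .data pronoun. The following is only a sketch under assumptions: the function name share_by_gender, the data argument, and the toy data are hypothetical, and the percentage is computed with mean() instead of the original chain of mutate() and distinct() calls.

```r
library(dplyr)

# Toy data standing in for e1_done (values are made up for illustration).
toy <- tibble(
  koen_new = c("Kvinde", "Kvinde", "Mand", "Mand"),
  frvlg_1  = c(1, 0, 0, 0)
)

# Hypothetical variant: `var` is a column name given as a string,
# so the same string could be used directly for a codebook lookup.
share_by_gender <- function(data, var) {
  data %>%
    group_by(koen_new) %>%
    summarise(procentandel = mean(.data[[var]] == 1), .groups = "drop")
}

res <- share_by_gender(toy, "frvlg_1")
print(res)
```

Unlike the original function, this version keeps groups in which no row equals 1 and reports them with a share of 0.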
