使用str_extract_all和unnest但从NA中丢失行

Question

我正在使用str_extract()和str_extract_all()对正则表达式进行一些研究。 有零个，一个或多个结果，所以我想将多个结果unnest()分成多行。 由于ab_all中的character（0）（我假设），unnest不会给出输出中的所有行。

library(tidyverse)

my_tbl <- tibble(clmn = c("abcd", "abef, abgh", "xkcd"))

ab_tbl <- my_tbl %>% 
  mutate(ab = str_extract(clmn, "(?<=ab)[:alpha:]*\\b"), 
         ab_all = str_extract_all(clmn, "(?<=ab)[:alpha:]*\\b"), 
         cd = str_extract(clmn, "[:alpha:]*(?=cd)"))

ab_tbl %>% unnest(ab_all, .drop = FALSE)

# A tibble: 3 x 4
  clmn       ab    cd    ab_all
  <chr>      <chr> <chr> <chr> 
1 abcd       cd    ab    cd    
2 abef, abgh ef    NA    ef    
3 abef, abgh ef    NA    gh

编辑：预期输出：

# A tibble: 3 x 4
  clmn       ab    cd    ab_all
  <chr>      <chr> <chr> <chr> 
1 abcd       cd    ab    cd    
2 abef, abgh ef    NA    ef    
3 abef, abgh ef    NA    gh 
4 xkcd       NA    xk    NA

在输出中未给出带有xkccd的行。 这与str_extract_all或其他消息有关吗，还是应该更改方法？

Answer 1

可能是我们可以将0的长度更改为NA ，然后执行unnest

library(tidyverse)
ab_tbl %>%
    mutate(ab_all = map(ab_all,  ~ if(length(.x) ==0) NA_character_ else .x)) %>% 
     unnest

注意：假设str_extract中的模式正确

使用str_extract_all和unnest但从NA中丢失行

问题描述

1 个解决方案

解决方案1
2 已采纳 2019-06-18 05:19:27

使用str_extract_all和unnest但从NA中丢失行

问题描述

1 个解决方案

解决方案1 2 已采纳 2019-06-18 05:19:27

解决方案1
2 已采纳 2019-06-18 05:19:27