How to identify whether all data frames in a list have unique IDs

Question

I have a list of data frames. Is there a smart way to tell whether each data frame in lst has unique IDs, and to create a summary table reporting the result for each one?

Sample data:

lst<-list(structure(list(ID = c("Tom", "Jerry", "Mary"), Score = c(85, 
85, 96)), row.names = c(NA, -3L), class = c("tbl_df", "tbl", 
"data.frame")), structure(list(ID = c("Tom", "Jerry", "Mary", 
"Jerry"), Score = c(75, 65, 88, 98)), row.names = c(NA, -4L), class = c("tbl_df", 
"tbl", "data.frame")), structure(list(ID = c("Tom", "Jerry", 
"Tom"), Score = c(97, 65, 96)), row.names = c(NA, -3L), class = c("tbl_df", 
"tbl", "data.frame")))

Answer 1

We could loop over the list and check with n_distinct: a data frame has unique IDs when n_distinct(ID) equals its row count, n().

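library(dplyr)
library(stringr)
library(purrr)
# Name the list elements df1, df2, ... so that .id can label each row
map_dfr(setNames(lst, str_c("df", seq_along(lst))), 
   # "Y" when the number of distinct IDs equals the row count, otherwise "N"
   ~.x %>% 
   summarise(UniqueID = c("N", "Y")[1 + (n_distinct(ID) == n())]), .id= 'Data')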

Output:

# A tibble: 3 × 2
  Data  UniqueID
  <chr> <chr>   
1 df1   Y       
2 df2   N       
3 df3   N
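
The indexing expression c("N", "Y")[1 + (...)] in the answer above may look cryptic at first. A minimal sketch of how it works, using a made-up logical vector named flag (not part of the original answer):

```r
# A logical value coerces to 0 (FALSE) or 1 (TRUE) in arithmetic,
# so 1 + flag is position 1 ("N") for FALSE and position 2 ("Y") for TRUE.
flag <- c(TRUE, FALSE, FALSE)   # hypothetical results of the uniqueness check
labels <- c("N", "Y")[1 + flag]
labels
```

This is equivalent to ifelse(flag, "Y", "N"); which form to use is mostly a matter of taste.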

Answer 2

In base R:

# An ID column is unique when its number of distinct values equals the row count
data.frame(Data = paste0("df", seq_along(lst)),
           UniqueID = ifelse(sapply(lst, \(x) length(unique(x$ID)) == nrow(x)), "Y", "N"))

  Data UniqueID
1  df1        Y
2  df2        N
3  df3        N
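
If the check is needed in more than one place, it could be wrapped in a small helper. The sketch below is one possible variant, not taken from either answer: the function name summarise_unique_id, the id_col argument, and the demo data are all hypothetical, and it swaps in base R's anyDuplicated(), which returns 0 when no value is repeated.

```r
# Hypothetical helper: one row per data frame, "Y" if the ID column has no repeats
summarise_unique_id <- function(dfs, id_col = "ID") {
  is_unique <- vapply(
    dfs,
    function(x) anyDuplicated(x[[id_col]]) == 0L,  # 0 means no duplicated value
    logical(1)
  )
  data.frame(
    Data = paste0("df", seq_along(dfs)),
    UniqueID = ifelse(is_unique, "Y", "N")
  )
}

# Made-up input for illustration (not the question's data)
demo <- list(
  data.frame(ID = c("a", "b"), Score = c(1, 2)),
  data.frame(ID = c("a", "a"), Score = c(3, 4))
)
result <- summarise_unique_id(demo)
result
```

Using vapply() with logical(1) rather than sapply() keeps the return type predictable even when the input list is empty.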

Answer 1
3 Accepted 2022-09-09 15:01:15

Answer 2
3 2022-09-09 15:07:17
