[英]How can I remove all rows of a dataframe that have the same string value across a subset of columns in R?
[英]How to do subset of the dataframe with value 1 in both rows and columns in R
我有 dataframe,它有 900 行和 2000 列。行和列有不同的 ID 名稱。 我需要從 dataframe 中提取值 1 以及列名和行名。 我附上了示例輸入我附上了示例 output output 應該是這種格式
一個簡單的基本 R 方法如下:
data[data != 1] <- NA
data[rowSums(data,na.rm = TRUE) > 0, colSums(data,na.rm = TRUE) > 0]
# A C D F H
#I 1 1 1 NA NA
#J NA 1 NA 1 1
#K NA NA NA NA 1
樣本數據:
data <- structure(list(A = c(1L, 3L, 3L, 3L, 4L, 3L, 3L, 4L, 4L, 3L,
3L), B = c(2L, 4L, 2L, 3L, 3L, 3L, 2L, 3L, 3L, 2L, 3L), C = c(1L,
1L, 2L, 2L, 3L, 3L, 2L, 3L, 3L, 3L, 2L), D = c(1L, 0L, 2L, 2L,
3L, 3L, 3L, 3L, 3L, 2L, 2L), E = c(0L, 2L, 2L, 3L, 4L, 3L, 3L,
3L, 3L, 3L, 3L), F = c(0L, 1L, 2L, 2L, 3L, 3L, 2L, 3L, 4L, 2L,
2L), G = c(2L, 0L, 2L, 2L, 3L, 3L, 2L, 3L, 3L, 2L, 2L), H = c(2L,
1L, 1L, 3L, 2L, 3L, 2L, 3L, 3L, 2L, 3L)), class = "data.frame", row.names = c("I",
"J", "K", "L", "M", "N", "O", "P", "Q", "R", "S"))
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.