繁体 English 中英

仅保留所有组中的公共行

[英]Keeping only common rows in all groups

原文 2021-05-27 12:43:38 8 2 r/ dplyr

我有一个包含十组的数据集。 某些组中缺少某些观察值（行）。 我只想保留每个组中常见的观察结果。 我试着做一个最小的例子。 在那个例子中，我做了三个组。 在第一组中，缺少一个观察结果。 因此 output 我应该在每组中会有两个观察结果。

library(tidyverse)
## data_set
test_df<-data.frame(groups=c(1,1,1,2,2,2,3,3,3),date=as.Date(c("2000-01-01","2000-01-02","2000-01-03","2000-01-01","2000-01-02","2000-01-03","2000-01-01","2000-01-02","2000-01-03")),data=c(1,2,NA,3,4,5,6,7,8))

## required_output
## keeping data only with common dates
test_df_new<-test_df[c(1,2,4,5,7,8),]   

## groups 
test_df_new<-test_df%>%
        group_by()%>%

2 个解决方案

首先，我在数据列中找到了带有 NA 的日期：

test_df$date[is.na(test_df$data)]

然后我通过dplyr过滤：

test_df %>% filter(date != test_df$date[is.na(test_df$data)])

删除数据为 NA 的日期，然后在组中获取剩余日期的交集，然后过滤：

ix <- which(!is.na(test_df$data))
test_df[ test_df$date %in% 
           Reduce(intersect,
                  split(test_df$date[ ix ], test_df$groups[ ix ])), ]
#   groups       date data
# 1      1 2000-01-01    1
# 2      1 2000-01-02    2
# 4      2 2000-01-01    3
# 5      2 2000-01-02    4
# 7      3 2000-01-01    6
# 8      3 2000-01-02    7

根据所有先前保留的数据组 R for loop 过滤行

[英]filter rows based on all previous keeping groups of data R for loop

合并R中的两个DF并仅保留其中具有共同日期的行

[英]Combining two DFs in R and keeping only the rows which have a common date in it

将r中的数据框合并或联接到一些公共列，同时使所有行保持INTACT和顺序？

[英]Merge or join dataframe in r with some common columns while keeping all rows INTACT and order?

随机排列矩阵中的行，但在R中将组保持在一起

[英]Randomizing rows in a matrix but keeping groups together in R

将所有数字列除以一个公因数；每组不同的行有不同的因子

[英]Divide all numeric columns by a common factor; different factor per different groups of rows

调整 function 使其不是遍历所有行，而是仅遍历组内的所有行

[英]Adjust function so that it instead of it looping through all rows, it loops only through all rows within groups

只保留不间断的组

[英]keeping only non-breaking groups

只保留三个数据集中的公共列

[英]Keeping only common columns in three data sets

提取所有列组中共有的元素

[英]Extract elements common in all column groups

在一个条件下仅改变数据子集的值，同时保留所有数据行

[英]Mutating values for only a subset of the data under a condition, while keeping all data rows

暂无

暂无

声明:本站的技术帖子网页，遵循CC BY-SA 4.0协议，如果您需要转载，请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 根据所有先前保留的数据组 R for loop 过滤行合并R中的两个DF并仅保留其中具有共同日期的行将r中的数据框合并或联接到一些公共列，同时使所有行保持INTACT和顺序？随机排列矩阵中的行，但在R中将组保持在一起将所有数字列除以一个公因数；每组不同的行有不同的因子调整 function 使其不是遍历所有行，而是仅遍历组内的所有行只保留不间断的组只保留三个数据集中的公共列提取所有列组中共有的元素在一个条件下仅改变数据子集的值，同时保留所有数据行

相关标签

粤ICP备18138465号 © 2020-2024 STACKOOM.COM