R: I am trying to subset my data frame into decades, so I want to subset by the values of a column

Question

I have a year column running from 1921 to 2020. I want to do an analysis by decade, so I want to subset the data frame into decades. I tried a couple of approaches, but they keep giving errors.

decade1=data_all%>%filter(data_all$year%>%1920:1929)

Error: Problem with filter() input ..1.
x 3 arguments passed to ':' which requires 2
ℹ Input ..1 is data_all$year %>% 1920:1929.
Run rlang::last_error() to see where the error occurred.

decade1=data_all%>%filter(data_all$year==1920:1929)

Warning message:
In data_all$year == 1920:1929 :
  longer object length is not a multiple of shorter object length

What code should I be using?

Answer 1

We can change the syntax to use %in% within filter:

library(dplyr)
data_all %>%
  filter(year %in% 1920:1929)
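
To illustrate why the `==` attempt in the question warns while `%in%` works, here is a minimal sketch. The toy `data_all` below is a hypothetical stand-in for the asker's data, and `between()` is shown only as one possible alternative, not as part of the original answer.

```r
library(dplyr)

# Hypothetical toy data standing in for data_all (not from the original post)
data_all <- data.frame(year = c(1921, 1925, 1929, 1930, 1955, 2020))

# `==` compares position by position and recycles the shorter vector,
# so a year only matches if it happens to line up with the same value
wrong <- suppressWarnings(data_all$year == 1920:1929)

# `%in%` asks, for each year, whether it appears anywhere in 1920:1929
decade1 <- data_all %>% filter(year %in% 1920:1929)

# Alternative range test with dplyr::between (both bounds inclusive)
decade1_alt <- data_all %>% filter(between(year, 1920, 1929))
```

Inside `filter()` the column can be referred to as `year` directly, so `data_all$year` is not needed.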

Answer 2

It may help to group the years first.

library(dplyr)

df <- data.frame(
    year = sample(1920:2020, 50, replace = TRUE)
)

# Label each year with its decade; cut() intervals are (1909,1919], (1919,1929], ...
df %>% 
  mutate(decade = cut(year, breaks = seq(1909, 2029, by = 10),
                      labels = paste0(seq(1910, 2020, by = 10), "s"))) %>%
  arrange(year)
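
Once each row carries a decade label, a per-decade analysis can be run without creating separate subsets. The following is a rough sketch under assumptions: the seed and the summary columns `n`, `first_year`, and `last_year` are illustrative choices, not part of the original answer.

```r
library(dplyr)

set.seed(1)  # assumption: fixed seed only so the sketch is reproducible
df <- data.frame(year = sample(1920:2020, 50, replace = TRUE))

by_decade <- df %>%
  # 12 breaks give 11 intervals: (1919,1929], (1929,1939], ..., (2019,2029]
  mutate(decade = cut(year,
                      breaks = seq(1919, 2029, by = 10),
                      labels = paste0(seq(1920, 2020, by = 10), "s"))) %>%
  group_by(decade) %>%
  # Illustrative summaries: row count and observed year range per decade
  summarise(n = n(), first_year = min(year), last_year = max(year))

by_decade
```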

Answer 3

My solution to the exercise was quite similar to yours:

Create a new feature, 'decade' (base 10: integer-divide the year by 10, then multiply by 10):

decades <- seq(1890,2010, by=10)

data$decade <- as.factor(data$Year %/% 10 * 10)

print(data$decade)
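
If separate data frames per decade are still wanted, the same integer-division trick can feed base R's `split()`. This is a sketch only: the toy `data` below is hypothetical, since the answer's actual data is not shown.

```r
# Hypothetical toy data (the original `data` object is not shown in the post)
data <- data.frame(Year = c(1921, 1928, 1930, 1999, 2000, 2020))

# %/% binds tighter than *, so this is (Year %/% 10) * 10, e.g. 1928 -> 1920
data$decade <- data$Year %/% 10 * 10

# One data frame per decade, stored in a list named by decade start year
decade_list <- split(data, data$decade)

# Pull out a single decade by name
decade_1920s <- decade_list[["1920"]]
```

Keeping the pieces in one named list avoids creating a separate variable such as `decade1`, `decade2`, and so on for each decade.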


Answer 1
1 2020-12-05 16:46:29

Answer 2
0 2020-12-06 04:37:00

Answer 3
0 2020-12-06 23:10:23
