R: how to fill in NA using a rule

Question

data=data.frame(person=c(1,1,1,2,2,2,2,3,3,3,3),
t=c(3,NA,9,4,7,NA,13,3,NA,NA,12),
WANT=c(3,6,9,4,7,10,13,3,6,9,12))

I want to create a new variable 'WANT' that keeps every observed value of t and replaces each NA with the previous value plus 3. If there are several NAs in a row, it keeps adding 3 to the previously filled value. My attempt is:

library(dplyr)
data %>% 
  group_by(person) %>% 
  mutate(WANT_TRY = fill(t) + 3)

Answer 1

Here's one way:

data %>% 
  group_by(person) %>%
  mutate(
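    # cs = cumsum(!is.na(t)), # creates index for reference value; uncomment if interested
    w = case_when(
      # sequence(rle(...)$lengths) gives the position within each run of NA;
      # t[!is.na(t)][cumsum(!is.na(t))] is the last non-NA value seen so far
      # (assumes each person's first t is not NA)
      is.na(t) ~ t[!is.na(t)][cumsum(!is.na(t))] + 3*sequence(rle(is.na(t))$lengths),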
      TRUE ~ t
      )
  ) %>% 
  ungroup()

# A tibble: 11 x 4
   person     t  WANT     w
    <dbl> <dbl> <dbl> <dbl>
 1      1     3     3     3
 2      1    NA     6     6
 3      1     9     9     9
 4      2     4     4     4
 5      2     7     7     7
 6      2    NA    10    10
 7      2    13    13    13
 8      3     3     3     3
 9      3    NA     6     6
10      3    NA     9     9
11      3    12    12    12
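To see how the indexing works, it can help to look at the helper vectors for a single group. The following is a minimal base R sketch of the same idea for person 3; the names `t3`, `run_pos`, `seen`, and `last_seen` are only for illustration and do not appear in the answer.

```r
# t values for person 3 in the example data
t3 <- c(3, NA, NA, 12)

# position within each run of identical is.na() values: 1, 1, 2, 1
run_pos <- sequence(rle(is.na(t3))$lengths)

# number of non-NA values seen so far: 1, 1, 1, 2
seen <- cumsum(!is.na(t3))

# last non-NA value at each position: 3, 3, 3, 12
last_seen <- t3[!is.na(t3)][seen]

# fill each NA with the last seen value plus 3 per step into the NA run
filled <- ifelse(is.na(t3), last_seen + 3 * run_pos, t3)
filled
```

This sketch assumes the vector starts with a non-NA value; a leading NA would make `seen` zero and shorten `last_seen`.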

Answer 2

Here is another way. We can do linear interpolation with the imputeTS package.

library(dplyr)
library(imputeTS)

data2 <- data %>%
  group_by(person) %>%
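  # interpolate t, the column that contains the NAs;
  # na_interpolation() is the imputeTS >= 3.0 name for na.interpolation()
  mutate(WANT2 = na_interpolation(t)) %>%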
  ungroup()

data2
# # A tibble: 11 x 4
#    person     t  WANT WANT2
#     <dbl> <dbl> <dbl> <dbl>
#  1      1     3     3     3
#  2      1    NA     6     6
#  3      1     9     9     9
#  4      2     4     4     4
#  5      2     7     7     7
#  6      2    NA    10    10
#  7      2    13    13    13
#  8      3     3     3     3
#  9      3    NA     6     6
# 10      3    NA     9     9
# 11      3    12    12    12
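Linear interpolation and "previous value plus 3" are not the same rule. They agree on this data only because the observed values around each gap happen to differ by exactly 3 per step. A small base R sketch of the difference, using `approx()` as a stand-in for the package function (the helper name `interp` is hypothetical):

```r
# linear interpolation over the element index with base R's approx();
# NA entries are dropped from the known points and then filled in
interp <- function(x) approx(seq_along(x), x, xout = seq_along(x))$y

even_gap   <- interp(c(3, NA, NA, 12))  # known values 9 apart over 3 steps
uneven_gap <- interp(c(3, NA, 10))      # known values 7 apart over 2 steps

even_gap    # matches "previous + 3"
uneven_gap  # the middle value is the midpoint 6.5, not 3 + 3 = 6
```

If the rule really is "add 3 to the previous value", interpolation is only safe when the spacing of the observed values is known to match.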

Answer 3

This is harder than it seems because of the double NA near the end. If it weren't for that, then the following:

ifelse(is.na(data$t), c(0, data$t[-nrow(data)])+3, data$t)

...would give you what you want. The simplest way, which uses the same logic but doesn't look very clever (sorry!), would be:

.impute <- function(x) ifelse(is.na(x), c(0, x[-length(x)])+3, x)
.impute(.impute(data$t))

...which just cheats by doing it twice. Does that help?
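Applying the function twice covers runs of at most two NAs. One possible generalization, sketched here under the assumption that the fill should stay within each person, is to repeat the pass until nothing changes and apply it per group with `ave()`. The names `impute_once` and `impute_all` are hypothetical, and unlike the snippet above this version leaves a leading NA as NA instead of turning it into 3.

```r
data <- data.frame(
  person = c(1, 1, 1, 2, 2, 2, 2, 3, 3, 3, 3),
  t      = c(3, NA, 9, 4, 7, NA, 13, 3, NA, NA, 12)
)

# one pass: replace each NA by the previous element plus 3
# (a leading NA has no previous element and stays NA)
impute_once <- function(x) ifelse(is.na(x), c(NA, x[-length(x)]) + 3, x)

# repeat the pass until it changes nothing, so NA runs of any length are filled
impute_all <- function(x) {
  repeat {
    y <- impute_once(x)
    if (identical(y, x)) return(y)
    x <- y
  }
}

# ave() applies impute_all separately within each person
data$WANT_TRY <- ave(data$t, data$person, FUN = impute_all)
data$WANT_TRY
```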

Answer 4

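You can use functional programming from purrr and "NA-safe" addition from hablar. Note that the lambda below uses only the accumulated value `.x`, so within each person it produces t[1], t[1] + 3, t[1] + 6, and so on; this matches WANT here because the observed values are themselves spaced 3 apart: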

library(hablar)
library(dplyr)
library(purrr)

data %>% 
  group_by(person) %>% 
  mutate(WANT2 = accumulate(t, ~.x %plus_% 3))

Result

# A tibble: 11 x 4
# Groups:   person [3]
   person     t  WANT WANT2
    <dbl> <dbl> <dbl> <dbl>
 1      1     3     3     3
 2      1    NA     6     6
 3      1     9     9     9
 4      2     4     4     4
 5      2     7     7     7
 6      2    NA    10    10
 7      2    13    13    13
 8      3     3     3     3
 9      3    NA     6     6
10      3    NA     9     9
11      3    12    12    12
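If observed values should be kept even when they are not spaced 3 apart, the accumulating function needs to look at the current element as well. A minimal base R sketch with `Reduce()`, where the name `fill_plus3` is hypothetical; the same function body could presumably be passed to `accumulate()` as `~ if (is.na(.y)) .x + 3 else .y`.

```r
# keep an observed value; otherwise use the previous result plus 3
fill_plus3 <- function(x) {
  Reduce(function(prev, cur) if (is.na(cur)) prev + 3 else cur,
         x, accumulate = TRUE)
}

spaced   <- fill_plus3(c(3, NA, NA, 12))  # observed values spaced 3 apart
unspaced <- fill_plus3(c(3, NA, 10))      # the observed 10 is kept as-is

spaced
unspaced
```

Within a grouped pipeline this could be used as `mutate(WANT2 = fill_plus3(t))`.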

4 answers

Answer 1
1 2019-06-17 23:02:52

Answer 2
1 2019-06-18 00:32:22

Answer 3
0 2019-06-18 01:16:46

Answer 4
0 2019-06-18 07:53:02
