使用 dplyr 减去几个不同列的最有效方法是什么

Question

I have a dataset like this:我有一个这样的数据集：

data.frame(x = c(1:5), y = c(0:4), z = c(2:6))

  x y z
1 1 0 2
2 2 1 3
3 3 2 4
4 4 3 5
5 5 4 6

I would like to get a dataset like this:我想得到这样的数据集：

  x y z y-x z-y
1 1 0 2  -1   2
2 2 1 3  -1   2
3 3 2 4  -1   2
4 4 3 5  -1   2
5 5 4 6  -1   2

when I use:当我使用：

a <- a %>% mutate(across((x:z), ~. - lag(.)))

I get:我得到：

That is, the mutate is subtracting in the same column and I needed to subtract in different columns.也就是说，变异是在同一列中减去，我需要在不同的列中减去。 How can I resolve this?我该如何解决这个问题？

Answer 1

I wouldn't use dplyr for this.我不会为此使用dplyr 。 I would use base R directly:我会直接使用 base R：

diff_cols = your_data[-1] - your_data[-ncol(your_data)]
names(diff_cols) = paste0(
  names(your_data)[-1],
  "-",
  names(your_data)[-ncol(your_data)]
)
cbind(your_data, diff_cols)
#   x y z y-x z-y
# 1 1 0 2  -1   2
# 2 2 1 3  -1   2
# 3 3 2 4  -1   2
# 4 4 3 5  -1   2
# 5 5 4 6  -1   2

Answer 2

Using dplyr you could do this:使用dplyr你可以这样做：

library(dplyr, warn.conflicts = FALSE)


df1 <- data.frame(x = c(1:5), y = c(0:4), z = c(2:6))

df1 |> 
  mutate(`y-x` = y - x,
         `z-y` = z - y)
#> # A tibble: 5 × 5
#> # Rowwise: 
#>       x     y     z `y-x` `z-y`
#>   <int> <int> <int> <int> <int>
#> 1     1     0     2    -1     2
#> 2     2     1     3    -1     2
#> 3     3     2     4    -1     2
#> 4     4     3     5    -1     2
#> 5     5     4     6    -1     2

^{Created on 2022-12-27 with reprex v2.0.2}^{创建于 2022-12-27，使用reprex v2.0.2}

Answer 3

You could use something like你可以使用类似的东西

library(dplyr)

df %>% 
  mutate(across(x:y, 
                ~. - df[[names(df)[which(names(df) == cur_column()) + 1]]],
                .names = "{.col}-{names(df)[which(names(df) == .col) + 1]}")
         )

This returns这返回

  x y z x-y y-z
1 1 0 2   1  -2
2 2 1 3   1  -2
3 3 2 4   1  -2
4 4 3 5   1  -2
5 5 4 6   1  -2
Warning message:
Problem while computing `..1 = across(...)`.
ℹ longer object length is not a multiple of shorter object length

but casts a warning which I can't remove.但发出了一个我无法删除的警告。

Answer 4

Here's a tidyr::pivot_longer + dplyr approach.这是 tidyr::pivot_longer + dplyr 方法。 The same code should work for any number of columns.相同的代码应该适用于任意数量的列。

df1 <- data.frame(x = c(1:5), y = c(0:4), z = c(2:6)) %>%
  mutate(row = row_number()) %>%
  pivot_longer(-row)

bind_rows(df1, 
  df1 %>%
    group_by(row) %>%
    mutate(name = paste0(name, "-", lag(name)), value = value - lag(value)) %>%
    ungroup() %>% filter(!is.na(value))) %>%
  pivot_wider(names_from = name, values_from = value)

Result结果

# A tibble: 5 × 6
    row     x     y     z `y-x` `z-y`
  <int> <int> <int> <int> <int> <int>
1     1     1     0     2    -1     2
2     2     2     1     3    -1     2
3     3     3     2     4    -1     2
4     4     4     3     5    -1     2
5     5     5     4     6    -1     2

Answer 5

We may use across2我们可以使用across2

library(dplyover)
a %>% 
  mutate(across2(y:z, x:y, `-`))
  x y z y_x z_y
1 1 0 2  -1   2
2 2 1 3  -1   2
3 3 2 4  -1   2
4 4 3 5  -1   2
5 5 4 6  -1   2

If the column name should be - instead of _ ,如果列名应该是-而不是_ ，

a %>% 
  mutate(across2(y:z, x:y, `-`, .names = "{xcol}-{ycol}"))
  x y z y-x z-y
1 1 0 2  -1   2
2 2 1 3  -1   2
3 3 2 4  -1   2
4 4 3 5  -1   2
5 5 4 6  -1   2

Or with dplyr using two across或者dplyr使用两个across

library(dplyr)
 a %>%
  mutate(across(y:z, .names = "{.col}-{names(a)[match(.col, names(a))-1]}") -
       across(x:y))

-output -输出

  x y z y-x z-y
1 1 0 2  -1   2
2 2 1 3  -1   2
3 3 2 4  -1   2
4 4 3 5  -1   2
5 5 4 6  -1   2

使用 dplyr 减去几个不同列的最有效方法是什么

问题描述

5 个解决方案

解决方案1
2 2022-12-27 16:33:59

解决方案2
1 2022-12-27 17:00:13

解决方案3
1 2022-12-27 17:29:36

解决方案4
0 2022-12-27 18:33:08

解决方案5
0 2022-12-27 18:56:47

使用 dplyr 减去几个不同列的最有效方法是什么

问题描述

5 个解决方案

解决方案1 2 2022-12-27 16:33:59

解决方案2 1 2022-12-27 17:00:13

解决方案3 1 2022-12-27 17:29:36

解决方案4 0 2022-12-27 18:33:08

解决方案5 0 2022-12-27 18:56:47

解决方案1
2 2022-12-27 16:33:59

解决方案2
1 2022-12-27 17:00:13

解决方案3
1 2022-12-27 17:29:36

解决方案4
0 2022-12-27 18:33:08

解决方案5
0 2022-12-27 18:56:47