(Base R) How to apply `diff()` to a column and append the result as a new data.frame column?

Question

I have a data frame df1 like this:

time                 Diamond.Hands  returns        volume  close
2021-02-16 10:00:00  0.4583333       0.0056710775  10059   53.20
2021-02-16 11:00:00  0.2352941      -0.0037586920   8664   53.01
2021-02-16 12:00:00  0.4400000      -0.0037586920  10059   52.40

# Log return
prices <- df1$close
log_returns <- diff(log(prices), lag=1)
df1$logreturns <- log_returns

This returns the following error (from a German-locale R session; in English: "replacement has 2219 rows, data has 2220"):

Fehler in `$<-.data.frame`(`*tmp*`, logreturns, value = c(0.000187952260679136,  :
  Ersetzung hat 2219 Zeilen, Daten haben 2220

Do you have any ideas how to fix that?

Answer 1

When you do

y <- diff(x, lag = m, differences = k)

the resulting vector y has m * k fewer elements than x. If you want to have both x and y as data.frame/matrix columns, you need to pad y with m * k leading NAs so that the two have the same length.
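The length relation can be checked on a small vector. This is only a sketch: the values in x and the choices m = 2, k = 3 are made up for illustration and do not come from the question.

```r
# Made-up input vector with n = 8 elements (illustrative, not from the question)
x <- c(2, 3, 5, 8, 13, 21, 34, 55)
m <- 2  # lag
k <- 3  # order of differencing

y <- diff(x, lag = m, differences = k)

# y is shorter than x by m * k elements
length(x) - length(y) == m * k

# Pad with m * k leading NAs so y lines up with x
y_padded <- c(rep(NA, m * k), y)
length(y_padded) == length(x)
```

Both comparisons should evaluate to TRUE. Note that this holds as long as m * k is smaller than length(x); otherwise diff() returns an empty vector.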

In your case, m = 1 and k = 1, so you need to pad one NA:

df1$logreturns <- c(NA, log_returns)

More concisely, we can pack your 3 lines of code into 1:

df1$logreturns <- c(NA, diff(log(df1$close)))
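For a quick check, the one-liner can be run on a small stand-in for df1. The data frame below is an assumption: it is rebuilt from only the three rows shown in the question (time and close columns), not the full 2220-row data.

```r
# Stand-in for df1, built from the three rows shown in the question
df1 <- data.frame(
  time  = as.POSIXct(c("2021-02-16 10:00:00",
                       "2021-02-16 11:00:00",
                       "2021-02-16 12:00:00"), tz = "UTC"),
  close = c(53.20, 53.01, 52.40)
)

# Log returns, padded with one leading NA to match nrow(df1)
df1$logreturns <- c(NA, diff(log(df1$close)))

df1
```

The first row has no previous price, so its log return is NA; each later row holds log(close[i] / close[i - 1]).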

Remark:

If you want to know how to do mutate() + diff() in dplyr, then maybe something like:

df1 %>% mutate(logreturns = c(NA, diff(log(close))))

Here is another possibly related Q & A: Error when using "diff" function inside of dplyr mutate.
