[英]How to calculate rates between observations with R
考慮到我有一個按日期排序的數據框,並且對於每個數據框我都有一些數量,我該如何計算每一行的X天/X天-1指數?
我的數據集: https : //raw.githubusercontent.com/imdevskp/covid_19_jhu_data_web_scrap_and_cleaning/master/covid_19_clean_complete.csv
我的過程數據集(R 代碼):
library(tidyverse)
library(lubridate)
covid19 <- read.table(file = "covid_19_clean_complete.csv",
header = TRUE,
stringsAsFactors = FALSE,
sep = ",",
dec = ".",
quote = "\"")
covid19$Date <- mdy(covid19$Date)
brasil <- covid19 %>%
filter(Country.Region == "Brazil") %>%
group_by(Country.Region, Date) %>%
summarise(Cases = sum(Confirmed))
我的費率將根據Cases變量計算。
我們可以采用“案例”的lag
並用它來划分“案例”
library(dplyr)
out <- covid19 %>%
group_by(Country.Region, Date) %>%
summarise(Cases = sum(Confirmed)) %>%
mutate(Ratio = Cases/lag(Cases))
out %>%
filter(Country.Region == "Brazil") %>%
tail
# A tibble: 6 x 4
# Groups: Country.Region [1]
# Country.Region Date Cases Ratio
# <chr> <date> <int> <dbl>
#1 Brazil 2020-03-08 20 1.54
#2 Brazil 2020-03-09 25 1.25
#3 Brazil 2020-03-10 31 1.24
#4 Brazil 2020-03-11 38 1.23
#5 Brazil 2020-03-12 52 1.37
#6 Brazil 2020-03-13 151 2.90
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.