[英]How to aggregate data with the dyplr package in R
我想了解如何使用dplyr包編寫以下代碼:
averageStepsDayType <- aggregate(
NAreplacement$steps,
by=list(interval=NAreplacement$interval, dayType=NAreplacement$dayType),
mean
)
這是原始數據幀:
> head(NAreplacement)
steps date interval dayType
1 1.7169811 2012-10-01 0 weekday
2 0.3396226 2012-10-01 5 weekday
3 0.1320755 2012-10-01 10 weekday
4 0.1509434 2012-10-01 15 weekday
5 0.0754717 2012-10-01 20 weekday
6 2.0943396 2012-10-01 25 weekday
這是通過上面的代碼獲得的所需輸出:
> head(averageStepsDayType)
interval dayType x
1 0 weekday 2.25115304
2 5 weekday 0.44528302
3 10 weekday 0.17316562
4 15 weekday 0.19790356
5 20 weekday 0.09895178
6 25 weekday 1.59035639
這是我的嘗試:
byDate <- group_by(NAreplacement, date)
averageStepsDayType <- summarize(byDate, mean(steps))
> head(averageStepsDayType)
date x
1 2012-10-01 37.38260
2 2012-10-02 0.43750
3 2012-10-03 39.41667
4 2012-10-04 42.06944
5 2012-10-05 46.15972
6 2012-10-06 53.54167
有人可以給我一個例子嗎? 我想這應該是select和group_by函數之間的組合,但是我還沒有弄清楚該怎么做。
嘗試
library(dplyr)
NAreplacement %>%
group_by(interval, dayType) %>%
summarise(x=mean(steps))
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.