簡體   English   中英

如何在R中使用dyplr軟件包聚合數據

[英]How to aggregate data with the dyplr package in R

我想了解如何使用dplyr包編寫以下代碼:

averageStepsDayType <- aggregate(
    NAreplacement$steps, 
    by=list(interval=NAreplacement$interval, dayType=NAreplacement$dayType), 
    mean
)

這是原始數據幀:

> head(NAreplacement)
      steps       date interval dayType
1 1.7169811 2012-10-01        0 weekday
2 0.3396226 2012-10-01        5 weekday
3 0.1320755 2012-10-01       10 weekday
4 0.1509434 2012-10-01       15 weekday
5 0.0754717 2012-10-01       20 weekday
6 2.0943396 2012-10-01       25 weekday

這是通過上面的代碼獲得的所需輸出:

> head(averageStepsDayType)
  interval dayType          x
1        0 weekday 2.25115304
2        5 weekday 0.44528302
3       10 weekday 0.17316562
4       15 weekday 0.19790356
5       20 weekday 0.09895178
6       25 weekday 1.59035639

這是我的嘗試:

byDate <- group_by(NAreplacement, date)
averageStepsDayType <- summarize(byDate, mean(steps))

> head(averageStepsDayType)
        date    x
1 2012-10-01    37.38260
2 2012-10-02     0.43750
3 2012-10-03    39.41667
4 2012-10-04    42.06944
5 2012-10-05    46.15972
6 2012-10-06    53.54167

有人可以給我一個例子嗎? 我想這應該是select和group_by函數之間的組合,但是我還沒有弄清楚該怎么做。

嘗試

library(dplyr)
NAreplacement %>% 
      group_by(interval, dayType) %>%
      summarise(x=mean(steps))

暫無
暫無

聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.

 
粵ICP備18138465號  © 2020-2024 STACKOOM.COM