根據R中的條件查詢和匯總數據

Question

我有一個數據框，如果type a值等於1，我希望得到每年type b的所有值的均值。

Year  type   value1   value2  value3  value4  value5
1     a       1        1        2       3       4
1     b       10       12       9       8       10
2     a       1        2        2       2       1
2     b       11       10       13      9       14

所以我的最終產品看起來像這樣：

Year  type_b_values
1      11
2      12.5

這是Year1的value1和value2的平均值，以及Year1的value1和5的Year2 。 謝謝！

Answer 1

這是一種使用基本函數的方法。 我猜plyr或reshape也可能是有用的包，但我對它們不太熟悉：

dat <- read.table(text="Year  type   value1   value2  value3  value4  value5
1     a       1        1        2       3       4
1     b       10       12       9       8       10
2     a       1        2        2       2       1
2     b       11       10       13      9       14", header=TRUE)


dat_split <- split(dat, dat$Year)       # split our data into a list by year

output <- sapply(dat_split, function(x) {
    y <- x[x$type == "a", -c(1:2)] == 1 # which a in that year = 1
    z <- x[x$type == "b", -c(1:2)][y]   # grab the b values that a = 1
    if (sum(y) == 0) {                  # eliminate if no a = 1
        return(NA)
    }
    mean(z)
})

data.frame(Year = names(output), type_b_values = output)

## > data.frame(Year = names(output), type_b_values = output)
##   Year type_b_values
## 1    1          11.0
## 2    2          12.5

Answer 2

這是使用plyr的版本：

library(plyr)
ddply(dat, "Year", function(x) {
  values.cols <- grep("value", names(x), value = TRUE)
  a <- subset(x, type == "a", values.cols)
  b <- subset(x, type == "b", values.cols)  
  c("type_b_values" = mean(b[a == 1]))
})

#   Year type_b_values
# 1    1          11.0
# 2    2          12.5

根據R中的條件查詢和匯總數據

問題描述

2 個解決方案

解決方案1
3 已采納 2013-05-09 23:17:32

解決方案2
1 2013-05-09 23:43:42

根據R中的條件查詢和匯總數據

問題描述

2 個解決方案

解決方案1 3 已采納 2013-05-09 23:17:32

解決方案2 1 2013-05-09 23:43:42

解決方案1
3 已采納 2013-05-09 23:17:32

解決方案2
1 2013-05-09 23:43:42