計算另一個因子中的因子水平 - R

Question

假設我們有一個包含數百個汽車觀察的數據框。 每個觀察可以按 model、品牌和國家分組。

我們如何計算每個國家生產了多少型號的汽車？

我試過了：

janitor::tabyl(data,  coutry, model)

但是我得到每個國家的 model 的每個觀察量。 但是，我正在尋找每個國家/地區的特定國家/地區的模型數量和列表。

例子：

Country  n    model
Italy    4    Punto, Panda, Mito, Giulietta
Germany  3    Polo, Golf, X5

Answer 1

你可以在 dplyr

df <- data.frame(Country = c("Italy","Italy",
                             "Italy","Italy","Germany","Germany","Germany"),
                   Model = c("Punto","Panda",
                             "Mito","Mito","Polo","Golf","Golf")
      )

library(dplyr)
df %>% group_by(Country) %>%
  summarise(n = n_distinct(Model),
            model = toString(unique(Model)), .groups = 'drop')

# A tibble: 2 x 3
  Country     n model             
  <chr>   <int> <chr>             
1 Germany     2 Polo, Golf        
2 Italy       3 Punto, Panda, Mito

^{由reprex package (v2.0.0) 於 2021 年 5 月 6 日創建}

計算另一個因子中的因子水平 - R

問題描述

1 個解決方案

解決方案1
2 已采納 2021-05-06 08:48:10

計算另一個因子中的因子水平 - R

問題描述

1 個解決方案

解決方案1 2 已采納 2021-05-06 08:48:10

解決方案1
2 已采納 2021-05-06 08:48:10