如何根据 R 中另一列的值计算一列中最常见的变量？

Question

I have a dataframe that looks like this:我有一个 dataframe，看起来像这样：

 plot sp value sum
1  A  1a  1    3   
2  A  1b  1    3
3  A  1a  1    3
4  B  1a  2    4
5  B  1a  2    4
6  C  1b  3    9
7  C  1b  3    9
8  C  1b  3    9

I calculated then the share of sum for each line and got this:然后我计算了每一行的sum份额并得到了这个：

 plot sp value sum share
1  A  1a  1    3    0.3
2  A  1b  1    3    0.3
3  A  1a  1    3    0.3
4  B  1a  2    4    0.5
5  B  1a  2    4    0.5
6  C  1b  3    9    0.3
7  C  1b  3    9    0.3
8  C  1b  3    9    0.3

I want to know now what is the most common sp for each plot based on the share .我现在想知道基于share的每个plot最常见的sp是什么。 In the case of this example I would like it to look like this:在这个例子中，我希望它看起来像这样：

 plot sp value sum share dom.sp
1  A  1a  1    3    0.3    1a
2  A  1b  1    3    0.3    1a
3  A  1a  1    3    0.3    1a
4  B  1a  2    4    0.5    1a
5  B  1a  2    4    0.5    1a
6  C  1b  3    9    0.3    1b
7  C  1b  3    9    0.3    1b
8  C  1b  3    9    0.3    1b

Answer 1

Very similar solution to your previous question与您之前的问题非常相似的解决方案

> ave(df$sp,df$plot,FUN=function(x){names(table(x))[1]})
[1] "1a" "1a" "1a" "1a" "1a" "1b" "1b" "1b"

Answer 2

If I understood correctly this might work:如果我理解正确，这可能会起作用：

library(dplyr)

df <-
structure(list(account = c("M205109", "M205109", "M201212", "M205668", 
"M207954", "M208966", "M203465", "M207622", "M201869", "M201869"
), age = c(20, 20, 18, 29, 21, 19, 19, 23, 22, 22)), class = "data.frame", row.names = c(NA, 
-10L))

df %>% 
  group_by(plot) %>% 
  mutate(dom.sp = relper::cat_mode(sp))

# A tibble: 8 x 5
# Groups:   plot [3]
  plot  sp      sum share dom.sp
  <chr> <chr> <dbl> <dbl> <chr> 
1 A     1a        3   0.3 1a    
2 A     1b        3   0.3 1a    
3 A     1a        3   0.3 1a    
4 B     1a        4   0.5 1a    
5 B     1a        4   0.5 1a    
6 C     1b        9   0.3 1b    
7 C     1b        9   0.3 1b    
8 C     1b        9   0.3 1b

I used a function called cat_mode that get the mode of a variable, here the package if you want to try:我使用了一个名为cat_mode的 function 来获取变量的模式，如果你想尝试，这里是 package：

remotes::install_github("vbfelix/relper")

如何根据 R 中另一列的值计算一列中最常见的变量？

问题描述

2 个解决方案

解决方案1
2 2022-11-30 14:29:55

解决方案2
2 2022-11-30 14:30:18

如何根据 R 中另一列的值计算一列中最常见的变量？

问题描述

2 个解决方案

解决方案1 2 2022-11-30 14:29:55

解决方案2 2 2022-11-30 14:30:18

解决方案1
2 2022-11-30 14:29:55

解决方案2
2 2022-11-30 14:30:18