如何使用 dplyr 查找列在一行中的排名？

Question

Say I have a dataframe that looks like this:假设我有一个如下所示的数据框：

dat <- data.frame(
  Iowa = c(11, 12, 15),
  Wisconsin = c(10, 14, 12),
  Florida = c(14, 9, 11)
)

I want to get the rowwise rank for Iowa relative to the other columns.我想获得爱荷华州相对于其他列的行列排名。 So the output would look like this:所以输出看起来像这样：

out <- data.frame(
  Iowa = c(11, 12, 15),
  Wisconsin = c(10, 14, 12),
  Florida = c(14, 9, 11),
  IowaRank = c(2, 2, 1)
)

What's the best way to achieve this using R, ideally in a dplyr pipe?使用 R 实现这一目标的最佳方法是什么，最好是在dplyr管道中？

Answer 1

Easiest will be apply to loop over the rows, apply the rank extract the first row最简单的将apply循环行，应用rank提取第一行

dat$IowaRank <- apply(-dat, 1, rank)[1,]

-output -输出

dat
  Iowa Wisconsin Florida IowaRank
1   11        10      14        2
2   12        14       9        2
3   15        12      11        1

Or using rowRanks from matrixStats或使用rowRanks从matrixStats

library(matrixStats)
dat$IowaRank <- rowRanks(-as.matrix(dat))[,1]

Or with dplyr或者用dplyr

library(dplyr)
dat %>%
    rowwise %>%
    mutate(IowaRank = rank(-c_across(everything()))[1]) %>%
    ungroup
# A tibble: 3 x 4
   Iowa Wisconsin Florida IowaRank
  <dbl>     <dbl>   <dbl>    <dbl>
1    11        10      14        2
2    12        14       9        2
3    15        12      11        1

Answer 2

You can get the ranks for all the columns using dense_rank -您可以使用dense_rank获取所有列的dense_rank -

library(dplyr)
library(tidyr)

dat %>%
  mutate(row = row_number()) %>%
  pivot_longer(cols = -row) %>%
  group_by(row) %>%
  mutate(rank = dense_rank(-value)) %>%
  ungroup %>%
  pivot_wider(names_from = name, values_from = c(value, rank)) %>%
  select(-row)

#  value_Iowa value_Wisconsin value_Florida rank_Iowa rank_Wisconsin rank_Florida
#       <dbl>           <dbl>         <dbl>     <int>          <int>        <int>
#1         11              10            14         2              3            1
#2         12              14             9         2              1            3
#3         15              12            11         1              2            3

If you are interested only in rank of Iowa using rowwise you can do -如果您只对使用rowwise的爱荷华州排名感兴趣，您可以这样做 -

dat %>%
  rowwise() %>%
  mutate(IowaRank = dense_rank(-c_across())[1])

#   Iowa Wisconsin Florida IowaRank
#  <dbl>     <dbl>   <dbl>    <int>
#1    11        10      14        2
#2    12        14       9        2
#3    15        12      11        1

如何使用 dplyr 查找列在一行中的排名？

问题描述

2 个解决方案

解决方案1
3 已采纳 2021-07-24 23:02:55

解决方案2
1 2021-07-25 01:38:18

如何使用 dplyr 查找列在一行中的排名？

问题描述

2 个解决方案

解决方案1 3 已采纳 2021-07-24 23:02:55

解决方案2 1 2021-07-25 01:38:18

解决方案1
3 已采纳 2021-07-24 23:02:55

解决方案2
1 2021-07-25 01:38:18