Creating a new ranking column based on conditions in R

Question

I have a dataset that looks like this:

id  profile_id   company   product    price   
1      1           A        book      10.42
2      1           A        shirt     23.91
3      1           A        cup        5.95
4      2           B        book       7.99
5      2           B        shirt      5.95 
6      2           B        cup       11.76

I would like to create a new column, "rank", that shows the rank of the price per product, per company and per profile_id.

The output would look like this:

id  profile_id   company   product    price    rank 
1      1           A        book      10.42     2
2      1           A        shirt     23.91     3
3      1           A        cup        5.95     1
4      2           B        book       7.99     2
5      2           B        shirt      5.95     1
6      2           B        cup       11.76     3

I feel like this should be rather easy, but I can't really get this to work... Any help would be appreciated!

reproducible code:

df2 <- data.frame(id=c(1,2,3,4,5,6),
                  profile_id = c(1, 1, 1, 2, 2,2), 
                  company = c("A","A","A","B","B","B"), 
                  product = c("book", "shirt", "cup","book", "shirt", "cup"),
                  price = c(10.42, 23.91, 5.95, 7.99, 5.95, 11.76))

Answer 1

first group_by the "per company, profile_id" variables and then apply rank() :

library(dplyr)
df %>% group_by(company, profile_id) %>% mutate(rank = rank(price))

library(data.table)
df[,rank:=rank(price),by = .(company, profile_id)]

#     id profile_id company product price  rank
#1     1          1       A    book 10.42     2
#2     2          1       A   shirt 23.91     3
#3     3          1       A     cup  5.95     1
#4     4          2       B    book  7.99     2
#5     5          2       B   shirt  5.95     1
#6     6          2       B     cup 11.76     3

Answer 2

我们可以使用base R来做到这一点

df$rank <- with(df, ave(price, company, profile_id, FUN = rank))

Creating a new ranking column based on conditions in R

Question

2 answers

solution1
1 ACCPTED 2017-02-02 14:27:34

solution2
0 2017-02-03 01:42:56

Creating a new ranking column based on conditions in R

Question

2 answers

solution1 1 ACCPTED 2017-02-02 14:27:34

solution2 0 2017-02-03 01:42:56

solution1
1 ACCPTED 2017-02-02 14:27:34

solution2
0 2017-02-03 01:42:56