
How to create groups with multiple columns in a dataframe

How to create groups with multiple columns?

Bar plot: how do I group columns so that the bars are colored by group?

structure(list(DH105 = c(0.95238, 0.8922, 0.8232, 0.2323), 
               DH106 = c(0, 0.5327, 0.5337, 0.8232), 
               DH107 = c(0.2736, 0.2321, 0.7382, 0.8923), 
               DH108 = c(0.2332, 0, 0, 0.3213), 
               DH112 = c(0.0315, 0.2639, 0.0321, 0.2673), 
               DH113 = c(0.2372, 0.2871, 0.7222, 0)), 
row.names = c(NA, -4L), class = c("tbl_df", "tbl", "data.frame"))

I have a data frame with column names A, B, C, etc. I want to assign A and B to group 1 and C, D, and E to group 2, so that I can color the bars by group.

You can use the tidyverse to reshape the data into long format and then assign each column to a group:

library(tidyverse)

dat <- structure(list(A = c(0.9523, 0.06944, 
                     0.53061, 0.11111, 0.03125, 0.64794, 
                     0.10763, 0.02782, 0.0374149659863946, 0.8439), 
               B = c(0, 0.2378, 0.0068, 0.8328, 0.7292, 0.7539, 0.7439, 0.0742, 0.5272, 0.6822), 
               C = c(0.0273, 0.0901, 0.7778, 0.9462, 0.5327, 0.2744, 0.5327, 0.4262, 0.6821, 0.03125), 
               D = c(0.0297, 0, 0, 0.03462, 0.0272, 0.0325, 0.6282, 0.6282, 0.6329, 0.8925), 
               E = c(0.0325, 0.0829, 0.6328, 0.5237, 0.5722, 0.7283, 0.6382, 0.5637, 0.5632, 0.0532)), 
row.names = c(NA, -10L), class = c("tbl", "data.frame"))

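# Reshape to long format: one row per (column name, value) pair
dat_clean <- dat %>% 
  gather(key = "col", value = "value") %>% 
  # Label each original column with its group
  mutate(group = case_when(
    col %in% c("A", "B") ~ "group1",
    col %in% c("C", "D", "E") ~ "group2"
  ))

# Bars are filled by group; the values within each column are stacked
ggplot(dat_clean, aes(x = col, y = value, fill = group)) + 
  geom_bar(stat = "identity") + 
  theme(legend.position = "none")  # hides the legend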

Created on 2019-04-30 by the reprex package (v0.2.1)
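
For reference, the same idea can be sketched with `pivot_longer()`, which supersedes `gather()` in tidyr 1.0.0 and later, and with a named vector as the column-to-group lookup instead of `case_when()`. The small data frame and the name `group_lookup` below are made up for illustration; only the column names A to E and the two groups come from the question.

```r
library(dplyr)
library(tidyr)
library(ggplot2)

# Made-up values; only the column names follow the question.
dat <- data.frame(
  A = c(0.95, 0.07, 0.53),
  B = c(0.00, 0.24, 0.01),
  C = c(0.03, 0.09, 0.78),
  D = c(0.03, 0.00, 0.00),
  E = c(0.03, 0.08, 0.63)
)

# Hypothetical lookup: column name -> group label.
group_lookup <- c(A = "group1", B = "group1",
                  C = "group2", D = "group2", E = "group2")

# Long format: one row per (column name, value) pair,
# then look up the group for each column name.
dat_long <- dat %>%
  pivot_longer(cols = everything(), names_to = "col", values_to = "value") %>%
  mutate(group = unname(group_lookup[col]))

# geom_col() is equivalent to geom_bar(stat = "identity").
p <- ggplot(dat_long, aes(x = col, y = value, fill = group)) +
  geom_col()
print(p)
```

A lookup vector keeps the column-to-group mapping in one place, which may be easier to maintain than a long `case_when()` when there are many columns. A column missing from the lookup gets `NA` as its group.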
