[英]How to permute list of data.frame and create its annotated stack bar plot in ggplot2?
I have list of data.frame that needed to be permuted first and create annotated stack bar plot for each data.frame. 我有需要首先排列的data.frame列表,并为每个data.frame创建带注释的堆栈条形图。 I have searched related post in SO and got some idea how to do it.
我已经在SO中搜索了相关文章,并知道了一些方法。 However, I gave my shot how to make this as I expected, but my code is slow if data.frame is rather big, while getting stack bar plot is not desired.
但是,我按照我的预期进行了拍摄,但是如果data.frame很大,我的代码将很慢,并且不需要获取堆栈条形图。 So I am continously reading ggplot2' vignette to get solution.
因此,我不断阅读ggplot2的插图,以获取解决方案。 I am stucked with permuting data.frame list in desired way.
我被困在以所需的方式排列data.frame列表。 How can I permute list of data.frame and create its annotated stack bar plot ?
如何排列data.frame列表并创建其带注释的堆栈条形图? Can anyone give me idea how to do this easily and efficiently ?
谁能告诉我如何轻松有效地做到这一点? How to manipulate list of data.frame and get annotated stacked bar plot(number of observation, label) ?
如何操作data.frame列表并获取带注释的堆叠条形图(观察数,标签)? Thanks in advance
提前致谢
reproducible data.frame : 可复制的data.frame:
confirmedDF <- list(
bar = data.frame(begin=seq(2, by=11, len=25), end=seq(8, by=11, len=25), score=sample(54,25)),
cat = data.frame(begin=seq(5, by=8, len=35), end=seq(9, by=8, len=35), score=sample(45,35)),
foo = data.frame(begin=seq(8, by=13, len=25), end=seq(17, by=13, len=25), score=sample(49,25))
)
discardedDF <- list(
bar = data.frame(begin=seq(3, by=12, len=40), end=seq(8, by=12, len=40), score=sample(72,40)),
cat = data.frame(begin=seq(9, by=15, len=50), end=seq(17, by=15, len=50), score=sample(60,50)),
foo = data.frame(begin=seq(21, by=19, len=30), end=seq(32, by=19, len=30), score=sample(42,30))
)
then my input list of data.frame : 然后是我的data.frame输入列表:
library(tidyverse)
library(magrittr)
names(confirmedDF) <- paste("confirmed", names(confirmedDF), sep = ".")
names(discardedDF) <- paste("discarded", names(discardedDF), sep = ".")
merged <- do.call(rbind, c(confirmedDF, discardedDF))
merged %<>% rownames_to_column(var = "cn")
merged %<>% separate(cn, c("list", "letters", "seq"), sep = "\\.")
merged %<>% mutate(stringency = ifelse(score >= 12, "Stringent", "Weak"))
res <- merged %>% split(list(.$letters, .$stringency, .$list))
my attempt to get each individual stack bar plot, trivial code as follows : 我试图获取每个单独的堆栈图,平凡的代码如下:
library(dplyr)
library(ggplot2)
lapply(res, function(ele_) {
plot_data <- ele_ %>%
group_by(sample, stringency, list) %>%
tally %>%
group_by(sample, stringency) %>%
mutate(percentage = n / sum(n), cumsum = cumsum(percentage))
ggplot(data = plot_data, aes(x = sample, y= n ,fill = stringency)) +
geom_bar(position = "dodge",stat = "identity")
})
I don't understand using lapply to get each bar plot is quite slow and inefficient. 我不明白使用lapply获取每个条形图是相当缓慢且效率低下的。 above trivial code didn't give my desired bar plot.
上面的琐碎代码没有给出我想要的条形图。 How can I optimize the code?
如何优化代码? How to permute list of data.frame and getting its annotated bar plot ?
如何置换data.frame列表并获取带注释的条形图?
How can I achieve my desired output stack bar plot ? 如何获得所需的输出堆栈条形图? Any idea ?
任何想法 ?
You could try this: 您可以尝试以下方法:
res %>%
bind_rows %>%
group_by(stringency, list, sample) %>%
tally %>%
ungroup %>%
setNames(c("var", "val", "sample", "n")) %>%
{bind_rows(., setNames(., c("val", "var", "sample", "n")))} %>%
ggplot(aes(x=var, y=n, fill=val)) +
geom_col() +
geom_text(aes(label=n), position=position_stack(vjust = 0.5)) +
facet_wrap(~sample)
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.