How to melt data from wide format to long format using similar column names in R?

I recently scraped some data from a website, and it looks like the data table in the input variable below.

input <- data.frame(
     "Date" = sprintf("%02d-Jan", 1:15),
     "Type_event_1" =  c(rep("Skiing", 3), rep("Marathon", 7), rep("Skating", 5)),
     "sport_event_1"= c(rep("Alpine skiing",4), rep("Biathlon",6), rep("Curling",3), rep("Figure skating",2)),
     "Type_event_2" =  c(rep("Skiing", 4), rep("Marathon", 6),rep("Ice-Hockey", 3), rep("Skating", 2)),
     "sport_event_2"= c(rep("Skeleton",4), rep("Luge",6), rep("Hockey",3), rep("Ski Jumping",2))
     )

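I want to rbind the columns that share a common suffix ("event_1", "event_2") one below the other, together with the "Date" column. Here I have only 4 such columns, i.e. 2 events, but what if I had 40 columns, i.e. 20 events? How could I do this with a for loop?
The expected output looks like this: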

expected_output <- data.frame(
  "Date" = rep(sprintf("%02d-Jan", 1:15),2),
  "Type_event_1" =  c(rep("Skiing", 3), rep("Marathon", 7), rep("Skating", 5),rep("Skiing", 4), rep("Marathon", 6),rep("Ice-Hockey", 3), rep("Skating", 2)),
  "sport_event_1"= c(rep("Alpine skiing",4), rep("Biathlon",6), rep("Curling",3), rep("Figure skating",2),rep("Skeleton",4), rep("Luge",6), rep("Hockey",3), rep("Ski Jumping",2))
)

Try:

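library(data.table)
# data.table's melt() expects a data.table, so convert the data.frame first
dt <- as.data.table(input)
# Melt the Type_event_* and sport_event_* columns separately; keep only Date and value
out1 <- melt(dt[, c(1, grep("Type_event_", names(dt))), with = FALSE], id.vars = "Date")[, c(1, 3)]
out2 <- melt(dt[, c(1, grep("sport_event_", names(dt))), with = FALSE], id.vars = "Date")[, c(1, 3)]
# Both results share the same row order, so the value columns can be bound side by side
final <- cbind(out1, out2[, -1])
names(final) <- c("Date", "Type_event", "sport_event")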
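
For the 20-event case raised in the question, one possible alternative is to let melt() pick up both column families in a single call through data.table's patterns() helper. The following is only a sketch: it assumes every column follows the Type_event_N / sport_event_N naming and that the two families appear in the same event order in the data, since columns are paired by position. The names long and event are illustrative and not part of the original answer.

```r
library(data.table)

# Same `input` as in the question, rebuilt here so the block runs on its own
input <- data.frame(
  Date = sprintf("%02d-Jan", 1:15),
  Type_event_1 = c(rep("Skiing", 3), rep("Marathon", 7), rep("Skating", 5)),
  sport_event_1 = c(rep("Alpine skiing", 4), rep("Biathlon", 6),
                    rep("Curling", 3), rep("Figure skating", 2)),
  Type_event_2 = c(rep("Skiing", 4), rep("Marathon", 6),
                   rep("Ice-Hockey", 3), rep("Skating", 2)),
  sport_event_2 = c(rep("Skeleton", 4), rep("Luge", 6),
                    rep("Hockey", 3), rep("Ski Jumping", 2)),
  stringsAsFactors = FALSE
)

# Each pattern selects one family of columns; each family becomes one value column
long <- melt(
  as.data.table(input),
  id.vars = "Date",
  measure.vars = patterns("^Type_event_", "^sport_event_"),
  variable.name = "event",  # illustrative name for the event index column
  value.name = c("Type_event", "sport_event")
)

# Drop the event index to get the three columns asked for in the question
long[, event := NULL]
```

No loop is needed here: adding more Type_event_N / sport_event_N pairs to the data should not require any change to the call, as long as the naming assumption holds.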
library(tidyverse)

as_tibble(input) %>%
  # Glue each event's two columns into one "Type_sport" string
  unite(v1, Type_event_1, sport_event_1) %>%
  unite(v2, Type_event_2, sport_event_2) %>%
  # Stack the glued columns: the key column is named v1, the value column v2
  gather(v1, v2, -Date) %>%
  # Split the glued strings back into the two target columns
  separate(v2, c("Type_event", "sport_event"), sep = "_") %>%
  select(-v1)

# # A tibble: 30 x 3
#     Date   Type_event sport_event  
#    <fct>  <chr>      <chr>        
# 1 01-Jan Skiing     Alpine skiing
# 2 02-Jan Skiing     Alpine skiing
# 3 03-Jan Skiing     Alpine skiing
# 4 04-Jan Marathon   Alpine skiing
# 5 05-Jan Marathon   Biathlon     
# 6 06-Jan Marathon   Biathlon     
# 7 07-Jan Marathon   Biathlon     
# 8 08-Jan Marathon   Biathlon     
# 9 09-Jan Marathon   Biathlon     
#10 10-Jan Marathon   Biathlon     
# # ... with 20 more rows

Note: input is converted to a tibble only for display purposes. You can just use input %>% ...
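
With many events, writing one unite() call per event gets tedious. A possible alternative, sketched here under the assumption that all columns are named Type_event_N / sport_event_N, is tidyr's pivot_longer() with the special ".value" name, which splits each column name into a target column and an event index. The names long and event are illustrative and not part of the original answer.

```r
library(tidyr)
library(dplyr)

# Same `input` as in the question, rebuilt here so the block runs on its own
input <- data.frame(
  Date = sprintf("%02d-Jan", 1:15),
  Type_event_1 = c(rep("Skiing", 3), rep("Marathon", 7), rep("Skating", 5)),
  sport_event_1 = c(rep("Alpine skiing", 4), rep("Biathlon", 6),
                    rep("Curling", 3), rep("Figure skating", 2)),
  Type_event_2 = c(rep("Skiing", 4), rep("Marathon", 6),
                   rep("Ice-Hockey", 3), rep("Skating", 2)),
  sport_event_2 = c(rep("Skeleton", 4), rep("Luge", 6),
                    rep("Hockey", 3), rep("Ski Jumping", 2)),
  stringsAsFactors = FALSE
)

long <- input %>%
  pivot_longer(
    cols = -Date,
    # The first capture group names the output column (".value");
    # the second capture group is the event number
    names_to = c(".value", "event"),
    names_pattern = "(.*_event)_(\\d+)",
    # Store the event number as an integer so that 10 sorts after 9
    names_transform = list(event = as.integer)
  ) %>%
  # Put all rows of event 1 before the rows of event 2, and so on
  arrange(event) %>%
  select(-event)
```

The result has the three columns Date, Type_event and sport_event, with one block of rows per event, and the same call should cover 20 events without modification if the naming assumption holds.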
