反復運行R腳本

Question

我有70個CSV文件，它們具有與我想要執行相同處理的相同列。 基本上，我要導入，清理，寫入文件並刪除所有變量，然后對下一個重復。 因為每個是0.5GB。

在不以有效方式迭代加載程序包的情況下該怎么辦？

library(tidyverse)
setwd("~/R/R-3.5.1/bin/i386")
df <- read.csv(file.choose(), header = TRUE, sep = ",")

inds <- which(df$pc_no == "DELL")
df[inds - 1, c("event_rep", "loc_id")] <- df[inds, c("pc_no", "cust_id")]
df1 <- df[-inds, ]

write.csv(df1, "df1.csv")

rm(list=ls())

為此，我想我將使用這段代碼，但不知道在哪里正確使用它。 IE如何實現上述代碼？

list.files(pattern="^events.*?\\.csv", full.names=TRUE, recursive=FALSE)
lapply(files, function(x) {
files <- function(df1)

})

Answer 1

根據上面的評論，在將文件分配給對象（您已定義為文件）后，只需使用lapply遍歷每個文件。

library(tidyverse)
setwd("~/R/R-3.5.1/bin/i386")

files <- list.files(pattern="^events.*?\\.csv", full.names=TRUE, recursive=FALSE)

lapply(files, function(x) {

  df <- read.csv(x, header = TRUE, sep = ",")

  inds <- which(df$pc_no == "DELL")
  df[inds - 1, c("event_rep", "loc_id")] <- df[inds, c("pc_no", "cust_id")]
  df1 <- df[-inds, ]

  write.csv(df1, paste0('cleaned_', x), row.names = FALSE)

})

反復運行R腳本

問題描述

1 個解決方案

解決方案1
1 2018-10-22 17:44:37

反復運行R腳本

問題描述

1 個解決方案

解決方案1 1 2018-10-22 17:44:37

解決方案1
1 2018-10-22 17:44:37