R: Looping over all elements of several columns to detect a phrase

Question

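I am trying to loop over several columns of a data frame that contain text documents.

I want to check every entry in columns 7 through 16 to see whether any of the text documents contain a certain phrase.

Each time the phrase is detected, I want to increase its occurrence count by 1.

This seems simple. I think I should loop over the columns and rows, but I can't quite figure out how to do it.

Any suggestions? Thanks in advance for any insight.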

fc_count <- 0

for (col in profiles[7:16]){
  for (row in 1:nrow(profiles)){

    if(isTRUE(grepl("my name is jeff", row)) == TRUE){

      fc_count = fc_count + 1

    }

  }

}

fc_count

Answer 1

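We can loop over columns 7 to 16 with lapply, applying grepl with the pattern to get a list of logical vectors, use Reduce with addition (+) to collapse that list into a single integer vector, and then take the sum to get the total count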

sum(Reduce(`+`, lapply(profiles[7:16], grepl, pattern = "my name is jeff")))
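
To see what each step of that one-liner produces, here is a minimal sketch on a made-up data frame. The name `toy` and its three text columns are assumptions for illustration only; they stand in for `profiles[7:16]`.

```r
# Hypothetical stand-in for profiles[7:16]: three text columns, four rows
toy <- data.frame(
  a = c("hi, my name is jeff", "nothing here", "my name is jeff!", "x"),
  b = c("y", "my name is jeff", "z", "w"),
  c = c("q", "r", "s", "t"),
  stringsAsFactors = FALSE
)

# One logical vector per column: TRUE where the cell contains the phrase
hits <- lapply(toy, grepl, pattern = "my name is jeff")

# Element-wise addition across columns: matching cells per row
per_row <- Reduce(`+`, hits)

# Total number of matching cells
total <- sum(per_row)
```

Note that grepl returns one TRUE/FALSE per cell, so this counts cells that contain the phrase, not repeated occurrences inside a single cell.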

Because grepl is vectorized and works on a vector, it is more compact to convert the 'data.frame' to a matrix (a matrix is a vector with a dim attribute)

sum(grepl("my name is jeff", as.matrix(profiles[7:16])))
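
A small sketch of why the matrix version works, again using a hypothetical `toy` data frame in place of `profiles[7:16]`: as.matrix turns the text columns into one character matrix, and grepl checks every cell of it in a single call.

```r
# Hypothetical two-column stand-in for profiles[7:16]
toy <- data.frame(
  a = c("my name is jeff", "no match"),
  b = c("also no match", "oh, my name is jeff"),
  stringsAsFactors = FALSE
)

# A character matrix: every cell of the data frame, stored as one vector
m <- as.matrix(toy)

# grepl returns one logical value per cell of the matrix
flags <- grepl("my name is jeff", m)

# Number of cells that contain the phrase
total <- sum(flags)
```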

Also, with a for loop we don't need a nested loop, because grepl is vectorized

fc_count <- 0
for (prf in profiles[7:16]) {
  fc_count <- fc_count + sum(grepl("my name is jeff", prf))
}
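
The single loop is enough because `for` over a data frame iterates over its columns, so `prf` is a whole column vector on each pass. The sketch below checks that on a hypothetical `toy` data frame (an assumed stand-in for `profiles[7:16]`) and confirms that the loop agrees with the matrix one-liner.

```r
# Hypothetical two-column stand-in for profiles[7:16]
toy <- data.frame(
  a = c("my name is jeff", "no match"),
  b = c("also no match", "oh, my name is jeff"),
  stringsAsFactors = FALSE
)

fc_count <- 0
for (prf in toy) {
  # prf holds one entire column, not a single cell
  stopifnot(is.character(prf), length(prf) == nrow(toy))
  fc_count <- fc_count + sum(grepl("my name is jeff", prf))
}

# The loop and the matrix approach give the same count
same <- fc_count == sum(grepl("my name is jeff", as.matrix(toy)))
```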

Solution 1
1 accepted 2020-03-27 19:14:51
