在 R 中，如何在使用列值的每个 dataframe 行上应用 function？

Question

假设我有一个 dataframe

作者 | 歌词 |

Name1 文本（字符）

Name2 文本（字符）

我想通过应用 function 来创建另一列，该列对于每一行从 Text 列中获取 Text，用空格分隔，然后遍历每个标记以查看它是否在我制作的另一个向量中（这样我就可以计算出文本中位于该其他向量中的标记）。

到目前为止我写的 function 如下

ReturnPercentPosWord = function(textLyrics){

WhitespaceSplitText = strsplit(textLyrics, " ")

LengthSplitText = length(WhitespaceSplitText)

CountInPosList = 0

for (i in WhitespaceSplitText) {

if (i %in% PositiveWords$word) {
  CountInPosList = CountInPosList+1
}

}

 if (CountInPosList == 0) {
return(0)

}

PercentInPos = (CountInPosList/LengthSplitText)*100

return(PercentInPos)}

我现在想将此 function 应用于每一行。 我努力了

TestPOSwordsDF$PercentPositiveWords = ReturnPercentPosWord(TestPOSwordsDF$Lyrics)

和

TestPOSwordsDF$PercentPositiveWords = apply(TestPOSwordsDF[, c('Lyrics'),drop=F], 1, ReturnPercentPosWord)

但是我收到一条消息，说the condition has length > 1 and only the first element will be used

我真的很感激这方面的任何帮助。 谢谢！

Answer 1

尝试使用这个：

TestPOSwordsDF$PercentPositiveWords <- sapply(
                   strsplit(TestPOSwordsDF$Lyrics, " "), function(x) 
                   mean(x %in% PositiveWords$word) * 100)

在这里，我们在空间上分割Lyrics ，得到PositiveWords$word中出现的单词的比率。

在 R 中，如何在使用列值的每个 dataframe 行上应用 function？

问题描述

1 个解决方案

解决方案1
1 已采纳 2020-04-16 12:21:30

在 R 中，如何在使用列值的每个 dataframe 行上应用 function？

问题描述

1 个解决方案

解决方案1 1 已采纳 2020-04-16 12:21:30

解决方案1
1 已采纳 2020-04-16 12:21:30