在 R 中，如何在使用列值的每个 dataframe 行上应用 function？

Question

Let's say I have a dataframe假设我有一个 dataframe

Author |作者 | Lyrics |歌词 |

Name1 Text (characters) Name1 文本（字符）

Name2 Text (characters) Name2 文本（字符）

I want to create another column through applying a function that for each row takes the Text from the Text column, separates by whitespaces, then iterates over each token to see if it is within another vector I made (so I can work out the percentage of tokens within the text that are within that other vector).我想通过应用 function 来创建另一列，该列对于每一行从 Text 列中获取 Text，用空格分隔，然后遍历每个标记以查看它是否在我制作的另一个向量中（这样我就可以计算出文本中位于该其他向量中的标记）。

The function as I have written so far is below到目前为止我写的 function 如下

ReturnPercentPosWord = function(textLyrics){

WhitespaceSplitText = strsplit(textLyrics, " ")

LengthSplitText = length(WhitespaceSplitText)

CountInPosList = 0

for (i in WhitespaceSplitText) {

if (i %in% PositiveWords$word) {
  CountInPosList = CountInPosList+1
}

}

 if (CountInPosList == 0) {
return(0)

}

PercentInPos = (CountInPosList/LengthSplitText)*100

return(PercentInPos)}

I want to apply this function to each row now.我现在想将此 function 应用于每一行。 I have tried我努力了

TestPOSwordsDF$PercentPositiveWords = ReturnPercentPosWord(TestPOSwordsDF$Lyrics)

and和

TestPOSwordsDF$PercentPositiveWords = apply(TestPOSwordsDF[, c('Lyrics'),drop=F], 1, ReturnPercentPosWord)

But I get a message saying the condition has length > 1 and only the first element will be used但是我收到一条消息，说the condition has length > 1 and only the first element will be used

I would really appreciate any help with this.我真的很感激这方面的任何帮助。 Thank you!谢谢！

Answer 1

Try using this:尝试使用这个：

TestPOSwordsDF$PercentPositiveWords <- sapply(
                   strsplit(TestPOSwordsDF$Lyrics, " "), function(x) 
                   mean(x %in% PositiveWords$word) * 100)

Here we split Lyrics on space, get the ratio of words which are present in PositiveWords$word .在这里，我们在空间上分割Lyrics ，得到PositiveWords$word中出现的单词的比率。

在 R 中，如何在使用列值的每个 dataframe 行上应用 function？

问题描述

1 个解决方案

解决方案1
1 已采纳 2020-04-16 12:21:30

在 R 中，如何在使用列值的每个 dataframe 行上应用 function？

问题描述

1 个解决方案

解决方案1 1 已采纳 2020-04-16 12:21:30

解决方案1
1 已采纳 2020-04-16 12:21:30