Iterate over every word in each row and remove the words that appear in a list

Question

I have the following column in a dataframe (each row is one person, and each cell holds a list of tokenized words).

Q395_R

[due, car, accident, year, ago, medical, condi...
[spending, time, loved, one, commute, able, co...
[initially, understanding, need, lockdown, ero...
[time, focus, exercise, le, sport, do, poured,..
[spending, time, family, realisation, need, ru...

I also have a list of words:

words395 = ['rising',
 'accident',
 'le',
 'lasted',
 'understanding',
 'spending',
 'adopted',
 'raising',
 'fabulous',
 'loneliness',
 'contract',....]

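I want to create a function that:

iterates over each person (each row)
iterates over each word in that row
removes the word from the cell if it is in the list words395

I'm not sure how to write the two loops that go through each person and each word. Can anyone help?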

Expected result:

Q395_R
    
[due, car, year, ago, medical, condi...
[time, loved, one, commute, able, co...
[initially, need, lockdown, ero...
[time, focus, exercise, sport, do, poured,..
[time, family, realisation, need, ru...

Answer 1

Convert the word list to a set, then use a lambda function with a list comprehension to filter each row:

s = set(words395)
df['Q395_R'] = df['Q395_R'].apply(lambda x: [y for y in x if y not in s])
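
For readers who want to see the two loops the question asks about, here is a minimal sketch in plain Python that spells out what the one-liner above does. The function name `remove_listed_words` and the shortened sample data are made up for illustration; only `words395` and the shape of the `Q395_R` column come from the question.

```python
def remove_listed_words(rows, words_to_remove):
    """Return new token lists with every listed word dropped."""
    # Build the set once; membership tests on a set do not scan the whole list.
    blocked = set(words_to_remove)
    cleaned_rows = []
    for tokens in rows:          # outer loop: one token list per person
        kept = []
        for word in tokens:      # inner loop: each word in that person's list
            if word not in blocked:
                kept.append(word)
        cleaned_rows.append(kept)
    return cleaned_rows


# Hypothetical, shortened sample in the shape of the Q395_R column.
rows = [
    ["due", "car", "accident", "year", "ago"],
    ["spending", "time", "loved", "one"],
    ["time", "focus", "exercise", "le", "sport"],
]
words395 = ["rising", "accident", "le", "lasted", "understanding", "spending"]

cleaned = remove_listed_words(rows, words395)
print(cleaned)
# [['due', 'car', 'year', 'ago'], ['time', 'loved', 'one'], ['time', 'focus', 'exercise', 'sport']]
```

In the `apply` version, pandas takes care of the outer loop over rows and the list comprehension is the inner loop over words. Both versions build new lists instead of deleting items from a list while iterating over it, which avoids skipped elements.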

Solution 1
3 Accepted 2022-06-07 09:41:08
