從數據框中刪除單詞列表

Question

我有一個由包含字符串的數據系列組成的數據框。 我有一個希望從每一行中刪除的字符串列表。

tcl_list = ["tab", "cr", "lf", "doublequote", "singlequote", "eof"]
df[['Summary', 'Description']] = re.sub("|".join(tcl_list), ' ', df[['Summary', 'Description']])

例如：

由此：

the tab dog is acting sneaky like a doublequote cat doublequote

對此：

the dog is acting sneaky like a cat

但是，我收到此錯誤：

TypeError: expected string or bytes-like object

我嘗試使用apply（）和lambda函數，但未成功。 有什么建議么？

Answer 1

我認為正則表達式需要應用於列的單個字符串

df['val'] = ['the tab dog is acting sneaky like a doublequote cat doublequote']

df.val.apply(lambda x: re.sub("|".join(tcl_list),'',x))

要么

df.val.str.replace("|".join(tcl_list),'')

出：

0    the  dog is acting sneaky like a  cat 
Name: val, dtype: object

從數據框中刪除單詞列表

問題描述

1 個解決方案

解決方案1
0 2018-11-08 05:41:25

從數據框中刪除單詞列表

問題描述

1 個解決方案

解決方案1 0 2018-11-08 05:41:25

解決方案1
0 2018-11-08 05:41:25