删除除逗号外的所有字符和数字

Question

I am trying to remove all the characters from string in the DataFrame column but keep the comma but it still removes everything including the comma.我试图从 DataFrame 列中的字符串中删除所有字符，但保留逗号，但它仍然会删除包括逗号在内的所有内容。

I know the question has been asked before but I tried many answers and all remove the comma as well.我知道之前有人问过这个问题，但我尝试了很多答案，并且都删除了逗号。

df[new_text_field_name] = df[new_text_field_name].apply(lambda elem: re.sub(r"(@[A-Za-z0-9]+)|([^0-9A-Za-z \t])|(\w+:\/\/\S+)|^rt|http.+?", "", str(elem)))

sample text:示范文本：

'100 % polyester, Paperboard (min. 30% recycled), 100% polypropylene', '100% 涤纶，纸板（至少 30% 回收），100% 聚丙烯'，

the required output:所需的 output：

' polyester, Paperboard, polypropylene', '聚酯，纸板，聚丙烯'，

Answer 1

Possible solution is the following:可能的解决方案如下：

# pip install pandas

import pandas as pd
pd.set_option('display.max_colwidth', 200)

# set test data and create dataframe
data = {"text": ['100 % polyester, Paperboard (min. 30% recycled), 100% polypropylene','Polypropylene plastic', '100 % polyester, Paperboard (min. 30% recycled), 100% polypropylene', 'Bamboo, Clear nitrocellulose lacquer', 'Willow, Stain, Solid wood, Polypropylene plastic, Stainless steel, Steel, Galvanized, Steel, 100% polypropylene', 'Banana fibres, Clear lacquer', 'Polypropylene plastic (min. 20% recycled)']}
df = pd.DataFrame(data)

def cleanup(txt):
    re_pattern = re.compile(r"[^a-z, ()]", re.I)
    return re.sub(re_pattern, "", txt).replace("  ", " ").strip()

df['text_cleaned'] = df['text'].apply(cleanup)
df

Returns退货

Answer 2

Character.isDigit() and Character.isLetter() functions can be used to identify whether it is number or character. Character.isDigit() 和Character.isLetter() 函数可以用来识别是数字还是字符。

删除除逗号外的所有字符和数字

问题描述

2 个解决方案

解决方案1
2 已采纳 2022-03-27 17:52:29

解决方案2
-1 2022-03-27 18:10:27

删除除逗号外的所有字符和数字

问题描述

2 个解决方案

解决方案1 2 已采纳 2022-03-27 17:52:29

解决方案2 -1 2022-03-27 18:10:27

解决方案1
2 已采纳 2022-03-27 17:52:29

解决方案2
-1 2022-03-27 18:10:27