
How can I remove the non-alphanumeric (English) characters in a series containing strings while retaining spaces?

Original post: 2019-03-05 23:59:08. Tags: python, pandas, nlp

Currently, I have:

[re.sub(r'\W', '', i) for i in training_data.loc[:, 'Text']]

However, with this the Hindi characters remain and all the spaces are removed. Any ideas?

1 answer

A negated character class might help:

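import re
import string

# Keep only printable ASCII; re.escape protects ], \, ^ and - inside the character class.
re.sub(f'[^{re.escape(string.printable)}]', '', 'asdf #$שדגכ')  # 'asdf #$'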
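
The snippet above keeps every printable ASCII character, punctuation included. If the goal is strictly ASCII letters, digits and spaces, an explicit negated class is one option. In Python 3, `\W` is Unicode-aware for str patterns, so Devanagari letters count as word characters and survive, while spaces are non-word characters and get stripped. The sketch below is only an illustration: the helper name `keep_ascii_alnum_and_spaces` and the sample strings are assumptions, not part of the original question.

```python
import re

# Matches runs of anything that is not an ASCII letter, an ASCII digit, or a space.
NON_ASCII_ALNUM = re.compile(r'[^A-Za-z0-9 ]+')

def keep_ascii_alnum_and_spaces(text):
    """Drop every character except A-Z, a-z, 0-9 and the space character."""
    return NON_ASCII_ALNUM.sub('', text)

# Hypothetical sample rows standing in for training_data.loc[:, 'Text'].
texts = ['Hello, world! 123', 'नमस्ते India 2019']
cleaned = [keep_ascii_alnum_and_spaces(t) for t in texts]
print(cleaned)
```

On a pandas Series the same pattern can be passed to `Series.str.replace` with `regex=True`, for example `training_data['Text'].str.replace(r'[^A-Za-z0-9 ]+', '', regex=True)`. Removed characters can leave leading or doubled spaces behind, so a follow-up `.str.strip()` may be worth adding.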


