Remove zero width space unicode character from Python string

Question

I have a string in Python like this:

u'\u200cHealth & Fitness'

How can I remove the \u200c part from the string?

Answer 1

You can encode it to ASCII and ignore errors:

u'\u200cHealth & Fitness'.encode('ascii', 'ignore')

Output (Python 2; Python 3 returns the bytes object b'Health & Fitness'):

'Health & Fitness'
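
On Python 3 the same call returns bytes, so a decode step is needed to get a str back. A minimal sketch, assuming Python 3; the second input is made up to show that this approach drops every non-ASCII character, not only U+200C:

```python
# Python 3: encode() returns bytes, so decode back to str afterwards.
original = '\u200cHealth & Fitness'
cleaned = original.encode('ascii', 'ignore').decode('ascii')
print(repr(cleaned))  # 'Health & Fitness'

# Hypothetical input: the accented letter is discarded along with U+200C.
sample = '\u200cCaf\u00e9'
lossy = sample.encode('ascii', 'ignore').decode('ascii')
print(repr(lossy))  # 'Caf'
```

If the text may contain other non-ASCII characters that should be kept, a targeted removal is the safer choice.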

Answer 2

If you have a string that contains a non-ASCII Unicode character, such as

s = "Airports Council International \u2013 North America"

then you can try:

newString = (s.encode('ascii', 'ignore')).decode("utf-8")

and the output will be (note the double space left where the dash was removed):

Airports Council International  North America


Answer 3

I just use replace, since I don't need the character. Strings are immutable, so assign the result:

varstring = varstring.replace('\u200c', '')

Or in your case:

u'\u200cHealth & Fitness'.replace('\u200c', '')
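
If other invisible characters might also be present, the same idea extends to a translation table. A minimal sketch, assuming Python 3; the helper name strip_zero_width and the chosen set of code points are assumptions for illustration, not part of the answer above:

```python
# Code points picked for illustration: U+200B (zero width space),
# U+200C (zero width non-joiner), U+200D (zero width joiner),
# U+FEFF (byte order mark / zero width no-break space).
ZERO_WIDTH = dict.fromkeys(map(ord, '\u200b\u200c\u200d\ufeff'))

def strip_zero_width(text):
    """Return text with the code points listed above removed."""
    # Keys mapped to None are deleted by str.translate.
    return text.translate(ZERO_WIDTH)

print(repr(strip_zero_width('\u200cHealth\u200b & Fitness')))  # 'Health & Fitness'
```

Unlike the ASCII-encoding approach, this leaves all other non-ASCII text untouched.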

Answer 4

The following works for me:

mystring.encode('ascii', 'ignore').decode('unicode_escape')
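
One thing to watch for: decoding with unicode_escape also interprets any backslash sequences left in the text. A small sketch, assuming Python 3; the second input is made up to show the effect:

```python
mystring = '\u200cHealth & Fitness'
result = mystring.encode('ascii', 'ignore').decode('unicode_escape')
print(repr(result))  # 'Health & Fitness'

# Hypothetical input containing a literal backslash followed by "n".
path = '\u200cC:\\new'
ascii_bytes = path.encode('ascii', 'ignore')

decoded = ascii_bytes.decode('unicode_escape')  # backslash-n becomes a newline
plain = ascii_bytes.decode('ascii')             # backslash is kept as-is

print(decoded == 'C:' + '\n' + 'ew')  # True
print(plain == 'C:\\new')             # True
```

When the text may contain backslashes, decoding with 'ascii' avoids this surprise.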

Answer 5

In the specific case in the question, where the string is prefixed with a single u'\u200c' character, the solution is as simple as taking a slice that does not include the first character:

original = u'\u200cHealth & Fitness'
fixed = original[1:]

If the leading character may or may not be present, str.lstrip may be used:

original = u'\u200cHealth & Fitness'
fixed = original.lstrip(u'\u200c')

The same solutions work in Python 3. From Python 3.9, str.removeprefix is also available:

original = u'\u200cHealth & Fitness'
fixed = original.removeprefix(u'\u200c')
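
The three approaches differ when the prefix is missing or repeated. A minimal sketch comparing them, assuming Python 3.9 or later; the helper name compare and the extra inputs are made up for illustration:

```python
ZWNJ = '\u200c'

def compare(original):
    """Return (slice, lstrip, removeprefix) results for one input."""
    sliced = original[1:]                     # always drops the first character
    stripped = original.lstrip(ZWNJ)          # drops every leading U+200C
    unprefixed = original.removeprefix(ZWNJ)  # drops at most one leading U+200C
    return sliced, stripped, unprefixed

print(compare(ZWNJ + 'Health'))  # ('Health', 'Health', 'Health')
print(compare('Health'))         # ('ealth', 'Health', 'Health')

# With two leading U+200C characters, only lstrip removes both.
sliced, stripped, unprefixed = compare(ZWNJ + ZWNJ + 'Health')
print(stripped == 'Health', sliced == unprefixed == ZWNJ + 'Health')  # True True
```

So the slice is only safe when the character is known to be there, while lstrip and removeprefix are safe when it is absent.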

5 answers

Answer 1: 46, ACCEPTED, 2017-09-11 11:29:15

Answer 2: 27, 2018-02-21 07:47:23

Answer 3: 13, 2019-03-28 15:06:15

Answer 4: 3, 2018-12-11 10:41:44

Answer 5: 1, 2021-01-12 17:50:04
