Encoding/converting a string with spanish or polish characters

Question

1) How do I convert a variable with a string like "wdzi\\xc4\\x99czno\\xc5\\x9bci" into "wdzięczności" ?

2) Also how do I convert string variable with characters like "Â±", "Ä™", "Ä†" into correct letters?

I emphasise "variable" because all I've got from googling was examples with " u'some string' " and the like and I can't get anything like that to work.

I use "# -*- coding: utf-8 -*- " in second line of my script and I still crash into these problems.

Also I was said that simple print should output correctly - but it does not.

Answer 1

In Python 2.7 IDLE, I get this output:

>>> print "wdzi\xc4\x99czno\xc5\x9bci".decode('utf-8')
wdzięczności

Your first string appears to be a UTF-8 byte string, so all that's necessary is to decode it into a Unicode string. When Python prints that string, it will encode it back to the proper encoding based on your environment.

If you're using Python 3 then you have a string that has been decoded improperly and will need a little more work to fix the damage.

>>> print("wdzi\xc4\x99czno\xc5\x9bci".encode('iso-8859-1').decode('utf-8'))
wdzięczności

Encoding/converting a string with spanish or polish characters

Question

1 answers

solution1
3 2015-10-14 20:26:19

Encoding/converting a string with spanish or polish characters

Question

1 answers

solution1 3 2015-10-14 20:26:19

solution1
3 2015-10-14 20:26:19