简体繁体中英

How do you convert latin1 to utf8 character encoding?

原文 2011-03-30 15:37:14 9 2 php

So, I currently have this problem - I have a sql db dump and the character encoding in it is latin1, but there are some utf8 chars in the file that look like Ä (should be ā) Ä« (should be ī) Å¡ (should be š) Ä“ (should be ē) etc. How do I convert these leters back to the original utf8.?

Character in the file <-> what it should have been <-> bytes

Ä“ <-> ē <-> 5

Ä <-> ā <-> 2

Å¡ <-> š <-> 4

Ä« <-> ī <-> 4

2 answers

If you're seeing multiple bytes for what should be single characters, chances are it's already in UTF-8. Bear in mind that ISO-8859-1 is a single-byte-per-character encoding, whereas UTF-8 can take multiple bytes - and any non-ASCII character does take multiple bytes.

I suggest you open the file in a UTF-8-aware text editor, and check it there.

Encoding should be set on the connection on which you import data and read out data. If both of them are set to UTF-8, you will face no problems.

If you however import them with a latin1 connection, and later on reading it out with a UTF-8, you're in a world of trouble.

PHP internally only handles latin1, however that isn't nessecarily a problem for you.

If you have already wrongly imported the data, you would see a lot of ? or (diamond + ?) on your output I think.

But basically, when connecting frmo PHP, make sure to invoke SET NAMES 'utf8' first thing you do and see if that works.

If data still is wrong, you could use PHPs functions utf8_encode / utf8_decode to convert the data that is problematic.

In a working scenario they should never be used though.

Convert latin1 to UTF8

Character Encoding utf8 to latin1, explain these 2 characters

How to convert latin1 table to utf8 with serialized values?

How do I convert an utf8 string in “\uxxxx” format to latin1?

Convert latin1 characters on a UTF8 table into UTF8

Find *actual* character encoding of data in MySQL DB: UTF8 Latin1 illegal collation

UTF8 versus Latin1

Convert huge database from Latin1 to UTF8?

Detect latin1 characters in utf8 string

UTF8 -> Latin1 Difficulty, PHP

暂无

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

Related Question Convert latin1 to UTF8 Character Encoding utf8 to latin1, explain these 2 characters How to convert latin1 table to utf8 with serialized values? How do I convert an utf8 string in “\uxxxx” format to latin1? Convert latin1 characters on a UTF8 table into UTF8 Find *actual* character encoding of data in MySQL DB: UTF8 Latin1 illegal collation UTF8 versus Latin1 Convert huge database from Latin1 to UTF8? Detect latin1 characters in utf8 string UTF8 -> Latin1 Difficulty, PHP

Related Tags

How do you convert latin1 to utf8 character encoding?

Question

2 answers

solution1
2 ACCPTED 2011-03-30 15:42:08

solution2
0 2011-03-30 16:38:13

How do you convert latin1 to utf8 character encoding?

Question

2 answers

solution1 2 ACCPTED 2011-03-30 15:42:08

solution2 0 2011-03-30 16:38:13

solution1
2 ACCPTED 2011-03-30 15:42:08

solution2
0 2011-03-30 16:38:13