简体   繁体   中英

Using PHP's substr() with special characters at the end results in question marks

When I use the substr() function in PHP, I get an question mark (a square with a question mark - depending on the browser) at the end of the string when this last character was a special one, like ë or ö, etc...

$introtext = html_entity_decode($item->description, ENT_QUOTES, "UTF-8");
$introtext = substr($introtext, 0, 200);

How can I escape this?

If your string has multibyte encoding (like UTF-8) does, you should use mb_substr to avoid problems like this:

$introtext=mb_substr($introtext,0,200);

In case someone tried the previous answers, and it still didn't work:

Try to add a Unicode name in mb_substr like:

$introtext = mb_substr($introtext, 0, 200, 'utf-8');

That is because substr does not work with multibyte characters. substr will probably cut a multibyte character "in half". You should instead use mb_substr . Also make sure that your file is saved in UTF-8.

$introtext = mb_substr($introtext, 0, 200);

Use mb_substr

use mb_substr instead of substr that solves problems like that, but before that check mb_string is enabled in your PHP configuration by:

php -i | grep mbstring

It would show you that the mb_string is enabled or not. If not you can install that by: (for PHP 8.0)

sudo apt-get install php8.0-mbstring

Now you can use mb_substr like this:

mb_substr(string $string, int $start, int $length, string $encoding): string

$introtext = mb_substr($introtext, 0, 200, 'UTF-8');

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM