php：file_get_contents編碼問題

Question

我的任務很簡單：向translate.google.com發帖請求並獲取翻譯。 在下面的例子中，我使用“hello”這個詞翻譯成俄語。

header('Content-Type: text/plain; charset=utf-8');  // optional
error_reporting(E_ALL | E_STRICT);

$context = stream_context_create(array(
    'http' => array(
        'method' => 'POST',
        'header' => implode("\r\n", array(
            'Content-type: application/x-www-form-urlencoded',
            'Accept-Language: en-us,en;q=0.5', // optional
            'Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7' // optional
        )),
        'content' => http_build_query(array(
            'prev'  =>  '_t',
            'hl'    =>  'en',
            'ie'    =>  'UTF-8',
            'text'  =>  'hello',
            'sl'    =>  'en',
            'tl'    =>  'ru'
        ))
    )
));

$page = file_get_contents('http://translate.google.com/translate_t', false, $context);

require '../simplehtmldom/simple_html_dom.php';
$dom = str_get_html($page);
$translation = $dom->find('#result_box', 0)->plaintext;
echo $translation;

標記為可選的行是那些沒有輸出相同的行。 但我得到了奇怪的人物......

������

我試過了

echo mb_convert_encoding($translation, 'UTF-8');

但我明白了

ÐÒÉ×ÅÔ

有人知道如何解決這個問題嗎？

更新：

忘了提到我所有的php文件都是用UTF-8編碼的，沒有BOM
當我將“to”語言更改為“en”時，即從英語翻譯成英語，它可以正常工作。
我不認為我正在使用的庫弄亂它，因為我試圖輸出整個$頁面而不將其傳遞給庫函數。
我正在使用PHP 5

Answer 1

如果它可以幫助CURL導入字符編碼問題，請嘗試查看此帖子

你也可以嘗試這個片段（取自php.net）

<?php
function file_get_contents_utf8($fn) {
     $content = file_get_contents($fn);
      return mb_convert_encoding($content, 'UTF-8',
          mb_detect_encoding($content, 'UTF-8, ISO-8859-1', true));
}
?>

Answer 2

首先，您的瀏覽器是否設置為UTF-8？ 在Firefox中，您可以在View-> Character Encoding中設置文本編碼。 確保選中“Unicode（UTF-8）”。 我還將View-> Character Encoding-> Auto-Detect設置為“Universal”。

其次，您可以嘗試傳遞FILE_TEXT標志，如下所示：

$page = file_get_contents('http://translate.google.com/translate_t', FILE_TEXT, $context);

Answer 3

Accept-Charset並不是那么可選。 你應該在那里指定UTF8。 俄語字符在ISO_8859-1中無效

php：file_get_contents編碼問題

問題描述

3 個解決方案

解決方案1
9 2009-04-03 10:28:13

解決方案2
9 已采納 2009-04-03 10:31:36

解決方案3
1 2009-04-03 10:34:36

php：file_get_contents編碼問題

問題描述

3 個解決方案

解決方案1 9 2009-04-03 10:28:13

解決方案2 9 已采納 2009-04-03 10:31:36

解決方案3 1 2009-04-03 10:34:36

解決方案1
9 2009-04-03 10:28:13

解決方案2
9 已采納 2009-04-03 10:31:36

解決方案3
1 2009-04-03 10:34:36