使用简单的html dom获取页面标题？

Question

我正在尝试使用简单的html dom（TITLE标记之间的页面的标题）从外部站点获取标题，但未检索任何内容。 有任何想法吗？

$html = new simple_html_dom();
$html->load('http://www.google.com');
$titleraw = $html->find('title');
$title = $titleraw->title;

Answer 1

$html = new simple_html_dom();
$html->load_file('http://www.google.com'); 
$titleraw = $html->find('title',0);
$title = $titleraw->innertext;

$html->load_file()从文件或URL加载内容。

$html->find('title')将返回一个数组

和$titleraw->innertext返回title元素的内容

Answer 2

->load()需要一个包含HTML而不是URL的字符串。

尝试：

$html = file_get_html('http://google.com');

代替。

除此之外，请注意，Google的ToS禁止屏幕抓取工具，因此希望您只是将该网址用作填充示例，而不是真正尝试抓取的内容。

Answer 3

只是

$mypage=file_get_html('http://myurl.com');
$title=$mypage->find('title',0);
echo $title->plaintext;

Answer 4

用这个

$html = new simple_html_dom();  
$html->load('http://www.google.com');
$titleraw = $html->find('title');
foreach($html->find('title') as $link_element) {        
    echo $link_element->plaintext;
}

而不是$title = $titleraw->title;

Answer 5

if(
  preg_match(
    '~<title>(.*)</title>~si',
    file_get_contents('http://www.google.com'),
    $result
  );
  var_dump($result[1]);
}else{ /* no result */ }

其他

$titleraw = $html->xpath('//title');

Answer 6

尝试

include_once 'simple_html_dom.php';
$oHtml = str_get_html($url);
$Title = array_shift($oHtml->find('title'))->innertext;
$Description = array_shift($oHtml->find("meta[name='description']"))->content;
$keywords = array_shift($oHtml->find("meta[name='keywords']"))->content;
echo $title;
echo $Description;
echo $keywords;

Answer 7

尝试这个

$html = new simple_html_dom()
$data = file_get_html('http://www.google.com/');

// Find all images
foreach($html->find('title') as $element)
       echo $element->plaitext . '<br>';

Answer 8

使用DOM和xpath，您可以执行以下操作：

function getTitle($url) {
   libxml_use_internal_errors(true);
   $doc = new DOMDocument();
   $doc->loadHTMLFile($url);
   $xpath = new DOMXPath($doc);
   $nlist = $xpath->query("//head/title");
   return $nlist->item(0)->nodeValue;
}
echo "Title: " . getTitle("http://www.google.com") . "\n";

使用简单的html dom获取页面标题？

问题描述

8 个解决方案

解决方案1
2 2013-02-05 02:52:39

解决方案2
2 2012-01-09 15:21:48

解决方案3
1 2014-05-30 13:12:17

解决方案4
1 2017-10-25 18:15:10

解决方案5
1 2012-01-09 15:21:10

解决方案6
0 2013-09-20 06:33:01

解决方案7
0 2013-09-20 06:46:05

解决方案8
0 2012-01-09 15:43:50

使用简单的html dom获取页面标题？

问题描述

8 个解决方案

解决方案1 2 2013-02-05 02:52:39

解决方案2 2 2012-01-09 15:21:48

解决方案3 1 2014-05-30 13:12:17

解决方案4 1 2017-10-25 18:15:10

解决方案5 1 2012-01-09 15:21:10

解决方案6 0 2013-09-20 06:33:01

解决方案7 0 2013-09-20 06:46:05

解决方案8 0 2012-01-09 15:43:50

解决方案1
2 2013-02-05 02:52:39

解决方案2
2 2012-01-09 15:21:48

解决方案3
1 2014-05-30 13:12:17

解决方案4
1 2017-10-25 18:15:10

解决方案5
1 2012-01-09 15:21:10

解决方案6
0 2013-09-20 06:33:01

解决方案7
0 2013-09-20 06:46:05

解决方案8
0 2012-01-09 15:43:50