I need to obtain JSON with data extracted from an HTML document
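I am trying to convert a company phone list, which contains images, names, and phone numbers, into a JSON file. I tried looping through all the <a> elements to find the img src and the text of div.employee-desc, but without success. I also tried DOMDocument(), but that failed as well.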
<section>
<a href="tel:+471234567890">
<article class="clearfix">
<div class="employee-image">
<img src"image_1.jpg">
</div>
<div class="employee-desc">
Emma doe <br>
+471234567890
</div>
</article>
</a>
<a href="tel:+471234567890">
<article class="clearfix">
<div class="employee-image">
<img src"image_2.jpg">
</div>
<div class="employee-desc">
Frank doe <br>
+471234567890
</div>
</article>
</a>
<a href="tel:+xxxxxxxx">
<article class="clearfix">
<div class="employee-image">
<img src"image_3.jpg">
</div>
<div class="employee-desc">
John doe <br>
+471234567890
</div>
</article></a>
</section>
Ideally, the JSON file would look like this:
[
{
"image":"image_1.jpg",
"name":"Emma doe",
"phone":"+47 1234567890"
},
{
"image":"image_2.jpg",
"name":"Frank doe",
"phone":"+47 1234567890"
},
{
"image":"image_3.jpg",
"name":"John doe",
"phone":"+47 1234567890"
}
]
Does anyone know how to do this in PHP?
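You can find the code below. Note that the img tags in your example are incorrect: they should read <img src="..."> rather than <img src"...">.
I assume your HTML is in the $html variable.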
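$json_arr = array();
// Keep only the markup between <section> and </section>.
$html = substr($html, strpos($html, '<section>') + 9);
$html = substr($html, 0, strpos($html, '</section>'));
// Every chunk after the first starts with the href value of one <a> tag.
$arr = explode('<a href="', $html);
foreach ($arr as $k => $line) {
    if ($k == 0) continue; // text before the first link
    // Phone: the href value without the "tel:" prefix.
    $phone = substr($line, 0, strpos($line, '"'));
    $phone = str_replace('tel:', '', $phone);
    $phone = trim($phone);
    // Image: the src value (needs the corrected src="..." form).
    $image = substr($line, strpos($line, '<img src="') + 10);
    $image = substr($image, 0, strpos($image, '"'));
    // Name: the text in div.employee-desc before the <br>.
    // 27 is the length of the opening '<div class="employee-desc">' tag.
    $name = substr($line, strpos($line, '<div class="employee-desc">') + 27);
    $name = substr($name, 0, strpos($name, '</div>'));
    $name = substr($name, 0, strpos($name, '<br'));
    $name = trim($name);
    $json_arr[$k - 1]['image'] = $image;
    $json_arr[$k - 1]['name'] = $name;
    $json_arr[$k - 1]['phone'] = $phone;
}
$json = json_encode($json_arr);
echo $json . "\n";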
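The desired output in the question shows the phone number with a space after the country code ("+47 1234567890"), while the href value has none. If that formatting matters, something like the following could be applied to $phone before it is stored. This is only a sketch: formatPhone() is a hypothetical helper name, and it assumes a two-digit country code such as +47. Anything that does not match that pattern is returned unchanged.

```php
<?php
// Hypothetical helper (not part of the original answer).
// Assumption: the number starts with "+" and a two-digit country code.
function formatPhone(string $phone): string
{
    // Insert a space between the country code and the rest of the digits.
    // preg_replace() returns null on error, so fall back to the input.
    return preg_replace('/^(\+\d{2})(\d+)$/', '$1 $2', $phone) ?? $phone;
}

echo formatPhone('+471234567890'), "\n";
```

Country codes vary in length, so a real phone list with mixed countries would need a proper lookup rather than this fixed pattern.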
A shorter way, with the help of PHP Simple HTML DOM Parser:
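// $data is assumed to hold the HTML shown in the question.
$html = HtmlDomParser::str_get_html($data);
$result = array();
foreach ($html->find('a') as $element) {
    // Path: a > article > div.employee-image > img
    $image = $element->children(0)->children(0)->children(0)->src;
    // div.employee-desc holds "name <br> phone"
    list($name, $phone) = array_map('trim', explode('<br>', $element->children(0)->children(1)->innertext));
    $row = (object) compact('image', 'name', 'phone');
    $result[] = $row;
}
$output = json_encode($result, JSON_PRETTY_PRINT);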
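Since the question mentions that DOMDocument() did not work, here is a rough sketch of how it might be done with PHP's built-in DOM extension and DOMXPath, without any third-party library. It assumes the img tags have been corrected to src="..." and that each entry sits in its own <article>. The function name extractEmployees() is made up for this example, and the phone number is read from the text after the <br> rather than from the href.

```php
<?php
// Hypothetical helper (not from the original answers): one row per <article>.
function extractEmployees(string $html): array
{
    // HTML5 tags such as <section> and <article> make libxml emit warnings;
    // collect them internally instead of printing them.
    $previous = libxml_use_internal_errors(true);
    $doc = new DOMDocument();
    $doc->loadHTML($html);
    libxml_clear_errors();
    libxml_use_internal_errors($previous);

    $xpath = new DOMXPath($doc);
    $rows = [];
    foreach ($xpath->query('//article') as $article) {
        $img  = $xpath->query('.//img', $article)->item(0);
        $desc = $xpath->query('.//div[contains(@class, "employee-desc")]', $article)->item(0);
        if ($img === null || $desc === null) {
            continue; // skip entries that do not match the expected layout
        }
        // The text nodes on either side of <br> hold the name and the phone.
        $parts = [];
        foreach ($xpath->query('text()', $desc) as $text) {
            $value = trim($text->nodeValue);
            if ($value !== '') {
                $parts[] = $value;
            }
        }
        $rows[] = [
            'image' => $img->getAttribute('src'),
            'name'  => $parts[0] ?? '',
            'phone' => $parts[1] ?? '',
        ];
    }
    return $rows;
}

// Sample input: one entry from the question, with the src attribute corrected.
$html = <<<'HTML'
<section>
<a href="tel:+471234567890">
<article class="clearfix">
<div class="employee-image">
<img src="image_1.jpg">
</div>
<div class="employee-desc">
Emma doe <br>
+471234567890
</div>
</article>
</a>
</section>
HTML;

echo json_encode(extractEmployees($html), JSON_PRETTY_PRINT), "\n";
```

Querying by <article> instead of by <a> is a deliberate choice here: it does not depend on how the parser handles block elements nested inside a link.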