[英]Looping through single nodes in HTMLAgilityPack in C#
我正在尝试从具有以下结构的项目列表中获取“ ID”和“ URL”:
<li class="ad container">
<article class="item_list view_listing " data-id="109199375"> <div class="item_list_section item_date"> <time datetime="2015-06-21 20:10:05"> 21 giu<br> 20:10 </time> </div> <div class="item_list_section item_image"> <a href="http://www.subito.it/hobby-collezionismo/elicottero-scorpio-210-treviso-109199375.htm" > <div class="item_image_wrapper"> <img src="http://s.sbito.it/bigthumbs/07/07201887042182.jpg"> <div class="item_extra_images"> <img src="http://s.sbito.it/1201434547094/img2/ico_photo.png"> <span class="item_extra_photo_number">3</span> </div> </div> </a> </div> <div class="item_list_section item_description"> <h2> <a href="http://www.subito.it/hobby-collezionismo/elicottero-scorpio-210-treviso-109199375.htm" name="109199375" title="Elicottero scorpio 2&10" > Elicottero scorpio 2&10 </a> </h2> <span class="item_price">35 €</span> <span class="item_category">Collezionismo</span> </div> <div class="item_list_section item_info"> <span class="item_type">In vendita</span> <span class="item_location">Colle Umberto <em class="item_city">(TV)</em></span> </div> </article>
</li>
<li class="ad container">
<article class="item_list view_listing " data-id="128462782"> <div class="item_list_section item_date"> <time datetime="2015-06-20 18:54:44"> 20 giu<br> 18:54 </time> </div> <div class="item_list_section item_image"> <a href="http://www.subito.it/hobby-collezionismo/elicottero-costruzione-tedesca-verona-128462782.htm" > <div class="item_image_wrapper"> <img src="http://s.sbito.it/bigthumbs/06/06201423464701.jpg"> </div> </a> </div> <div class="item_list_section item_description"> <h2> <a href="http://www.subito.it/hobby-collezionismo/elicottero-costruzione-tedesca-verona-128462782.htm" name="128462782" title="Elicottero costruzione tedesca" > Elicottero costruzione tedesca </a> </h2> <span class="item_price">170 €</span> <span class="item_category">Collezionismo</span> </div> <div class="item_list_section item_info"> <span class="item_type">In vendita</span> <span class="item_location">Legnago <em class="item_city">(VR)</em></span> </div> </article>
</li>
<li class="ad container">
<article class="item_list view_listing " data-id="128437705"> <div class="item_list_section item_date"> <time datetime="2015-06-20 14:50:00"> 20 giu<br> 14:50 </time> </div> <div class="item_list_section item_image"> <a href="http://www.subito.it/hobby-collezionismo/elicottero-rc-450-verona-128437705.htm" > <div class="item_image_wrapper"> <img src="http://s.sbito.it/bigthumbs/06/06201463968327.jpg"> <div class="item_extra_images"> <img src="http://s.sbito.it/1201434547094/img2/ico_photo.png"> <span class="item_extra_photo_number">2</span> </div> </div> </a> </div> <div class="item_list_section item_description"> <h2> <a href="http://www.subito.it/hobby-collezionismo/elicottero-rc-450-verona-128437705.htm" name="128437705" title="Elicottero rc 450" > Elicottero rc 450 </a> </h2> <span class="item_price">200 €</span> <span class="item_category">Collezionismo</span> </div> <div class="item_list_section item_info"> <span class="item_type">In vendita</span> <span class="item_location">Veronella <em class="item_city">(VR)</em></span> </div> </article>
</li>
问题是我想将它们添加到列表中,并且不能执行多个循环,否则它们将位于不同的索引中。 所以这就是我想要做的:
var list = page.DocumentNode.SelectNodes("//li[@class='ad container']");
foreach (var obj in list)
{
items.Add( new Item() { itemUrl = obj.SelectSingleNode("/a").Attributes["href"].Value, itemId = obj.SelectSingleNode("//article").Attributes["data-id"].Value });
}
但是它不起作用,例如对于“ data-id”,它重复找到的第一个元素。 我该如何运作?
谢谢。
使用相对路径itemUrl = obj.SelectSingleNode(".//a").Attributes["href"].Value, itemId = obj.SelectSingleNode(".//article").Attributes["data-id"].Value
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.