在WP7中解析HTML字符串

Question

我需要解析从服务器收到的HTML字符串。

 <html>
 <head/>
 <body style="margin: 0;padding: 0">
    <a href="http://itunes.apple.com/WebObjects/MZStore.woa   
/wa/viewSoftware?id=319737742&amp;mt=8&amp;uo=6" style="margin: 0;padding: 0"><img   
src="https://s3.amazonaws.com/sportschatter/postcard.jpg" style="margin: 0;padding: 
0"/></a>
</body>
</html>

这是我从服务器获得的响应。 我需要检索img URL https://s3.amazonaws.com/sportschatter/postcard.jpg以及href部分。 我有适用于WP7的HTML Agility包，但是我不知道如何编写查询来获取此信息。 我尝试过这样的事情：

HtmlAgilityPack.HtmlDocument document = new HtmlAgilityPack.HtmlDocument();
         document.LoadHtml(htmlString);


       var value  =  document.DocumentNode.Descendants("img src").
                                       Select(
                                           x =>
                                           x.InnerText);

这没有任何价值。 我也尝试过Regex ：

    string parseString = htmlstring;
        Regex expression = new Regex(@".*img src=(\d+).*$");
        Match match = expression.Match(parseString);
        MessageBox.Show(match.Groups[1].Value);

但这也不起作用。 请让我知道我在做什么错。

Answer 1

您显然误解了使用LINQ2XML语法的含义（没有XPath，因为Windows Phone不支持XPath）

您需要执行以下操作：

var image = document.DocumentNode.Descendants("img").First()
var source = image.GetAttribute("src", "").Value;

Answer 2

使用HtmlAgilityPack-不要使用正则表达式。

Descendants的“查询字符串”是XPath，而不是类似CSS的选择器。

这是一个示例： http : //htmlagilitypack.codeplex.com/wikipage?title=Examples这是有关XPath的一些信息： http : //msdn.microsoft.com/zh-cn/library/ms256086.aspx

在WP7中解析HTML字符串

问题描述

2 个解决方案

解决方案1
2 已采纳 2011-11-08 12:19:26

解决方案2
-1 2011-11-08 11:16:11

在WP7中解析HTML字符串

问题描述

2 个解决方案

解决方案1 2 已采纳 2011-11-08 12:19:26

解决方案2 -1 2011-11-08 11:16:11

解决方案1
2 已采纳 2011-11-08 12:19:26

解决方案2
-1 2011-11-08 11:16:11