简体   繁体   English

C# 从网站的 html 文档中提取单个变量

[英]C# extracting a single variable from html document from a website

This is what it looks like.这就是它的样子。 网页格式

I've tried something like this:我试过这样的事情:

var url = "https://www.tek-zence.no/";
var httpsClient = new HttpClient();
var html = await httpsClient.GetStringAsync(url);

var htmlDocument = new HtmlDocument();
htmlDocument.LoadHtml(html);

var element = htmlDocument.DocumentNode.Descendants("div")
    .Where(node => !node.GetAttributeValue("class", "").Contains("feature-nummer")).ToString();
Console.WriteLine(element.Innertext);

Any thoughts?有什么想法吗?

With HtmlAgilityPack, you can do this:使用 HtmlAgilityPack,您可以这样做:

var text = @"<div><div class='feature-nummer'>01</div></div>";

var doc = new HtmlAgilityPack.HtmlDocument();
doc.LoadHtml(text);

int number = -1;
var div = doc.DocumentNode.SelectSingleNode("//div[contains(@class, 'feature-nummer')]");
if (div != null && int.TryParse(div.InnerText, out int value))
{
    number = value;
}

The HTML here is just a sample like your HTML. Must work with your HTML too.此处的 HTML 只是一个示例,与您的 HTML 类似。必须也适用于您的 HTML。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM