从 html 源文件中提取文本值

Question

in this code var TempTxt holds An Html Body Content as string how can i extract element <table> or <td> inner text/ html using lambada syntax ?在此代码中 var TempTxt将 Html 正文内容作为字符串我如何使用 Lambada 语法提取元素<table>或<td>内部文本/ html？

    public  string  ExtractPageValue(IWebDriver DDriver, string url="") 
    {
        if(string.IsNullOrEmpty(url))
        url = @"http://www.boi.org.il/he/Markets/ExchangeRates/Pages/Default.aspx";
        var service = InternetExplorerDriverService.CreateDefaultService(directory);
        service.LogFile = directory + @"\seleniumlog.txt";
        service.LoggingLevel = InternetExplorerDriverLogLevel.Trace;

        var options = new InternetExplorerOptions();
        options.IntroduceInstabilityByIgnoringProtectedModeSettings = true;

        DDriver = new InternetExplorerDriver(service, options, TimeSpan.FromSeconds(60));
        DDriver.Navigate().GoToUrl(url);
        var TempTxt = DDriver.PageSource;
        return "";//Math.Round(Convert.ToDouble( TempTxt.Split(' ')[10]),2).ToString();

    }

Answer 1

If you are open to try HtmlAgilityPack如果您愿意尝试HtmlAgilityPack

HtmlAgilityPack.HtmlDocument doc = new HtmlAgilityPack.HtmlDocument();
doc.LoadHtml(html);

var table = doc.DocumentNode.SelectNodes("//table/tr")
               .Select(tr => tr.Elements("td").Select(td => td.InnerText).ToList())
               .ToList();

从 html 源文件中提取文本值

问题描述

1 个解决方案

解决方案1
1 已采纳 2012-11-18 17:00:22

从 html 源文件中提取文本值

问题描述

1 个解决方案

解决方案1 1 已采纳 2012-11-18 17:00:22

解决方案1
1 已采纳 2012-11-18 17:00:22