简体   繁体   中英

Parsing from aspx page to text document

Using C# or VB.Net Code to Download from a link (http://24.173.220.131/carter/currentinmates.aspx) .Then Parsing the attributes from the page into text document.

Output :

Name|BookDate|Charge|Bail|Release|Agency ANDERSON, JAYME RAMONE|05/04/2012|SENTENCED|$0.00|5/2/2022|TURNED SELF IN ANDERSON, JEFFERY CONARD|02/06/2012|SENTENCED|$0.00|2/5/2022|CARTER COUNTY SHERIFF DEPT

Add a reference to CsQuery , install it in NuGet or find it here https://github.com/jamietre/CsQuery

using System;
using System.Collections.Generic;
using System.Collections.Concurrent;
using System.Diagnostics;
using System.Linq;
using System.Threading;
using System.Threading.Tasks;
using System.Text;
using CsQuery;

class Program
{
    static void Main(string[] args)
    {

        var stringBuilder = new StringBuilder();
        var url = "http://24.173.220.131/carter/currentinmates.aspx";
        CQ.CreateFromUrlAsync(url)
           .Then(response =>
           {
               var dom = response.Dom;
               var trs = dom.Select("#dgrdLandRecords tr").Elements;
               foreach (var row in trs)
               {
                   stringBuilder.AppendLine();
                   var tds = row.ChildElements.ToList();

                   for (int i = 1; i < tds.Count; i++)
                   {
                       stringBuilder.Append(tds[i].Cq().Text());
                       stringBuilder.Append("|");
                   }
               }
               var result = stringBuilder.ToString();
               Console.Write(result);
           });


        Console.WriteLine("Press any key to exit.");
        Console.ReadKey();
    }
}

Use the WebClient class its exactly what you're looking for.

Public Class Test

Public Shared Sub Main(args() As String)
    Dim sURL as String
    If args Is Nothing OrElse args.Length = 0 Then
        'Throw New ApplicationException("Specify the URI of the resource to retrieve.")
         sURL = http://24.173.220.131/carter/currentinmates.aspx"
    Else
        sURL = args(0)
    End If
    Dim client As New WebClient()

    ' Add a user agent header in case the 
    ' requested URI contains a query.
    client.Headers.Add("user-agent", "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.2; .NET CLR 1.0.3705;)")

    Dim data As Stream = client.OpenRead(sURL)
    Dim reader As New StreamReader(data)
    Dim s As String = reader.ReadToEnd()
    Console.WriteLine(s)

    'Here write the variable `s` to a Text file, eg My.File.Create(s)

    data.Close()
    reader.Close()
End Sub 'Main
End Class 'Test

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM