How To Write Regex to Extract URL from Bing Search Result?

Question

I have this to extract URL from google search result ( https://www.google.com/search?q=myquery&num=100 )

@"(?<=<h3 class=\""r\""><a href=\""\/url\?q=)(.*?)(?=&amp;)";

Here's my code to extract URL from google search result

const string regexPattern = @"(?<=<h3 class=\""r\""><a href=\""\/url\?q=)(.*?)(?=&amp;)";

public static string[] TopUrls(string data)
    {
        Regex regex = new Regex(regexPattern);
        MatchCollection collection = regex.Matches(data);
        return collection.Cast<Match>()
            .Select(m => m.Value)
            .ToArray();
    }

string downloadUrl = "https://www.google.com" + "/search?q=" + keyword.ToString() + "&num=" + numResults + "&as_qdr=all&ei=LrUVVf7UMrPfsAS7lICgCw&sa=N&biw=1440&bih=690";
                fetch.Headers.Set(HttpRequestHeader.Host, "www.google.com");
                string data = fetch.DownloadString(downloadUrl);
                string[] results = TopUrls(data);

from that code i can extract each URL from google search result.

Here's the result: https : //www blogger com/ profile/ 15582992268736301561 https : //www blogger com/ profile /17377873899922361640

How to write regex for this URL? http://www.bing.com/search?q=myquery&count=100

Thank You :)

Answer 1

试试这样的<h2>*?<a\\s+[^>]*?href="([^"]*)"

Answer 2

Why not use Bing Search API s? If you really must parse the HTML, you're looking for algo results. Get the li tags with b_algo class and extract the URL from them.

Answer 3

Your first step is to use:

<cite>(.*?)</cite>

Then you need another regex to remove <strong> tags

How To Write Regex to Extract URL from Bing Search Result?

Question

3 answers

solution1
1 ACCPTED 2017-01-20 20:19:37

solution2
0 2017-01-20 19:48:38

solution3
0 2017-01-20 19:53:13

How To Write Regex to Extract URL from Bing Search Result?

Question

3 answers

solution1 1 ACCPTED 2017-01-20 20:19:37

solution2 0 2017-01-20 19:48:38

solution3 0 2017-01-20 19:53:13

solution1
1 ACCPTED 2017-01-20 20:19:37

solution2
0 2017-01-20 19:48:38

solution3
0 2017-01-20 19:53:13