使用C＃从网站读取数据

Question

我有一个除了一些字符串之外什么都没有的网页。 没有图像，没有背景颜色或任何东西，只是一些长度不是很长的纯文本。

我只是想知道，什么是最好的（通过这种方式，我的意思是最快和最有效）的方式来传递网页中的字符串，以便我可以用它来做其他事情（例如在文本框中显示）？ 我知道WebClient，但我不确定它是否会做我想做的事情，而且即使它确实有效，我也不想尝试它，因为我上次做的时间大约需要30秒一个简单的操作。

任何想法，将不胜感激。

Answer 1

WebClient类应该能够处理您描述的功能，例如：

System.Net.WebClient wc = new System.Net.WebClient();
byte[] raw = wc.DownloadData("http://www.yoursite.com/resource/file.htm");

string webData = System.Text.Encoding.UTF8.GetString(raw);

或（进一步由Fredrick在评论中提出建议）

System.Net.WebClient wc = new System.Net.WebClient();
string webData = wc.DownloadString("http://www.yoursite.com/resource/file.htm");

当你说花了30秒时，你可以再扩展一下吗？ 关于为什么会发生这种情况的原因有很多。 缓慢的服务器，互联网连接，狡猾的实施等。

你可以降低一级并实现这样的事情：

HttpWebRequest webRequest = (HttpWebRequest)WebRequest.Create("http://www.yoursite.com/resource/file.htm");

using (StreamWriter streamWriter = new StreamWriter(webRequest.GetRequestStream(), Encoding.UTF8))
{
    streamWriter.Write(requestData);
}

string responseData = string.Empty;
HttpWebResponse httpResponse = (HttpWebResponse)webRequest.GetResponse();
using (StreamReader responseReader = new StreamReader(httpResponse.GetResponseStream()))
{
    responseData = responseReader.ReadToEnd();
}

但是，在一天结束时，WebClient类会为您包装此功能。 所以我建议您使用WebClient并调查30秒延迟的原因。

Answer 2

如果您正在下载文本，那么我建议使用WebClient并获取文本的流读取器：

        WebClient web = new WebClient();
        System.IO.Stream stream = web.OpenRead("http://www.yoursite.com/resource.txt");
        using (System.IO.StreamReader reader = new System.IO.StreamReader(stream))
        {
            String text = reader.ReadToEnd();
        }

如果这需要很长时间，那么它可能是网络问题或Web服务器上的问题。 尝试在浏览器中打开资源，看看需要多长时间。 如果网页非常大，您可能希望查看以块为单位进行流式传输，而不是像在该示例中那样一直读到最后。 查看http://msdn.microsoft.com/en-us/library/system.io.stream.read.aspx以了解如何从流中读取。

Answer 3

关于建议所以我建议您使用WebClient并调查30秒延迟的原因。

从问题System.Net.WebClient的答案不合理地慢

尝试设置Proxy = null;

WebClient wc = new WebClient（）; wc.Proxy = null;

感谢Alex Burtsev

Answer 4

 WebClient client = new WebClient();
            using (Stream data = client.OpenRead(Text))
            {
                using (StreamReader reader = new StreamReader(data))
                {
                    string content = reader.ReadToEnd();
                    string pattern = @"((https?|ftp|gopher|telnet|file|notes|ms-help):((//)|(\\\\))+[\w\d:#@%/;$()~_?\+-=\\\.&]*)";
                    MatchCollection matches = Regex.Matches(content,pattern);
                    List<string> urls = new List<string>();
                    foreach (Match match in matches)
                    {
                            urls.Add(match.Value);
                    }

              }

使用C＃从网站读取数据

问题描述

4 个解决方案

解决方案1
30 已采纳 2011-01-21 11:36:39

解决方案2
5 2011-01-21 11:40:20

解决方案3
1 2013-02-06 12:36:21

解决方案4
0 2017-01-28 14:23:59

使用C＃从网站读取数据

问题描述

4 个解决方案

解决方案1 30 已采纳 2011-01-21 11:36:39

解决方案2 5 2011-01-21 11:40:20

解决方案3 1 2013-02-06 12:36:21

解决方案4 0 2017-01-28 14:23:59

解决方案1
30 已采纳 2011-01-21 11:36:39

解决方案2
5 2011-01-21 11:40:20

解决方案3
1 2013-02-06 12:36:21

解决方案4
0 2017-01-28 14:23:59