How to get all possible image URLs from an RSS feed item?

Question

I am trying to use this example to get image URLs from http://www.nydailynews.com/cmlink/NYDN.Article.rss, but without success.

Could you help me find all the correct ways to get every possible image URL from an RSS feed item using the SyndicationItem class?

There is a draft solution here, but I guess there should be a more generic one.

Thank you!

// Requires: using System.Collections.Generic; using System.IO; using System.Linq;
//           using System.Xml; using System.ServiceModel.Syndication;
List<RssFeedItem> rssItems = new List<RssFeedItem>();
Stream stream = e.Result;
XmlReader response = XmlReader.Create(stream);
SyndicationFeed feeds = SyndicationFeed.Load(response);
foreach (SyndicationItem f in feeds.Items)
{
    RssFeedItem rssItem = new RssFeedItem();

    rssItem.Description = f.Summary.Text;

    // Only links with rel="enclosure" are inspected here.
    foreach (SyndicationLink enclosure in f.Links.Where(x => x.RelationshipType == "enclosure"))
    {
        Uri url = enclosure.Uri;
        long length = enclosure.Length;
        string mediaType = enclosure.MediaType;
        // AbsoluteUri keeps the scheme and host; AbsolutePath would return only the path.
        rssItem.ImageLinks.Add(url.AbsoluteUri);
    }
}

Answer 1

I found the solution.

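foreach (SyndicationElementExtension extension in f.ElementExtensions)
{
    XElement element = extension.GetObject<XElement>();

    foreach (var attribute in element.Attributes())
    {
        // Compare case-insensitively but keep the original value:
        // URL paths can be case-sensitive, so lower-casing the stored link may break it.
        string value = attribute.Value;
        bool isHttp = value.StartsWith("http://", StringComparison.OrdinalIgnoreCase);
        bool isImage = value.EndsWith(".jpg", StringComparison.OrdinalIgnoreCase)
                    || value.EndsWith(".png", StringComparison.OrdinalIgnoreCase)
                    || value.EndsWith(".gif", StringComparison.OrdinalIgnoreCase);
        if (isHttp && isImage)
        {
            rssItem.ImageLinks.Add(value); // Add the image link to some collection
        }
    }
}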
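
The prefix and suffix test in that loop can also be written with Uri parsing, so that a query string after the file name does not hide the extension. This is only a minimal sketch: the class name ImageUrlSketch, the method LooksLikeImageUrl, and the extension list are my own assumptions, not part of the original answer.

```csharp
using System;
using System.Linq;

public static class ImageUrlSketch
{
    // Hypothetical extension list; extend it as needed.
    private static readonly string[] ImageExtensions = { ".jpg", ".jpeg", ".png", ".gif" };

    // Returns true when the value is an absolute http/https URL
    // whose path (ignoring any query string) ends in an image extension.
    public static bool LooksLikeImageUrl(string value)
    {
        Uri uri;
        if (!Uri.TryCreate(value, UriKind.Absolute, out uri))
        {
            return false;
        }

        if (uri.Scheme != Uri.UriSchemeHttp && uri.Scheme != Uri.UriSchemeHttps)
        {
            return false;
        }

        // AbsolutePath excludes the query string and fragment.
        return ImageExtensions.Any(ext =>
            uri.AbsolutePath.EndsWith(ext, StringComparison.OrdinalIgnoreCase));
    }
}
```

Inside the attribute loop one would then call ImageUrlSketch.LooksLikeImageUrl(attribute.Value) and add the untouched attribute value to the list.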

Answer 2

XDocument xDoc = XDocument.Load("http://www.nydailynews.com/cmlink/NYDN.Article.rss");
XNamespace media = XNamespace.Get("http://search.yahoo.com/mrss/");

// The (string) casts yield null instead of throwing when an attribute is missing.
var images = xDoc.Descendants(media + "content")
    .Where(m => (string)m.Attribute("type") == "image/jpeg")
    .Select(m => (string)m.Attribute("url"))
    .ToArray();

--EDIT--

var images = feeds.Items
    .SelectMany(i => i.ElementExtensions
        .Select(e => (string)e.GetObject<XElement>().Attribute("url")))
    .Where(url => url != null) // skip extensions that have no url attribute
    .ToArray();
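
Since the question asks for a more generic approach, here is a rough sketch that looks at three common places in one pass: a plain RSS enclosure element, Media RSS media:content, and media:thumbnail. It works on a raw XElement for one item rather than on SyndicationItem. The names FeedImageSketch and GetImageUrls are hypothetical, and which elements a given feed actually uses is an assumption you should check against the feed.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;
using System.Xml.Linq;

public static class FeedImageSketch
{
    // Media RSS namespace, the same one used in the query above.
    private static readonly XNamespace Media = "http://search.yahoo.com/mrss/";

    // Collects image URLs from one <item> element.
    public static List<string> GetImageUrls(XElement item)
    {
        var urls = new List<string>();

        // <enclosure url="..." type="image/..."/>
        urls.AddRange(item.Elements("enclosure")
            .Where(e => IsImageType((string)e.Attribute("type")))
            .Select(e => (string)e.Attribute("url")));

        // <media:content url="..." type="image/..."/> or medium="image"
        urls.AddRange(item.Descendants(Media + "content")
            .Where(e => IsImageType((string)e.Attribute("type"))
                     || (string)e.Attribute("medium") == "image")
            .Select(e => (string)e.Attribute("url")));

        // <media:thumbnail url="..."/>
        urls.AddRange(item.Descendants(Media + "thumbnail")
            .Select(e => (string)e.Attribute("url")));

        // Drop missing url attributes and duplicates.
        return urls.Where(u => !string.IsNullOrEmpty(u)).Distinct().ToList();
    }

    private static bool IsImageType(string type)
    {
        return type != null
            && type.StartsWith("image/", StringComparison.OrdinalIgnoreCase);
    }
}
```

With a loaded document this could be used as xDoc.Descendants("item").SelectMany(FeedImageSketch.GetImageUrls). Matching on the MIME type prefix instead of "image/jpeg" alone also picks up PNG and GIF entries.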

Answer 3

Get a list of image URLs from a string:

var text = "your text with image links";
Regex regx = new Regex("http://([\\w+?\\.\\w+])+([a-zA-Z0-9\\~\\!\\@\\#\\$\\%\\^\\&\\*\\(\\)_\\-\\=\\+\\\\\\/\\?\\.\\:\\;\\'\\,]*)?\\.(?:jpg|bmp|gif|png)", RegexOptions.IgnoreCase);
MatchCollection matches = regx.Matches(text);
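
When the string is the HTML of an item's description, a related option is to read the src attribute of img tags instead of matching on file extensions. The following is only a sketch under that assumption: HtmlImageSketch and GetImgSources are made-up names, and a regex like this will not cope with every kind of malformed HTML.

```csharp
using System.Collections.Generic;
using System.Linq;
using System.Text.RegularExpressions;

public static class HtmlImageSketch
{
    // Captures the quoted value of the src attribute of an <img> tag.
    private static readonly Regex ImgSrc = new Regex(
        "<img[^>]*?\\ssrc\\s*=\\s*[\"']([^\"']+)[\"']",
        RegexOptions.IgnoreCase);

    // Returns the src values of all <img> tags found in the given HTML.
    public static List<string> GetImgSources(string html)
    {
        return ImgSrc.Matches(html ?? string.Empty)
            .Cast<Match>()
            .Select(m => m.Groups[1].Value)
            .ToList();
    }
}
```

In the question's loop this could be fed with the summary text, for example HtmlImageSketch.GetImgSources(rssItem.Description).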


Answer 1: score 4, accepted, 2012-05-10 18:43:27
Answer 2: score 2, 2012-05-10 17:50:36
Answer 3: score 2, 2013-11-18 12:01:09
