简体   繁体   English

用 AWS Cloudsearch 替换 Google Site Search

[英]Replacing Google Site Search with AWS Cloudsearch

So I'm working on a site that has pretty specific global site search functionality that utilizes GSS which, as many of you already know, is going away in April.因此,我正在开发一个具有非常具体的全球站点搜索功能的站点,该功能利用 GS​​S,正如你们许多人所知,该功能将于 4 月停止使用。 I need to crawl the site and send XML over to Cloudsearch, but I'm kind of confused as to how to go about this and I haven't found much material on building a global site search using AWS Cloudsearch after scouring the internet for a couple of days.我需要爬取站点并将 XML 发送到 Cloudsearch,但我对如何进行此操作感到有些困惑,并且在搜索 Internet 以获取有关使用 AWS Cloudsearch 构建全球站点搜索的大量材料几天。 So far, I'm planning on crawling the site with Apache Nutch, but I would really appreciate some input.到目前为止,我正计划使用 Apache Nutch 来抓取该站点,但我非常感谢您提供一些意见。

Did you come across our blog?你有没有看到我们的博客? Index the web with AWS CloudSearch Index the web with StormCrawler (revisited) . 使用 AWS CloudSearch 索引 Web 使用 StormCrawler 索引 Web(重新访问) I described how to use Nutch and StormCrawler to index to AWS Cloudsearch.我描述了如何使用 Nutch 和 StormCrawler 为 AWS Cloudsearch 建立索引。

If you need the search to be hosted, I'd recommend Elasticsearch and Elastic Cloud instead.如果您需要托管搜索,我建议改为使用 Elasticsearch 和Elastic Cloud I found Cloudsearch slow, cumbersome and expensive and also there are more resources for Elasticsearch for StormCrawler and Apache Nutch.我发现 Cloudsearch 缓慢、繁琐且昂贵,而且 Elasticsearch 有更多资源可用于 StormCrawler 和 Apache Nutch。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM