
Apache Stanbol scalability and real-world applications

I'm starting a project with requirements such as NLP, storage of semantic data, and content management, and Apache Stanbol seems like a nice fit. I'm not sure it's ready, though, so I'm trying to make a proper assessment before I start working with it. A few things worry me:

  1. Stanbol seems a bit young and immature (the newest version is 0.12). Has anybody used it in a commercial project/application/setup? (I failed to find this information online.) What is the scale of those projects?

  2. How horizontally scalable is Stanbol? What are its cloud/clustering capabilities? As far as I know, it relies on Apache Jena for storage, and Jena storage isn't horizontally scalable, which would make Stanbol unable to scale horizontally as well. This is my current understanding; please correct me if I'm wrong. Maybe Jena can be swapped for another RDF storage provider and I'm just not aware of it?

  3. Learning resources for Stanbol seem a little scarce. Does anyone know of a place/book/whatever where I can get a better understanding of Stanbol under the hood (other than the official Stanbol website and the IKS website)? Are there any good alternatives? I know there are nice alternatives for NLP (e.g. GATE, UIMA), but they lack CMS capabilities.

Thanks.

To your questions:

  • 1) I've been working on a project involving Stanbol (version 0.10). It's still in the pre-production stage. For the CMS, we evaluated JackRabbit and Alfresco; Alfresco (CMIS) turned out to be the better choice in our case. What I like about Stanbol is the enhancement chains and the set of Enhancement Engines that come by default. This is a small to mid-size project.
  • 3) I found this book (Instant Apache Stanbol, Packt Publishing) very practical and useful for my work, especially the sections on Entity hubs and Enhancement engines.
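For anyone who wants to try the enhancement chains mentioned above, here is a minimal sketch of how text can be posted to the Stanbol enhancer over its REST interface. It assumes a Stanbol launcher running locally on port 8080 and a chain named `default`; both are assumptions about your setup, not something from the project described above, and the helper names are made up for illustration.

```python
import urllib.parse
import urllib.request

# Assumption: a local Stanbol launcher on its usual port; adjust to your setup.
STANBOL_BASE = "http://localhost:8080"


def build_enhance_request(text, chain=None, base=STANBOL_BASE):
    """Build (but do not send) a POST request for the Stanbol enhancer.

    With chain=None the request targets /enhancer; otherwise it targets
    /enhancer/chain/<chain>. The chain name is whatever you configured.
    """
    url = base.rstrip("/") + "/enhancer"
    if chain:
        url += "/chain/" + urllib.parse.quote(chain, safe="")
    return urllib.request.Request(
        url,
        data=text.encode("utf-8"),
        headers={
            "Content-Type": "text/plain; charset=utf-8",
            "Accept": "text/turtle",  # ask for the enhancements as RDF (Turtle)
        },
        method="POST",
    )


def enhance(text, chain=None, base=STANBOL_BASE):
    """Send the text to the enhancer and return the raw RDF response."""
    request = build_enhance_request(text, chain, base)
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")


# Usage (needs a running Stanbol instance, so it is left commented out):
# print(enhance("Paris is the capital of France.", chain="default"))
```

Building the request separately from sending it makes it easy to check the URL and headers without a server running.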

A viable option is to use Redlink, which offers content analysis and linked data services in the cloud using Apache Stanbol and Apache Marmotta in the back end.

The Redlink team has worked on IKS and Apache Stanbol; for these reasons, getting in contact with them can be a good starting point when deciding to use these technologies in production environments.
