
The best NoSQL solution for logging

I am working on a project that logs a lot of information about viewers of an online streaming platform. The problem with our current MySQL solution is that it is too slow to query. Even with scaling and better performance tuning, it will not work, because there is simply too much data being written and read in real time.

What would be a good (ideally the best) NoSQL solution for me?

Extra:

  • We currently use Amazon Web Services, where we store our data.
  • A Java API and an open-source solution are preferred.
  • Object-oriented.

Not exactly a NoSQL solution, but have you looked at Scribe (from Facebook)? You can use http://code.google.com/p/scribe-log4j/ to write to it from Java.

The first challenge you may face is how to collect a huge amount of data reliably while keeping it easy to manage. There are several open-source log collector implementations, such as syslog, Fluentd, Scribe, and Flume :)

The bigger problem is how to store and process the data. As you pointed out, a NoSQL solution works really well here, but you need to choose among them depending on your data volume.

At first you can use MongoDB to store all of your data, but at some point you end up using Apache Hadoop to build a massively scalable architecture.

The point here is that you should have a distributed logging layer that abstracts away the storage backend, and then choose the right NoSQL solution for your data volume.
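To make the idea of a logging layer that hides the storage backend concrete, here is a minimal Java sketch. All names in it (`LogSink`, `InMemorySink`, `ViewerLogger`, and the event fields) are hypothetical and chosen only for illustration. The in-memory sink stands in for a real backend such as MongoDB or HDFS and is not how any of the tools mentioned above work internally.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical abstraction: application code only ever sees this interface.
interface LogSink {
    void write(Map<String, Object> event);
}

// Stand-in backend that keeps events in memory. A MongoDB- or HDFS-backed
// sink would implement the same interface (assumption, not shown here).
final class InMemorySink implements LogSink {
    private final List<Map<String, Object>> events = new ArrayList<>();

    @Override
    public synchronized void write(Map<String, Object> event) {
        // Store an immutable copy so later changes by the caller do not leak in.
        events.add(Map.copyOf(event));
    }

    public synchronized int size() {
        return events.size();
    }
}

// Application-facing logger. It depends on LogSink, not on a concrete store.
final class ViewerLogger {
    private final LogSink sink;

    ViewerLogger(LogSink sink) {
        this.sink = sink;
    }

    void logView(String viewerId, String streamId, long timestampMillis) {
        Map<String, Object> event = Map.of(
                "viewerId", viewerId,
                "streamId", streamId,
                "ts", timestampMillis);
        sink.write(event);
    }
}

class LoggingLayerDemo {
    public static void main(String[] args) {
        InMemorySink sink = new InMemorySink();
        ViewerLogger logger = new ViewerLogger(sink);
        logger.logView("viewer-1", "stream-9", System.currentTimeMillis());
        logger.logView("viewer-2", "stream-9", System.currentTimeMillis());
        System.out.println("events stored: " + sink.size());
    }
}
```

With this shape, moving from one store to another as the data volume grows means writing a new `LogSink` implementation and changing the line that constructs it. The code that records viewer events stays the same.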

Here are some links on putting Apache logs into MongoDB or Hadoop HDFS with Fluentd.

I would spend some time looking at these options:

All of these solutions have their pros and cons, but their wikis should provide enough information to get you started.
