简体   繁体   English

列式数据库比较和DBA工作

[英]Columnar Database Comparisons and DBA efforts

I'm trying to find a database solution and I came across Infobright and Amazon Redshift as potential solutions. 我正在尝试寻找数据库解决方案,我遇到了InfobrightAmazon Redshift作为潜在的解决方案。 Both are columnar databases. 两者都是柱状数据库。 Infobright has been around for quite sometime whereas Amazon Redshift is newer. Infobright已经出现了很长一段时间,而亚马逊Redshift则更新。

  1. What is the DBA effort between Infobright and Amazon Redshift? Infobright和Amazon Redshift之间的DBA工作是什么?
  2. How accessible is Infobright (API, query interface, etc.) vs AWS? Infobright(API,查询界面等)与AWS相比如何?
  3. Where do both sit in your system architecture? 两者都位于您的系统架构中? Do the operate as a layer on top of your traditional RDBMS? 在传统的RDBMS之上进行操作吗?
  4. What is the DevOps effort to setting up both Infobright and Redshift? 什么是DevOps设置Infobright和Redshift的努力?

I'm leaning a bit more towards Redshift because my application is hosted on AWS and I thought this would create tangible benefits in the long-run since everything is in AWS. 我更倾向于Redshift,因为我的应用程序托管在AWS上,我认为这将在长期内创造切实的好处,因为一切都在AWS中。 Thank you in advance! 先感谢您!

Firstly, I'll admit that I work for Infobright. 首先,我承认我为Infobright工作。 I've done significant research into Redshift, and I feel I can give an honest opinion. 我对Redshift做了大量研究,我觉得我可以给出一个诚实的意见。 I just wrote up a comparison between the two technologies; 我刚刚写了两种技术之间的比较; it can be found here: https://www.infobright.com/wp-content/plugins/download-monitor/download.php?id=37 它可以在这里找到: https//www.infobright.com/wp-content/plugins/download-monitor/download.php?id = 37

  1. DBA Effort - Infobright requires very little administration. DBA努力 - Infobright需要很少的管理。 You cannot index; 你不能索引; you don't need to partition/etc. 你不需要分区/等。 It's an SMP architecture and scales well. 它是一个SMP架构,可以很好地扩展。 Thus, you won't be dealing with multiple nodes. 因此,您将不会处理多个节点。 Redshift is also fairly simple. Redshift也相当简单。 You will need to maintain sorts as well as ensure Analyse is run enough. 您需要维护各种类型以及确保Analyze运行足够。

  2. Infobright uses a MySQL Shell. Infobright使用MySQL Shell。 Thus, any tool that can utilize MySQL can utilize Infobright. 因此,任何可以利用MySQL的工具都可以使用Infobright。 Therefore, you have the same set of tools/interfaces/APIs for Infobright as you do with MySQL. 因此,您使用与Infobright相同的工具/接口/ API集合,就像使用MySQL一样。 AWS does have an SQL interface, and it does have some API capabilities. AWS确实有一个SQL接口,它确实有一些API功能。 It does require that you load directly from S3. 它确实需要您直接从S3加载。 Infobright loads from flat files and named pipes from local or remote servers. Infobright从本地或远程服务器的平面文件和命名管道加载。

  3. Both databases are analytic databases. 两个数据库都是分析数据库。 You would not want to use either as a transactional database. 您不希望将它们用作事务数据库。 Instead, you typically push data from your transactional system to your analytic database. 相反,您通常会将数据从事务系统推送到分析数据库。

  4. DevOps to setup Infobright will be lower than Redshift. 设置Infobright的DevOps将低于Redshift。 However, Redshift is not that overly complicated either. 但是,Redshift也不是那么复杂。 Maintenance of the environment is more of a requirement for Redshift, though. 但是,维护环境对Redshift来说更是必需。

Infobright does have many AWS-specific installations. Infobright确实有许多特定于AWS的安装。 In fact, we have implementations that approach nearly 100TB of raw storage on one server. 事实上,我们的实现在一台服务器上接近100TB的原始存储。 That said, Redshift with many nodes can achieve petabyte scale on an implementation. 也就是说,具有许多节点的Redshift可以在实现上达到PB级。

There are other factors that can impact your choice. 还有其他因素会影响您的选择。 For example, Redshift has very nice failover/HA options already built-in. 例如,Redshift已经内置了非常好的故障转移/ HA选项。 On the flipside, Infobright can support many concurrent queries and users; 另一方面,Infobright可以支持许多并发查询和用户; Redshift limits queries to 15 regardless of cluster size. 无论群集大小如何,Redshift都将查询限制为15。

Take a look at the document, and feel free to contact me if you have any specific questions about either technology. 请查看该文档,如果您对这两种技术有任何具体问题,请随时与我联系。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM