简体   繁体   English

大量或非规范化记录的存储解决方案

[英]Storage solution for large number or denormalized records

We currently have a MSSQL server with the data stored in a denormalized format in a single table (per record type) where there can be millions of rows containing many columns in some cases. 当前,我们有一个MSSQL服务器,其数据以非规范化格式存储在单个表中(每种记录类型),在某些情况下,其中可能有数百万行包含许多列。 The requirement of the application is to read the data for each record in the application ie each matching row must be sent and read by the application (using a C# DataReader approach), so no aggregations can be made in the SQL server since the actual data is needed. 应用程序的要求是读取应用程序中每个记录的数据,即每个匹配的行必须由应用程序发送和读取(使用C#DataReader方法),因此由于实际数据无法在SQL Server中进行聚合是必需的。

The querying are in most cases pretty simple on a Date field (integer represented) and on two or three text columns. 在大多数情况下,在“日期”字段(表示整数)和两个或三个文本列上的查询非常简单。

Based on the requirement do you guys think that SQL server a good fit or are there other data storages ex: NoSQL solutions thst perhaps would be better/easier to use. 根据需求,你们认为SQL Server非常适合,还是存在其他数据存储,例如:NoSQL解决方案也许会更好/更易于使用。

Some of our systems contains hundreds of gigabytes of data in this format and its read only data once imported ie data is never updated. 我们的某些系统以这种格式包含数百GB的数据,并且一旦导入即只读数据,即数据永远不会更新。

The bottleneck for us right now seems to be Disk IO performance when returning large results, and because of the size there are no way of having everything cached at all time. 对于我们来说,目前的瓶颈似乎是返回大结果时的磁盘IO性能,并且由于大小原因,无法始终缓存所有内容。

Any tips on different storage solutions that we should look into. 我们应该研究的有关不同存储解决方案的所有技巧。

Thanks in advance. 提前致谢。

SQL Server is a good fit and come back when you have real amounts of data. SQL Server非常适合,当您有大量数据时再回来。 Hundreds of GB is a joke - this is 2014, not 1990. 数百GB是个玩笑-这是2014年,而不是1990年。

The bottleneck for us right now seems to be Disk IO performance 目前,我们的瓶颈似乎是磁盘IO性能

What discs? 什么碟?

The use of ALL SSD setups in such scenarios is standard for at least 5 years now. 在至少5年的时间里,在这种情况下使用ALL SSD设置是标准的。 I have personally a 3000gb analysis database and am pulling more than 1gb/second from the underlying SSD. 我个人有一个3000gb的分析数据库,并且正在从基础SSD上拉出超过1gb /秒的速度。

OBVIOUSLY you need hardware appropriate to your requirements. 显然,您需要适合您要求的硬件。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM