
Alternatives for aggregating a lot of data fast

I'm using InfiniDB to aggregate a lot of rows (about 100-500 million) down to fewer than 5,000 groups. (In most queries the 100-500 million rows are filtered first, so the aggregation actually works on fewer rows.)

It is used as a prototype of a travel search engine for a website, and you can think of it as "give me the best price per accommodation for all combinations of rooms for a specific number of persons".
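
To make the shape of the workload concrete, the single-table case is roughly this (table and column names are only illustrative, not my real schema):

    -- hypothetical schema, just to show the shape of the data
    CREATE TABLE room_offer (
        accommodation_id INT,
        room_type_id     INT,
        persons          INT,            -- how many persons this room sleeps
        travel_date      DATE,
        price            DECIMAL(10,2)
    );

    -- "best price per accommodation" for one date and party size,
    -- aggregating the filtered rows down to a few thousand groups
    SELECT accommodation_id, MIN(price) AS best_price
    FROM room_offer
    WHERE travel_date = '2013-08-01'
      AND persons = 2
    GROUP BY accommodation_id;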

It works fine until I have to self-join the table several times to find the best-price combination (the data is already reduced with logical filters, so the number of combinations per join is reduced too).
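
The combination step is roughly a self-join like the one below (again with illustrative names); with three or four rooms per combination there are several of these joins stacked on top of each other, and that is where it gets slow:

    -- combine two rooms of the same accommodation so that together
    -- they sleep 4 persons, then keep the cheapest combination
    SELECT a.accommodation_id,
           MIN(a.price + b.price) AS best_combined_price
    FROM room_offer a
    JOIN room_offer b
      ON  b.accommodation_id = a.accommodation_id
      AND b.travel_date      = a.travel_date
      AND b.room_type_id    >= a.room_type_id   -- don't count (a,b) and (b,a) twice
    WHERE a.travel_date = '2013-08-01'
      AND a.persons + b.persons = 4
    GROUP BY a.accommodation_id;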

It is possible for me to split the content of the table into several tables, and that works with acceptable performance, but now I'm asking myself whether InfiniDB (or column-oriented databases in general) is the best solution for this problem.

What are the alternatives? I think any map/reduce mechanism (MongoDB, Hadoop) would be much slower, or is there a point I'm missing about them?

It should not require more than 2-5 servers.

To make it clear: I don't expect a "this would be perfect!" answer, just good hints at alternatives. I also suspect that InfiniDB is a bad solution for my scenario.

Thanks for your thoughts!

I used InfiniDB 3 scaled out over 9 machines with tables of more than 30 billion rows without any problems, even with self-joins.

Give me an example DDL + DQL. Maybe I can help you improve the query.

Before InfiniDB we tried HBase / Cassandra / MongoDB and we didn't like the technology. For 500 million rows you can use plain MySQL if you only need to do this 2-3 times per day.
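
A minimal sketch of what I mean, assuming a simple offers table like the one in the question: rebuild a small summary table a few times per day (from cron, for example) and let the website read only the pre-aggregated groups.

    -- small result table, refreshed 2-3 times per day
    CREATE TABLE best_price_summary (
        accommodation_id INT,
        persons          INT,
        travel_date      DATE,
        best_price       DECIMAL(10,2),
        PRIMARY KEY (accommodation_id, persons, travel_date)
    );

    -- periodic refresh: the heavy GROUP BY runs offline, not per request
    TRUNCATE TABLE best_price_summary;

    INSERT INTO best_price_summary (accommodation_id, persons, travel_date, best_price)
    SELECT accommodation_id, persons, travel_date, MIN(price)
    FROM room_offer
    GROUP BY accommodation_id, persons, travel_date;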
