Brisk for small files

Original 2011-09-28 10:46:00 5 1 hadoop/ cassandra-0.7/ brisk

I am new to Cassandra and Hadoop. While looking into integrating the two, I came across Brisk. From its description, I understand that Brisk replaces HDFS with CassandraFS. Is this replacement a solution to Hadoop's small-files problem? If so, what about large files? I currently need to implement a resource store that holds both large binary data files with their metadata and small files such as images.

1 answer

It's both, really (although I think Brisk has now been rolled into a commercial product, DataStax Enterprise, and isn't being actively developed in its own right).

Brisk includes CassandraFS (cfs), which is a drop-in replacement for HDFS, so it supports large files. Under the hood, these are broken into chunks and stored in Cassandra rows/columns.
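To make the chunking idea concrete, here is a minimal sketch in Python. It is not the actual CassandraFS code or storage layout: the in-memory `store` dict, the `write_file`/`read_file` helpers, and the tiny `CHUNK_SIZE` are all hypothetical, chosen only to show a file being split into chunks that live as columns under one row key.

```python
from collections import defaultdict

# Hypothetical chunk size, kept tiny so the example is easy to follow.
CHUNK_SIZE = 4

# Toy stand-in for a column family: row key -> {column name -> value}.
# This is an assumption for illustration, not the real cfs schema.
store = defaultdict(dict)


def write_file(path: str, data: bytes, chunk_size: int = CHUNK_SIZE) -> int:
    """Split data into fixed-size chunks, one column per chunk, and return the chunk count."""
    store[path].clear()
    count = 0
    for offset in range(0, len(data), chunk_size):
        store[path][count] = data[offset:offset + chunk_size]
        count += 1
    return count


def read_file(path: str) -> bytes:
    """Reassemble a file by concatenating its chunks in column order."""
    row = store[path]
    return b"".join(row[index] for index in sorted(row))
```

Calling `write_file("/data/big.bin", payload)` followed by `read_file("/data/big.bin")` should return the original bytes, regardless of how many chunks the payload was split into.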

For small files, you can store the data in native Cassandra rows rather than in CassandraFS, and run Hadoop jobs directly over those rows.
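As a rough illustration of the small-file approach, the sketch below keeps each small file as one row and runs a map-style pass over the rows. It uses a plain Python dict instead of a real Cassandra client or Hadoop job, and the column names (`content_type`, `data`) and the `map_row`/`run_job` helpers are assumptions made up for this example.

```python
from collections import Counter

# Toy stand-in for a column family of small files: row key -> columns.
# Column names are hypothetical.
small_files = {
    "logo.png": {"content_type": "image/png", "data": b"\x89PNG"},
    "icon.png": {"content_type": "image/png", "data": b"abc"},
    "note.txt": {"content_type": "text/plain", "data": b"hello"},
}


def map_row(key, columns):
    """Emit (content type, byte count) for a single row, like a map step."""
    yield columns["content_type"], len(columns["data"])


def run_job(rows):
    """Sum the emitted byte counts per content type, like a reduce step."""
    totals = Counter()
    for key, columns in rows.items():
        for content_type, size in map_row(key, columns):
            totals[content_type] += size
    return dict(totals)
```

The point of the design is that each file is read as a single row, so the job iterates over rows rather than opening many tiny files on a distributed filesystem.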




solution1 0 ACCEPTED 2011-11-14 16:36:26
