
Hadoop (HDFS) - file versioning

At the moment my application has a user file system (Apache CMIS). As it grows, I'm considering moving to Hadoop (HDFS), since we also need to run some statistics on it. The problem: the current file system provides versioning of files. When I read about Hadoop (HDFS) and file versioning, I mostly found that I would have to write this versioning layer myself. Is there already something available to manage versioning of files in HDFS, or do I really have to write it myself? (I don't want to reinvent the wheel, but I haven't found a proper solution either.)

Answer

For full details: see comments on answer(s) below

Hadoop (HDFS) doesn't support versioning of files. You can get this functionality by combining Hadoop with (Amazon) S3: Hadoop uses S3 as the file system (without HDFS blocks; durability is provided by S3 instead), and you get the file versioning that S3 provides. Hadoop will still use YARN for the distributed processing.
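As a rough sketch, pointing Hadoop at S3 is usually done through the `s3a` connector in `core-site.xml`. The bucket name and credentials below are placeholders, and in practice you would prefer an IAM role or credential provider over plaintext keys:

```xml
<!-- core-site.xml: minimal s3a setup (illustrative; bucket and keys are placeholders) -->
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>s3a://my-example-bucket</value>
  </property>
  <property>
    <name>fs.s3a.access.key</name>
    <value>YOUR_ACCESS_KEY</value>
  </property>
  <property>
    <name>fs.s3a.secret.key</name>
    <value>YOUR_SECRET_KEY</value>
  </property>
</configuration>
```

With this in place, MapReduce/Spark jobs can read and write `s3a://` paths while YARN handles scheduling as usual.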

Versioning is not possible with HDFS.
Instead you can use Amazon S3, which provides versioning and is also compatible with Hadoop.
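For illustration, S3 versioning is enabled per bucket with the AWS CLI (the bucket name and prefix here are placeholders; this assumes configured AWS credentials):

```shell
# Enable versioning on an S3 bucket; from then on, overwrites and
# deletes create new versions instead of destroying old data.
aws s3api put-bucket-versioning \
  --bucket my-example-bucket \
  --versioning-configuration Status=Enabled

# List all stored versions of objects under a prefix.
aws s3api list-object-versions \
  --bucket my-example-bucket \
  --prefix docs/
```

Older versions can then be fetched by passing `--version-id` to `aws s3api get-object`.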

HDFS supports snapshots. I think that's as close as you can get to "versioning" with HDFS.
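A minimal sketch of HDFS snapshots, assuming a running cluster (the directory and file names are placeholders):

```shell
# Mark a directory as snapshottable (requires superuser privileges).
hdfs dfsadmin -allowSnapshot /user/alice/data

# Take a named, read-only, point-in-time snapshot of the directory.
hdfs dfs -createSnapshot /user/alice/data v1

# Old state is visible under the hidden .snapshot directory ...
hdfs dfs -ls /user/alice/data/.snapshot/v1

# ... and a file can be "restored" by copying it back out.
hdfs dfs -cp /user/alice/data/.snapshot/v1/report.csv /user/alice/data/report.csv
```

Note that snapshots capture a directory tree at a point in time; they are not automatic per-file version histories, so you would still schedule them (or trigger them from your application) to approximate versioning.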

