
How does HDFS delete blocks?

I am new to Hadoop.

After deploying the cluster, I ran the benchmark: hadoop jar hadoop-0.20.0-test.jar TestDFSIO -write -nrFiles 1000 -fileSize 10.

After the job finished, I looked at the Cluster Summary, which said: 1000 files and directories, 1000 blocks = 2000 total.

Then I used the shell command to delete the corresponding files, after which the Cluster Summary showed: 0 files and directories, 1000 blocks = 1000 total.
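(For reference, the deletion was done roughly like this; TestDFSIO writes under /benchmarks/TestDFSIO by default, so adjust the path if your setup differs:)

# Recursively remove the TestDFSIO output (0.20-era syntax)
hadoop fs -rmr /benchmarks/TestDFSIO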

I thought that after the deletion there should be no files and no blocks in the Cluster Summary. However, just before I asked this question there were still 1000 blocks, and now there are 0 blocks.

How can this happen?

Deleting a file just instructs the NameNode to remove the file from its metadata store. After this has been committed, the NameNode reaches out to the DataNodes that host the blocks and instructs them to delete those blocks.

The actual block deletion on the DataNodes can take a few seconds (depending on the number of DataNodes, the block size, and the number of blocks to delete). The DataNodes then report back to the NameNode, at which point it updates the global block count. As you observed, after a few moments the total number of blocks drops back to 0.
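If you want to watch this happen, one way (assuming the stock Hadoop command-line tools on your cluster) is to check the NameNode's view of the blocks right after the delete and again a little later:

# Total number of blocks the NameNode currently knows about
hadoop fsck /

# Per-DataNode report; "DFS Used" shrinks as invalidated blocks are physically removed
hadoop dfsadmin -report

Running these a few seconds apart should show the block count falling to 0 as the DataNodes process the deletions and send their reports back.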

Block deletion is performed asynchronously.
