
Read/Write files on hdfs using Python

I am new to Python and want to read a file from HDFS, which I have managed to do.

After reading the file, I perform some string operations on its contents, and I want to write the modified contents to an output file.

I read the file using subprocess (which took a lot of time), because open didn't work for me.

from subprocess import Popen, PIPE
cat = Popen(["hadoop", "fs", "-cat", "/user/hdfs/test-python/input/test_replace"], stdout=PIPE)

Now the question is how to write the modified contents to the output file.
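One possible way to stay with the subprocess approach is to pipe the modified text into `hadoop fs -put`, which reads from stdin when the source is `-`. The following is only a minimal sketch: the helper names `hdfs_put_command` and `write_text` are made up for illustration, and it assumes the `hadoop` CLI is on the PATH and that overwriting the destination (`-f`) is acceptable.

```python
import subprocess


def hdfs_put_command(dest_path, overwrite=True):
    """Build the CLI command that copies stdin to dest_path on HDFS.

    Hypothetical helper: '-' as the source tells `hadoop fs -put` to read stdin.
    """
    cmd = ["hadoop", "fs", "-put"]
    if overwrite:
        cmd.append("-f")  # replace the destination file if it already exists
    cmd += ["-", dest_path]
    return cmd


def write_text(cmd, text):
    """Run cmd, feeding text (encoded as UTF-8) to its standard input.

    Raises subprocess.CalledProcessError if the command exits with an error.
    """
    subprocess.run(cmd, input=text.encode("utf-8"), check=True)
```

With a string `modified` holding the new contents, the call would look like `write_text(hdfs_put_command(out_path), modified)`, where `out_path` is whatever HDFS output location is wanted. Each call still starts a JVM, so this is likely to be as slow as the `-cat` call.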

Your help is highly appreciated.

You can use a library to read from and write to HDFS, for example https://github.com/mtth/hdfs
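As a rough illustration of that suggestion, here is a minimal sketch using the `hdfs` package from that repository (installed with `pip install hdfs`). It assumes WebHDFS is enabled on the cluster; the helper names `make_client` and `rewrite_hdfs_file` are hypothetical, and the NameNode URL and user are placeholders you would have to supply.

```python
def make_client(url, user=None):
    """Create a WebHDFS client for the given NameNode URL (an assumption:
    WebHDFS must be enabled and reachable at that URL)."""
    # Imported here so the helper below can be used without the package.
    from hdfs import InsecureClient
    return InsecureClient(url, user=user)


def rewrite_hdfs_file(client, src_path, dst_path, transform):
    """Read src_path as text, apply transform, and write the result to dst_path."""
    # client.read(...) is a context manager that yields a file-like reader.
    with client.read(src_path, encoding="utf-8") as reader:
        text = reader.read()
    # overwrite=True replaces dst_path if it already exists.
    client.write(dst_path, data=transform(text), encoding="utf-8", overwrite=True)
```

For the file in the question this might be called as `rewrite_hdfs_file(make_client(url, user), "/user/hdfs/test-python/input/test_replace", out_path, lambda s: s.replace("old", "new"))`, where `url`, `user`, `out_path`, and the replacement itself are stand-ins. Because this talks to HDFS over HTTP instead of launching the `hadoop` command, it may also avoid the startup cost of the subprocess approach. It does read the whole file into memory, so very large files would need a streaming variant.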

