简体   繁体   中英

Dataset for parquet format

I have a mapreduce program which I want to take in a parquet file. I am not able to find any parquet sample dataset on any repo or site.

Please suggest some.

I think you may just write a MR to read a dfs file and write as a parquet file.

Commands:

Use the ALTER command to set the store.format option.

ALTER SESSION SET `store.format` = 'parquet';
ALTER SYSTEM SET `store.format` = 'parquet';

Source

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM