简体   繁体   中英

To store data in a hcatalog table, the table must be empty. What about incremetal loading?

as far as I'm concerned there is no support for incremental loading in Hadoop. I'm playing a bit with pig and hive. For testing purposes I just read existing values from table in hcatalog, union it with new data, and than create new table in hcatalog, insert data. Than it needs to maintained with hive to delete old table, rename new one etc...

Other solution is to add new data to a partition in existing table.

Generaly - can you suggest BEST solution for incremental loading?

Regards

Pawel

Adding partition data to Hive/HCatalog table is BEST solution for incremental updates. Further you can have static and dynamic partitions.

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM