[英]Writing a Parquet.Net file using RLE_DICTIONARY encoding
The Parquet.Net
specification says I can read and write in RLE_DICTIONARY
encoding. Parquet.Net
规范说我可以用RLE_DICTIONARY
编码读写。 I am trying to read the docs of Parquet.Net
and the github repo code, but how do I write my DataTable to use this encoding?我正在尝试阅读
Parquet.Net
的文档和 github 回购代码,但是如何编写我的 DataTable 以使用这种编码?
The demo I am basing this off of is found here:我基于此的演示可在此处找到:
Stack Overflow Parquet Size Discrepancy Question Stack Overflow Parquet 尺寸差异问题
I'm finding this answer either, I just found an answer like this: if you are using spark just write DataFrame with我也正在寻找这个答案,我刚刚找到了这样的答案:如果您使用的是 spark,只需将 DataFrame 与
df.write
.option("parquet.enable.dictionary",true) //true is RLE_DICTIONARY and false plain (maybe)
.parquet(yourPath)
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.