简体   繁体   English

使用 RLE_DICTIONARY 编码编写 Parquet.Net 文件

[英]Writing a Parquet.Net file using RLE_DICTIONARY encoding

The Parquet.Net specification says I can read and write in RLE_DICTIONARY encoding. Parquet.Net规范说我可以用RLE_DICTIONARY编码读写。 I am trying to read the docs of Parquet.Net and the github repo code, but how do I write my DataTable to use this encoding?我正在尝试阅读Parquet.Net的文档和 github 回购代码,但是如何编写我的 DataTable 以使用这种编码?

The demo I am basing this off of is found here:我基于此的演示可在此处找到:

Stack Overflow Parquet Size Discrepancy Question Stack Overflow Parquet 尺寸差异问题

I'm finding this answer either, I just found an answer like this: if you are using spark just write DataFrame with我也正在寻找这个答案,我刚刚找到了这样的答案:如果您使用的是 spark,只需将 DataFrame 与

df.write
.option("parquet.enable.dictionary",true) //true is RLE_DICTIONARY and false plain (maybe)
.parquet(yourPath)

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM