简体   繁体   中英

Structured Streaming in pyspark

Am trying to stream data from another server to HBase and be able to define different column families in Python. I have looked around in the Spark docs and only seeing:

writestream.format('jdbc').start('jdbc:///')

How can I have the same implementations to write directly to HBase with the ability to map data to different column families?

您可以使用foreach (Scala或Java)将数据写入HBase: http : //spark.apache.org/docs/latest/structured-streaming-programming-guide.html#using-foreach

The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM