Structured Streaming in pyspark

Question

I am trying to stream data from another server into HBase and to define different column families from Python. I have looked through the Spark docs and only see:

writeStream.format('jdbc').start('jdbc:///')

How can I do the same thing to write directly to HBase, with the ability to map data to different column families?

Answer 1

You can use foreach (Scala or Java) to write the data to HBase: http://spark.apache.org/docs/latest/structured-streaming-programming-guide.html#using-foreach
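
The foreach sink was limited to Scala and Java when this answer was written; from Spark 2.4 onward PySpark also accepts an object with open, process and close methods in writeStream.foreach. Below is a minimal sketch of such a writer that maps fields to column families. Everything specific in it is an assumption made for illustration: the COLUMN_MAP contents, the "id" row-key field, the "events" table name, the Thrift host name, and the choice of the third-party happybase client (any object offering table(name), put(row_key, cells) and close() would fit).

```python
# Hypothetical mapping: source field -> (column family, qualifier).
COLUMN_MAP = {
    "name": ("info", "name"),
    "email": ("info", "email"),
    "clicks": ("metrics", "clicks"),
}


def to_hbase_cells(record, column_map=COLUMN_MAP):
    """Build {b'family:qualifier': b'value'} from one record, skipping None/missing fields."""
    cells = {}
    for field, (family, qualifier) in column_map.items():
        value = record.get(field)
        if value is not None:
            cells[f"{family}:{qualifier}".encode("utf-8")] = str(value).encode("utf-8")
    return cells


class HBaseForeachWriter:
    """Writer with the open/process/close shape that PySpark's foreach sink expects."""

    def __init__(self, connection_factory, table_name, key_field="id"):
        # connection_factory is called on the executor, once per partition and epoch.
        self.connection_factory = connection_factory
        self.table_name = table_name
        self.key_field = key_field
        self.connection = None
        self.table = None

    def open(self, partition_id, epoch_id):
        self.connection = self.connection_factory()
        self.table = self.connection.table(self.table_name)
        return True  # True tells Spark to go on and call process() for each row

    def process(self, row):
        # Accept a pyspark Row (has asDict) or a plain dict.
        record = row.asDict() if hasattr(row, "asDict") else dict(row)
        row_key = str(record[self.key_field]).encode("utf-8")
        cells = to_hbase_cells(record)
        if cells:
            self.table.put(row_key, cells)

    def close(self, error):
        if self.connection is not None:
            self.connection.close()
        self.connection = None
        self.table = None


def happybase_connection():
    # Assumption: happybase is installed on the executors and an HBase Thrift
    # server is reachable at this hypothetical host name.
    import happybase
    return happybase.Connection("hbase-thrift-host")
```

With a streaming DataFrame df whose rows carry those fields, the writer would be attached with something like df.writeStream.foreach(HBaseForeachWriter(happybase_connection, "events")).start(). Passing a factory rather than a live connection keeps the writer picklable, since the connection is only created inside open on the executor.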

solution1
1 2017-04-25 21:17:36