Structured Streaming in PySpark
I am trying to stream data from another server to HBase and be able to define different column families in Python. I have looked around in the Spark docs and only see:
writeStream.format('jdbc').start('jdbc:///')
How can I achieve the same kind of implementation that writes directly to HBase, with the ability to map data to different column families?
You can use foreach (Scala or Java) to write data to HBase: http://spark.apache.org/docs/latest/structured-streaming-programming-guide.html#using-foreach
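Since Spark 2.4, `foreach` is also available in Python: you pass an object with `open`/`process`/`close` methods to `DataStreamWriter.foreach`. Below is a minimal sketch of that pattern, writing each streaming row to HBase via the third-party happybase client. The host, table name (`events`), column families (`cf1`, `cf2`), and row fields (`id`, `name`, `value`) are assumptions for illustration, not part of your schema.

```python
class HBaseForeachWriter:
    """Writes each streaming row to HBase, mapping fields to column families.

    Hypothetical example: table 'events' with families 'cf1' and 'cf2'.
    """

    def __init__(self, host='hbase-host', table_name='events'):
        self.host = host
        self.table_name = table_name
        self.connection = None
        self.table = None

    def open(self, partition_id, epoch_id):
        # Called once per partition on the executor; create the connection
        # here so it is never serialized from the driver.
        import happybase  # third-party HBase Thrift client
        self.connection = happybase.Connection(self.host)
        self.table = self.connection.table(self.table_name)
        return True

    def process(self, row):
        # Map different fields to different column families via the
        # 'family:qualifier' key convention used by HBase.
        self.table.put(
            str(row.id).encode(),
            {
                b'cf1:name': str(row.name).encode(),
                b'cf2:value': str(row.value).encode(),
            },
        )

    def close(self, error):
        if self.connection is not None:
            self.connection.close()


# Usage inside a Spark job (sketch):
# query = df.writeStream.foreach(HBaseForeachWriter()).start()
```

An alternative is `foreachBatch` (also Spark 2.4+), which hands you each micro-batch as a regular DataFrame, letting you reuse any batch HBase connector instead of writing rows one at a time.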