Spark streaming and mocking HDFS

Original question: 2018-08-14 00:53:21 5 1 java/ apache-spark/ hadoop/ cucumber/ hdfs

There is a requirement to implement a test for Spark Streaming code. This particular code runs in a separate JVM by using this library, and the input for the application is HDFS. I've started a MiniDFSCluster as in this example (Java version), but I don't think it will work because the cluster and the application are in two different JVMs.

What would be the best approach to mock the HDFS input so that I can successfully test the Spark Streaming code?

I have described the scenario above in general terms. The real requirement is to implement a successful Cucumber test.

1 answer

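Instead of trying to mock HDFS, you could try running Spark in local mode and specifying a path such as "file:///foo/bar". The local filesystem will then be used in place of HDFS.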
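
A test step that follows this suggestion could be sketched roughly as below. This is only an illustration under assumptions, not the asker's actual setup: the class name `LocalInputFixture` and its methods are hypothetical, and it assumes the job accepts its input directory as a parameter and reads it with something like `JavaStreamingContext.textFileStream(...)` under a local master such as `local[2]`. Because the directory lives on the local filesystem, it should be visible to a job running in a separate JVM on the same machine. Files are written to a staging directory first and then moved in, since Spark's file-based streams expect files to appear in the monitored directory atomically.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;
import java.util.Arrays;
import java.util.List;

// Hypothetical test helper: prepares a local directory that stands in for the HDFS input.
public class LocalInputFixture {

    /** Creates a temp root with "staging" and "input" subdirectories; returns the input directory. */
    public static Path createInputDir() {
        try {
            Path root = Files.createTempDirectory("spark-stream-test");
            Files.createDirectory(root.resolve("staging"));
            return Files.createDirectory(root.resolve("input"));
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    /** Returns a file: URI for the directory, to be passed to the job instead of an hdfs:// path. */
    public static String toInputUri(Path inputDir) {
        return inputDir.toUri().toString();
    }

    /** Writes the lines to the staging directory, then moves the finished file into the input directory. */
    public static Path dropFile(Path inputDir, String name, List<String> lines) {
        try {
            Path staged = inputDir.resolveSibling("staging").resolve(name);
            Files.write(staged, lines, StandardCharsets.UTF_8);
            // Same temp root, so the move can be atomic: the job never sees a half-written file.
            return Files.move(staged, inputDir.resolve(name), StandardCopyOption.ATOMIC_MOVE);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) {
        Path inputDir = createInputDir();
        String uri = toInputUri(inputDir);
        // Assumption: the job under test is started with this URI as its input path, e.g.
        //   new SparkConf().setMaster("local[2]") ... jssc.textFileStream(uri)
        System.out.println("Input URI for the job: " + uri);

        Path dropped = dropFile(inputDir, "batch-1.txt", Arrays.asList("hello", "world"));
        System.out.println("Dropped test input: " + dropped);
    }
}
```

In a Cucumber scenario, a "Given" step might call `createInputDir()` and start the job with the resulting URI, and a "When" step might call `dropFile(...)` once the job is running, so that the file counts as new input.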

Related questions

Spark streaming output not saved to HDFS file

How to write twitter tweets to HDFS using Spark Streaming Java API

How to read data from HDFS using spark streaming?

Data skipped while writing Spark Streaming output to HDFS

Structured Streaming to Save JSON to HDFS

Spark / Hdfs / Hdfs-client compatibility

Using Spark SQL with Spark Streaming

Java Spark Streaming with Cassandra

Stop spark streaming

NoClassDefFound Exception Spark Streaming
