
How to run non-Spark code on a Databricks cluster?

I am able to pull data through Databricks Connect and run Spark jobs without any problems. My question is how to run non-Spark (native Python) code on the remote cluster. I am not sharing the code due to confidentiality.

When you use Databricks Connect, your local machine is the driver of your Spark job, so non-Spark code will always be executed on your local machine. If you want to execute it remotely, you need to package it as a wheel/egg, or upload the Python files to DBFS (for example, via databricks-cli), and then execute your code as a Databricks job (for example, using the Run Submit command of the Jobs REST API, or by creating a job with databricks-cli and using databricks jobs run-now to execute it).
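As a rough illustration of the Run Submit route, here is a minimal sketch that builds a one-time run request for a Python file already uploaded to DBFS and posts it to the Jobs API 2.1 endpoint. The helper names, the file path, the cluster ID, and the run name are placeholders I made up for the example, and I am assuming an existing cluster and a personal access token; adjust it to whatever your workspace actually uses.

```python
import json
import urllib.request


def build_submit_payload(python_file, cluster_id, parameters=None,
                         run_name="non-spark-run"):
    """Build the JSON body for a one-time run (Jobs API 2.1 runs/submit).

    python_file: DBFS URI of the uploaded script (hypothetical path).
    cluster_id:  ID of an existing cluster (assumed to be running or startable).
    """
    return {
        "run_name": run_name,
        "tasks": [
            {
                "task_key": "main",
                "existing_cluster_id": cluster_id,
                # spark_python_task runs a plain Python file on the cluster;
                # the script does not have to use Spark at all.
                "spark_python_task": {
                    "python_file": python_file,
                    "parameters": list(parameters or []),
                },
            }
        ],
    }


def submit_run(host, token, payload):
    """POST the payload to the workspace and return the new run's ID.

    host:  workspace URL, e.g. "https://<your-workspace>" (assumption).
    token: a personal access token (assumption about the auth method).
    """
    request = urllib.request.Request(
        url=f"{host.rstrip('/')}/api/2.1/jobs/runs/submit",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)["run_id"]
```

You would call build_submit_payload with the DBFS path of your script and your cluster ID, then pass the result to submit_run together with your workspace URL and token. The script then runs on the cluster's driver node rather than on your local machine.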

