简体   繁体   English

触发 Databricks Notebook 的方法

[英]Ways to Trigger a Databricks Notebook

Can someone let me know the possible ways to trigger a Databricks notebook?有人可以让我知道触发 Databricks 笔记本的可能方法吗? My preferred method is via Azure Data Factory, but my company is sadly reluctant to deploy ADF at this present moment in time.我的首选方法是通过 Azure 数据工厂,但遗憾的是我的公司目前不愿意及时部署 ADF。

Basically, I would like my Databricks notebook to be triggered when a blob is uploaded to Blob store.基本上,我希望在将 blob 上传到 Blob 存储时触发我的 Databricks 笔记本。 Is that possible?那可能吗?

You can try Auto Loader: Auto Loader supports two modes for detecting new files: directory listing and file notification.您可以试试 Auto Loader:Auto Loader 支持两种检测新文件的模式:目录列表和文件通知。

Directory listing: Auto Loader identifies new files by listing the input directory.目录列表: Auto Loader 通过列出输入目录来识别新文件。 Directory listing mode allows you to quickly start Auto Loader streams without any permission configurations other than access to your data on cloud storage.目录列表模式允许您快速启动 Auto Loader 流,无需任何权限配置,除了访问您在云存储上的数据。 In Databricks Runtime 9.1 and above, Auto Loader can automatically detect whether files are arriving with lexical ordering to your cloud storage and significantly reduce the amount of API calls it needs to make to detect new files.在 Databricks Runtime 9.1 及更高版本中,Auto Loader 可以自动检测文件是否按词法顺序到达您的云存储,并显着减少检测新文件所需的 API 调用量。

File notification: Auto Loader can automatically set up a notification service and queue service that subscribe to file events from the input directory.文件通知: Auto Loader 可以自动设置通知服务和队列服务,从输入目录订阅文件事件。 File notification mode is more performant and scalable for large input directories or a high volume of files but requires additional cloud permissions for set up.文件通知模式对于大型输入目录或大量文件具有更高的性能和可扩展性,但需要额外的云权限才能设置。

在此处输入图像描述

Refer - https://learn.microsoft.com/en-us/azure/databricks/spark/latest/structured-streaming/auto-loader参考 - https://learn.microsoft.com/en-us/azure/databricks/spark/latest/structured-streaming/auto-loader

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

相关问题 数据块笔记本中的多个单元格 - Multiple cells in databricks notebook 我可以遍历数据块笔记本中的小部件吗? - Can I iterate through the widgets in a databricks notebook? 通过 Azure Databricks Notebook 传递参数 URL - Passing parameter via Azure Databricks Notebook URL 如何将已经存在的数据块笔记本移动到回购协议中? - How to move an already extant databricks notebook into a repo? Azure 与数据块笔记本通信时出现功能错误 - Azure functions error communicating with a databricks notebook 您不能在运行 Databricks Basic 的集群上运行笔记本作业 - You cannot run a notebook job on a cluster running Databricks Basic Databricks Python Notebook中如何输入googleapiclient授权码 - How to input googleapiclient authorization code in Databricks Python Notebook 我如何从 Azure Devops 自动化 Databricks 笔记本 - How do i automate Databricks notebook from Azure Devops Azure Databricks API,如何通过 API 将集群附加到上传的笔记本 - Azure Databricks API, how to attach a cluster to an uploaded notebook via API Azure Databricks Jupyter Notebook Python 和 R 在一个单元格中 - Azure Databricks Jupyter Notebook Python and R in one cell
 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM