Azure Synapse 专用池数据拉入 jupyter notebbok

Question

I have data saved in one of view created in Azure synapse dedicated pools?我在 Azure 突触专用池中创建的视图之一中保存了数据？ I need to access this data into the jupyter notebook for further processing ?我需要将此数据访问到 jupyter 笔记本中以进行进一步处理吗？ would there any way to access/extract the data from dedciated pools in jupyter notebook written in python.有什么方法可以从用 python 编写的 jupyter notebook 中的专用池中访问/提取数据。

Answer 1

The Azure Synapse Dedicated SQL Pool Connector for Apache Spark in Azure Synapse Analytics enables efficient transfer of large data sets between the Apache Spark runtime and the Dedicated SQL pool. Azure Synapse Analytics 中用于 Apache Spark 的 Azure Synapse 专用 SQL 池连接器支持在 Apache Spark 运行时和专用 SQL 池之间高效传输大型数据集。 The connector is shipped as a default library with Azure Synapse Workspace.该连接器作为 Azure Synapse Workspace 的默认库提供。

Sample code -示例代码 -

# Add required imports
import com.microsoft.spark.sqlanalytics
from com.microsoft.spark.sqlanalytics.Constants import Constants
from pyspark.sql.functions import col

# Read from existing internal table
dfToReadFromTable = (spark.read
                     # If `Constants.SERVER` is not provided, the `<database_name>` from the three-part table name argument
                     # to `synapsesql` method is used to infer the Synapse Dedicated SQL End Point.
                     .option(Constants.SERVER, "<sql-server-name>.sql.azuresynapse.net")
                     # Defaults to storage path defined in the runtime configurations
                     .option(Constants.TEMP_FOLDER, "abfss://<container_name>@<storage_account_name>.dfs.core.windows.net/<some_base_path_for_temporary_staging_folders>")
                     # Three-part table name from where data will be read.
                     .synapsesql("<database_name>.<schema_name>.<table_name>")
                     # Column-pruning i.e., query select column values.
                     .select("<some_column_1>", "<some_column_5>", "<some_column_n>")
                     # Push-down filter criteria that gets translated to SQL Push-down Predicates.
                     .filter(col("Title").contains("E"))
                     # Fetch a sample of 10 records
                     .limit(10))

# Show contents of the dataframe
dfToReadFromTable.show()

You can refer this link for more information您可以参考此链接以获取更多信息

Azure Synapse 专用池数据拉入 jupyter notebbok

问题描述

1 个解决方案

解决方案1
0 2022-06-01 14:26:25

Azure Synapse 专用池数据拉入 jupyter notebbok

问题描述

1 个解决方案

解决方案1 0 2022-06-01 14:26:25

解决方案1
0 2022-06-01 14:26:25