Declare a PySpark variable in Synapse and use it in a Kusto query
I want to declare a PySpark variable in Synapse and use that variable in Kusto queries.
The variables are declared in PySpark as follows:
s = "02-01-2022"
print(s)
e = "02-10-2022"
print(e)
I want to use the variables 's' and 'e' in Kusto queries as shown below:
%%pyspark
s = "02-01-2022"
print(s)
e = "02-10-2022"
print(e)
# Read data from Azure Data Explorer table(s)
# Full Sample Code available at: https://github.com/Azure/azure-kusto-spark/blob/master/samples/src/main/python/SynapseSample.py
sales_data = spark.read \
.format("com.microsoft.kusto.spark.synapse.datasource") \
.option("spark.synapse.linkedService", "LinkedServiceName") \
.option("kustoDatabase", "DatabaseName") \
.option("kustoQuery", "let starttime = startofday(todatetime('s')); let endtime = startofday(todatetime('e')); Table | where Time between (starttime .. endtime) | summarize amount = count() by Date= bin(TIMESTAMP,5h) | project Date,amount | order by Date asc") \
.load()
display(sales_data)
You can use the variables in the following way in PySpark:
option("kustoQuery", "let starttime = startofday(todatetime('" + s + "')); let endtime = startofday(todatetime('" + e + "')); Table | where Time between (starttime .. endtime) | summarize amount = count() by Date= bin(TIMESTAMP,5h) | project Date,amount | order by Date asc")
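As an alternative to string concatenation, the query can be built with an f-string, which is easier to read and less error-prone with quotes. This is a minimal sketch of building the query string only (no Spark session needed); `Table`, `Time`, and `TIMESTAMP` are the placeholder names from the question, so substitute your own table and column names:

```python
# Start and end dates declared in PySpark, as in the question
s = "02-01-2022"
e = "02-10-2022"

# Build the Kusto query with f-string interpolation; the Python
# variables are substituted into the todatetime('...') literals.
kusto_query = (
    f"let starttime = startofday(todatetime('{s}')); "
    f"let endtime = startofday(todatetime('{e}')); "
    "Table "
    "| where Time between (starttime .. endtime) "
    "| summarize amount = count() by Date = bin(TIMESTAMP, 5h) "
    "| project Date, amount "
    "| order by Date asc"
)
print(kusto_query)
```

The resulting string is then passed as before: `.option("kustoQuery", kusto_query)`.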
Also, please refer to the Azure Data Explorer (Kusto) connector for Apache Spark.