繁体   English   中英

我需要使用来自 AWS GLUE 的 ETL 和 python 从 SQL Server 数据库中提取信息

[英]I need to extract information from SQL Server database using ETL from AWS GLUE with python

我需要从 SQL Server 数据库中提取信息,使用来自 AWS GLUE 的 ETL 和 Python,这就是我想要做的

import sys
from awsglue.transforms import *
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext
from awsglue.context import GlueContext
from awsglue.job import Job
from pyspark.sql import SQLContext
from awsglue.dynamicframe import DynamicFrame
import pymssql

args = getResolvedOptions(sys.argv, ['JOB_NAME'])
sc = SparkContext()
glueContext = GlueContext(sc)
spark = glueContext.spark_session
job = Job(glueContext)
job.init(args['JOB_NAME'], args)

dbread = pymssql.connect('Host', 'user', 'pass', 'DB')
cursorread = dbread.cursor()
cursorread.execute("select tra.codigo_tramo, pozI.descripcion + ' - ' + pozF.descripcion, ope.razon_social as nombre_transportador \
                    from m_tramo tra, m_pozo pozI, m_pozo pozF, m_operador ope \
                    where tra.estado <> 'I' and tra.codigo_pozo_ini = pozI.codigo_pozo \
                    and tra.codigo_pozo_fin = pozF.codigo_pozo \
                    and tra.codigo_trasportador = ope.codigo_operador")
                    
dbread.close()
consulta = spark.createDataFrame(consulta, ['codigo_tramo', 'descripcion_tramo','codigo_transportador'])
consulta.show()

我怎样才能查阅这些信息?

我已经解决了

import sys
from awsglue.transforms import *
from awsglue.utils import getResolvedOptions
from pyspark.context import SparkContext, SparkConf
from awsglue.context import GlueContext
from awsglue.job import Job
from pyspark.sql import SQLContext
from awsglue.dynamicframe import DynamicFrame

args = getResolvedOptions(sys.argv, ['JOB_NAME'])
sc = SparkContext()
glueContext = GlueContext(sc)
spark = glueContext.spark_session
job = Job(glueContext)
job.init(args['JOB_NAME'], args)

# Construct JDBC connection options
connection_sqlserver_options = {
    "url": "",
    "dbtable": "",
    "user": "",
    "password": ""}
    
# Read from JDBC databases with custom driver
DyF = glueContext.create_dynamic_frame.from_options(connection_type="sqlserver",
                                                          connection_options=connection_sqlserver_options)

暂无
暂无

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM