
How to store a MySQL query result in a pandas DataFrame with pymysql?

I'm trying to store a MySQL query result in a pandas DataFrame using pymysql, and I'm running into errors when building the DataFrame. I found similar questions here and here, but the errors being thrown look specific to pymysql:

import pandas as pd
import datetime
import pymysql

# dummy values 
connection = pymysql.connect(user='username', password='password', database='database_name', host='host')

start_date = datetime.datetime(2017,11,15)
end_date = datetime.datetime(2017,11,16)

try:
    with connection.cursor() as cursor:
        query = "SELECT * FROM orders WHERE date_time BETWEEN %s AND %s"

        cursor.execute(query, (start_date, end_date))

        # This is the line that fails: pymysql cursors have no keys() method
        df = pd.DataFrame(data=cursor.fetchall(), index=None, columns=cursor.keys())
finally:
    connection.close()

returns: AttributeError: 'Cursor' object has no attribute 'keys'

If I drop the index and columns arguments:

try:
    with connection.cursor() as cursor:
        query = "SELECT * FROM orders WHERE date_time BETWEEN %s AND %s"

        cursor.execute(query, (start_date, end_date))

        df = pd.DataFrame(cursor.fetchall())
finally:
    connection.close()

returns: ValueError: DataFrame constructor not properly called!

Thanks in advance!

Use pandas.read_sql() for this:

# pymysql expects %s placeholders; the ? marker belongs to other drivers
query = "SELECT * FROM orders WHERE date_time BETWEEN %s AND %s"
df = pd.read_sql(query, connection, params=(start_date, end_date))
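
If you would rather keep the explicit cursor from the question, the column names are available from cursor.description, which the DB-API defines as a sequence of per-column entries whose first item is the column name. Below is a minimal sketch of that approach. The helper name cursor_to_df is hypothetical, and an in-memory sqlite3 database with made-up rows stands in for MySQL so the snippet runs without a server (which is why it uses ? placeholders, where pymysql would use %s).

```python
import sqlite3

import pandas as pd


def cursor_to_df(cursor):
    """Build a DataFrame from a DB-API cursor that has already run a query."""
    # Each entry in cursor.description is a sequence; item 0 is the column name.
    columns = [col[0] for col in cursor.description]
    # list() turns a tuple of row tuples (as pymysql returns) into a list of rows.
    rows = list(cursor.fetchall())
    return pd.DataFrame(rows, columns=columns)


# Stand-in data: sqlite3 is used only to keep the example self-contained.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (order_id INTEGER, date_time TEXT)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(1, "2017-11-15 09:00:00"), (2, "2017-11-17 12:00:00")],
)

cur = conn.cursor()
cur.execute(
    "SELECT * FROM orders WHERE date_time BETWEEN ? AND ?",
    ("2017-11-15 00:00:00", "2017-11-16 00:00:00"),
)
df = cursor_to_df(cur)
conn.close()
print(df)
```

With pymysql the helper should work unchanged, since it only relies on description and fetchall(), which its cursors provide.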

Thank you for your suggestion to use pandas.read_sql(). It works for executing a stored procedure as well! I tested it in an MSSQL 2017 environment.

Below is an example (I hope it helps others):

import pandas as pd

def database_query_to_df(connection, stored_proc, start_date, end_date):
    # Define a query; stored_proc is concatenated into the SQL text,
    # so it must come from trusted code, never from user input
    query = "SET NOCOUNT ON; EXEC " + stored_proc + " ?, ?; SET NOCOUNT OFF"

    # Pass the parameters to the query, execute it, and store the results in a data frame
    df = pd.read_sql(query, connection, params=(start_date, end_date))
    return df
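
The ? marker works in the example above presumably because the connection comes from a driver that uses the "qmark" style, such as pyodbc (an assumption, since the answer does not name the driver). pymysql uses %s instead. Every DB-API module reports its style in a paramstyle attribute, so the placeholder can be looked up rather than guessed. A minimal sketch, in which placeholder_for is a hypothetical helper and sqlite3 stands in for the real database:

```python
import sqlite3

import pandas as pd

# Positional placeholder for the paramstyles that accept a tuple of params.
# pymysql reports 'pyformat' and accepts plain %s for positional parameters.
_PLACEHOLDERS = {"qmark": "?", "format": "%s", "pyformat": "%s"}


def placeholder_for(driver_module):
    """Return the positional placeholder for a DB-API driver module."""
    return _PLACEHOLDERS[driver_module.paramstyle]


# Stand-in table with one made-up row.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (order_id INTEGER, date_time TEXT)")
conn.execute("INSERT INTO orders VALUES (1, '2017-11-15 09:00:00')")

mark = placeholder_for(sqlite3)  # sqlite3 reports the 'qmark' style
query = f"SELECT * FROM orders WHERE date_time BETWEEN {mark} AND {mark}"
df = pd.read_sql(query, conn, params=("2017-11-15 00:00:00", "2017-11-16 00:00:00"))
conn.close()
```

Only the placeholder text is formatted into the query; the values themselves still travel through params, so the driver handles quoting.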

Try this:

import pandas as pd
import pymysql

mysql_connection = pymysql.connect(host='localhost', user='root', password='', db='test', charset='utf8')
                    
sql = "SELECT * FROM `brands`"
df = pd.read_sql(sql, mysql_connection, index_col='brand_id')
print(df)
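
The index_col argument makes the named column the DataFrame index instead of the default 0..n-1 range, so rows can then be looked up by id. A small sketch of the effect, using an in-memory sqlite3 table with made-up brand rows in place of the MySQL `brands` table:

```python
import sqlite3

import pandas as pd

# Stand-in table; the column names and rows are invented for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE brands (brand_id INTEGER PRIMARY KEY, name TEXT)")
conn.executemany("INSERT INTO brands VALUES (?, ?)", [(10, "Acme"), (20, "Globex")])

df = pd.read_sql("SELECT * FROM brands", conn, index_col="brand_id")
conn.close()

# brand_id is now the index rather than a regular column
print(df.loc[20, "name"])
```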


 