
Pandas to_sql Trying to Index Nullable Column

I want to set up a job that dumps data into a SQL table every day, overwriting the existing data.

df.to_sql(table_name, engine, schema='dbo', 
          index=True, index_label='IdColumn', 
          if_exists='replace')

However, behind the scenes SQLAlchemy creates the table with IdColumn as a nullable VARCHAR(max), so SQL Server throws an error when it tries to create the index on that column.

It's pretty trivial to truncate the table before I write the data to it, but I feel like there should be a more elegant solution to this problem.

If you want to write the index to the SQL table as a normal column, you can call reset_index before the to_sql call:

df.reset_index().to_sql(table_name, engine, schema='dbo', index=False, if_exists='replace')

The only catch is the name of that column: if you want a custom one, you first have to set the index name ( df.index.name = 'IdColumn' ) or rename the column after the reset_index.
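Putting those two steps together, a minimal sketch (using an in-memory SQLite engine and a made-up table name as stand-ins for the real SQL Server connection and schema):

```python
import pandas as pd
from sqlalchemy import create_engine

# Hypothetical sample data; the real frame comes from the daily job.
df = pd.DataFrame({'value': [10, 20, 30]})

# Name the index so reset_index() turns it into a column called 'IdColumn'.
df.index.name = 'IdColumn'

# In-memory SQLite engine for illustration only; substitute your own
# engine (and pass schema='dbo' if you are on SQL Server).
engine = create_engine('sqlite://')
df.reset_index().to_sql('my_table', engine, index=False, if_exists='replace')
```

Reading the table back then shows IdColumn as an ordinary, fully populated column rather than a database index.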

Consider using the dtype argument, which takes a dictionary mapping DataFrame column names to SQLAlchemy data types. You can try VARCHAR:

import sqlalchemy

df.to_sql(table_name, engine, schema='dbo', 
          index=True, index_label='IdColumn', 
          if_exists='replace',
          dtype={'IdColumn': sqlalchemy.types.VARCHAR(length=255)})

or the generic String type, specifying a length:

from sqlalchemy.types import String

df.to_sql(table_name, engine, schema='dbo', 
          index=True, index_label='IdColumn', 
          if_exists='replace',
          dtype={'IdColumn': String(length=255)})
