Pandas to_sql replace duplicates

Question

I query an API every second with a since parameter (so that it returns only the changes since the last request), convert the response to a DataFrame, and would like to insert it into MySQL quickly, replacing duplicate rows, something like this:

REPLACE INTO table (column1,column2...) VALUES (val1,val2...)

I really like DataFrame.to_sql, but the problem is that it has no option to replace duplicate rows. The only way I can see with DataFrame.to_sql is to drop the table each time and recreate it with if_exists='replace', but I think that will hurt performance significantly. Can you advise a better way to insert data from a DataFrame while replacing duplicate rows?

Answer 1

If your DataFrame is not that large, you can iterate over it, generate INSERT ... ON DUPLICATE KEY UPDATE SQL statements, and execute them against your MySQL database.
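A minimal sketch of that idea, not a definitive implementation. It assumes a DB-API connection from a MySQL driver that uses %s placeholders (PyMySQL, mysqlclient and mysql-connector-python do), a table whose PRIMARY KEY or UNIQUE index covers key_columns, and DataFrame column names that match the table columns. The function names build_upsert_sql and upsert_dataframe are made up for illustration. The VALUES(col) form in the UPDATE clause is deprecated in recent MySQL 8.0 releases in favour of row aliases, but is still accepted.

```python
def build_upsert_sql(table, columns, key_columns):
    """Build a parameterised INSERT ... ON DUPLICATE KEY UPDATE statement.

    Hypothetical helper: table and column names are interpolated directly,
    so they must come from trusted code, never from user input.
    """
    non_key = [c for c in columns if c not in key_columns]
    if not non_key:
        raise ValueError("need at least one non-key column to update")
    col_list = ", ".join(f"`{c}`" for c in columns)
    placeholders = ", ".join(["%s"] * len(columns))
    # On a key collision, overwrite each non-key column with the incoming value.
    updates = ", ".join(f"`{c}` = VALUES(`{c}`)" for c in non_key)
    return (
        f"INSERT INTO `{table}` ({col_list}) VALUES ({placeholders}) "
        f"ON DUPLICATE KEY UPDATE {updates}"
    )


def upsert_dataframe(df, table, key_columns, conn):
    """Send every row of df through the upsert statement; return the row count."""
    sql = build_upsert_sql(table, list(df.columns), key_columns)
    # Plain tuples, one per row, in column order (the index is not written).
    rows = list(df.itertuples(index=False, name=None))
    cur = conn.cursor()
    try:
        cur.executemany(sql, rows)
        conn.commit()
    finally:
        cur.close()
    return len(rows)
```

With a connection opened by, say, pymysql.connect(...), the call would look like upsert_dataframe(df, "my_table", ["id"], conn), where my_table and id are placeholder names. NaN values in the DataFrame are not turned into NULL by this sketch, so they may need converting first.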

Answer 2

It seems there is no way to replace duplicates with DataFrame.to_sql in pandas. Hopefully they will add this in the future. I did find a post on how to ignore duplicates, but in my case I chose another approach: as @MaxU mentioned, iterate through the DataFrame and execute:

REPLACE INTO table (column1,column2...) VALUES (val1,val2...)
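For reference, here is roughly what that loop can look like. It is a sketch under assumptions: conn is a DB-API connection from a MySQL driver that uses %s placeholders, the DataFrame columns match the table columns, and the table has a PRIMARY KEY or UNIQUE index (REPLACE only replaces when one of those collides, otherwise it behaves like a plain INSERT). The names build_replace_sql and replace_dataframe and the chunk_size default are made up for illustration.

```python
def build_replace_sql(table, columns):
    """Build a parameterised REPLACE INTO statement.

    Hypothetical helper: table and column names are interpolated directly,
    so they must come from trusted code, never from user input.
    """
    col_list = ", ".join(f"`{c}`" for c in columns)
    placeholders = ", ".join(["%s"] * len(columns))
    return f"REPLACE INTO `{table}` ({col_list}) VALUES ({placeholders})"


def replace_dataframe(df, table, conn, chunk_size=1000):
    """Write all rows of df with REPLACE INTO, in batches; return the row count."""
    sql = build_replace_sql(table, list(df.columns))
    # Plain tuples, one per row, in column order (the index is not written).
    rows = list(df.itertuples(index=False, name=None))
    cur = conn.cursor()
    try:
        # Send the rows in batches so a large frame is not one huge call.
        for start in range(0, len(rows), chunk_size):
            cur.executemany(sql, rows[start:start + chunk_size])
        conn.commit()
    finally:
        cur.close()
    return len(rows)
```

Keep in mind that REPLACE deletes the conflicting row and inserts a new one, so any column missing from the DataFrame falls back to its default value rather than keeping the old one.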

2 answers

solution1
1 2016-05-04 14:08:35

solution2
0 2016-05-07 15:15:46
