[英]PyMongo & Pandas - Update multiple records in MongoDB collection from dataframe by matching id
是否可以通過匹配 id 從 dataframe 更新 MongoDB 集合中的多個記錄?
dataframe
_id text sentiment
0 5ec299fa905e038dec3c8e93 Kederi Yusof· basikal salah Najib.Tayar pa... 1
1 5ec49452bfcd4786382fe21f Serindik.com·2 laaa mimpi UMNO.... XPM7 pul... 0
2 5ec40e8d28fb32986041df16 Newpaper24·4m1MDB: Najib Razak’s court, accuse... -1
3 5ec44c0b255995f0522fe1ec falseprophet· low pesuruh najib. Budak des... 1
4 5ed2ab347d23a5d56d59a730 Kamaluddin 阿列克斯 தீன்·11m-anak-najib-dalam-sena... 0
代碼
updates = []
for document in db.twitter.find():
for index, row in document.iterrows():
if(row['_id']==a['_id']):
updates.append(UpdateOne({'_id': row['_id']}, {'$set': {'sentiment': row['sentiment']}}, upsert=True))
break
db.twitter.bulk_write(updates)
但我得到了AttributeError: 'dict' object has no attribute 'iterrows'
錯誤消息
您不需要查找循環,只需使用iterrows()
獲取數據並使用UpdateOne
執行 upsert。
from pymongo import MongoClient, UpdateOne
import pandas as pd
db = MongoClient()['mydatabase']
data = [['ec299fa905e038dec3c8e93', 'Kederi Yusof· basikal salah Najib.Tayar pa...', 1],
['ec49452bfcd4786382fe21f', 'Serindik.com·2 laaa mimpi UMNO.... XPM7 pul...', 0]]
df = pd.DataFrame(data, columns=['_id', 'text', 'sentiment'])
updates = []
for _, row in df.iterrows():
updates.append(UpdateOne({'_id': row.get('_id')}, {'$set': {'sentiment': row.get('sentiment')}}, upsert=True))
db.twitter.bulk_write(updates)
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.