[英]MemoryError: Unable to allocate 43.5 GiB for an array with shape (5844379795,) and data type int64
I have a large dataframe and I am trying to update one column:我有一个大的 dataframe 并且我正在尝试更新一列:
Dataframe: Dataframe:
I would like to update the last column IsFraudsterStatus .我想更新最后一列IsFraudsterStatus 。
My Code:我的代码:
df= pd.concat(chunk_list,)
def expand_fraud(no_fraud, fraud, col_name):
t = pd.merge(no_fraud, fraud, on=col_name)
if len(t):
df.loc[df.ID.isin(t.ID_x), "IsFraudsterStatus"] = 1
return True
return False
while True:
added_fraud = False
fraud = df[df.IsFraudsterStatus == 1]
no_fraud = df[df.IsFraudsterStatus == 0]
added_fraud |= expand_fraud(no_fraud, fraud, "DeviceId")
added_fraud |= expand_fraud(no_fraud, fraud, "Email")
added_fraud |= expand_fraud(no_fraud, fraud, "MobileNo")
if not added_fraud:
break
Error:错误:
声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.