Converting dtype=object to binary values
I have a dataset in which one column contains two distinct text values (PAIDOFF and COLLECTION). I want to convert it to binary values, so I tried the following:
y = df['loan_status'].values
y[0:5]
Output:
array(['PAIDOFF', 'PAIDOFF', 'PAIDOFF', 'PAIDOFF', 'PAIDOFF'],
dtype=object)
After defining the target column, I tried to convert it to binary values:
#Convert y to binary values
le_loan_status=preprocessing.LabelEncoder()
le_loan_status.fit(['PAIDOFF','COLLECTION'])
y[:,0]= le_loan_status.transform(y[:,0])
Output:
---------------------------------------------------------------------------
IndexError Traceback (most recent call last)
<ipython-input-10-917e44b54b88> in <module>
2 le_loan_status=preprocessing.LabelEncoder()
3 le_loan_status.fit(['PAIDOFF','COLLECTION'])
----> 4 y[:,0]= le_loan_status.transform(y[:,0])
IndexError: too many indices for array: array is 1-dimensional, but 2 were indexed
Do you have any idea how to solve this?
Convert the column to dummy variables:
dummies = pd.get_dummies(df["loan_status"],drop_first=True)
new_data = pd.concat([df,dummies],axis=1)
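For comparison, here is a minimal sketch of how the original LabelEncoder attempt could be made to work. The traceback says y is 1-dimensional, so indexing it with y[:,0] fails; passing the whole array to transform avoids the problem. The small DataFrame below is made-up sample data standing in for the asker's df, and the variable name y_encoded is only illustrative.

```python
import pandas as pd
from sklearn import preprocessing

# Hypothetical sample data standing in for the asker's DataFrame.
df = pd.DataFrame({"loan_status": ["PAIDOFF", "PAIDOFF", "COLLECTION", "PAIDOFF"]})

# y is a 1-D object array, so it is indexed with a single axis only.
y = df["loan_status"].values

le_loan_status = preprocessing.LabelEncoder()
le_loan_status.fit(["PAIDOFF", "COLLECTION"])

# Transform the whole 1-D array instead of slicing y[:, 0].
# LabelEncoder sorts its classes, so COLLECTION -> 0 and PAIDOFF -> 1.
y_encoded = le_loan_status.transform(y)

# The get_dummies route from the answer: drop_first=True drops the first
# category (COLLECTION) and keeps a single PAIDOFF indicator column.
dummies = pd.get_dummies(df["loan_status"], drop_first=True)
paidoff_flag = dummies["PAIDOFF"].astype(int)
```

On this sample both approaches mark PAIDOFF as 1 and COLLECTION as 0. Casting with astype(int) is a precaution, since the dtype returned by get_dummies differs between pandas versions.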