Change pandas DataFrame to numpy array but keeping column names
I have a pandas DataFrame built from the sklearn.datasets Boston house price data and am trying to convert it to a numpy array while keeping the column names. Here is the code I tried:
from sklearn import datasets ## imports datasets from scikit-learn
import numpy as np
import pandas as pd
data = datasets.load_boston() ## loads Boston dataset from datasets library (load_boston was removed in scikit-learn 1.2, so this needs an older version)
df = pd.DataFrame(data.data, columns=data.feature_names)
X = df.to_numpy()
print(X.dtype.names)
However, this returns None, so the column names are not kept. Does anyone understand why?
Thanks
Try this:
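## reshape the 13 feature names into a column, then transpose to get a single header row
w = data.feature_names.reshape(13, 1)
## stack the header row on top of the data; an array has one dtype, so the result is no longer purely numeric
X = np.vstack((w.T, data.data))
print(X)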
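As for why the original attempt prints None: `dtype.names` is only populated for structured arrays, and `to_numpy()` returns an ordinary array with a single dtype and no field names. One possible alternative, sketched below, is `DataFrame.to_records(index=False)`, which produces a record array with one named field per column. The small DataFrame here is a made-up stand-in (three columns named after Boston features, with invented values), not the real dataset.

```python
import numpy as np
import pandas as pd

# Stand-in frame with invented values; only the column names echo the Boston data.
df = pd.DataFrame(
    {"CRIM": [0.1, 0.2, 0.3], "ZN": [18.0, 0.0, 0.0], "INDUS": [2.31, 7.07, 7.07]}
)

# to_numpy() gives a plain homogeneous ndarray, so there are no field names.
plain = df.to_numpy()
print(plain.dtype.names)  # None

# to_records() builds a record array with one named field per column.
records = df.to_records(index=False)
print(records.dtype.names)  # ('CRIM', 'ZN', 'INDUS')

# Columns can then be accessed by name.
print(records["ZN"])
```

Note that the record array is one-dimensional (one record per row), so code that expects a 2-D matrix, such as most scikit-learn estimators, would still need the plain `to_numpy()` result with the column names kept separately, for example in `df.columns`.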