Change pandas DataFrame to numpy array but keeping column names

I have a pandas DataFrame built from the sklearn.datasets Boston house price data, and I am trying to convert it to a numpy array while keeping the column names. Here is the code I tried:

from sklearn import datasets ## imports datasets from scikit-learn
import numpy as np
import pandas as pd

data = datasets.load_boston() ## loads Boston dataset from datasets library

df = pd.DataFrame(data.data, columns=data.feature_names)
X = df.to_numpy()
print(X.dtype.names)

However, this prints None, so the column names are not kept. Does anyone understand why?

Thanks

Try this, which stacks the feature names as a header row on top of the data:

w = data.feature_names.reshape(1, -1) ## feature names as a single 1 x 13 header row
X = np.vstack((w, data.data)) ## put the header row on top of the data rows
print(X)
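
As for why the original attempt prints None: dtype.names is only populated for structured (record) arrays, and to_numpy() returns a plain 2-D array with a single dtype. One possible alternative, sketched below, is DataFrame.to_records(index=False), which gives each column a named field. The small DataFrame here is a hypothetical stand-in for the Boston data (made-up values, only the column names mirror the dataset), so treat it as an illustration rather than a drop-in replacement.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-in for the Boston DataFrame (values are made up)
df = pd.DataFrame(
    [[0.1, 18.0, 2.31], [0.2, 0.0, 7.07]],
    columns=["CRIM", "ZN", "INDUS"],
)

# to_numpy() gives a plain 2-D float array, so it has no field names
X_plain = df.to_numpy()
print(X_plain.dtype.names)  # None

# to_records() builds a structured array with one named field per column
X_rec = df.to_records(index=False)
print(X_rec.dtype.names)  # ('CRIM', 'ZN', 'INDUS')

# Columns can then be selected by name
print(X_rec["ZN"])
```

Note that the structured array is 1-D (one record per row), so code that expects a plain 2-D numeric matrix would probably still want df.to_numpy(), with the names kept separately in df.columns.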
