
What type of model can I use to train this data?

I have downloaded and labeled data from http://archive.ics.uci.edu/ml/datasets/pamap2+physical+activity+monitoring

My task is to gain insight into the data from what is given. I have around 34 attributes in a DataFrame (all clean, no NaN values)

and I want to train a model to predict one target attribute, 'heart_rate', from the rest of the attributes (all are numbers recorded from a participant performing various activities).

I wanted to use a linear regression model but cannot use my DataFrame for some reason. However, I do not mind starting from scratch if you think I am doing it wrong.

My DataFrame columns:

> Index(['timestamp', 'activity_ID', 'heart_rate', 'IMU_hand_temp',
>        'hand_acceleration_16_1', 'hand_acceleration_16_2',
>        'hand_acceleration_16_3', 'hand_gyroscope_rad_7',
>        'hand_gyroscope_rad_8', 'hand_gyroscope_rad_9',
>        'hand_magnetometer_μT_10', 'hand_magnetometer_μT_11',
>        'hand_magnetometer_μT_12', 'IMU_chest_temp', 'chest_acceleration_16_1',
>        'chest_acceleration_16_2', 'chest_acceleration_16_3',
>        'chest_gyroscope_rad_7', 'chest_gyroscope_rad_8',
>        'chest_gyroscope_rad_9', 'chest_magnetometer_μT_10',
>        'chest_magnetometer_μT_11', 'chest_magnetometer_μT_12',
>        'IMU_ankle_temp', 'ankle_acceleration_16_1', 'ankle_acceleration_16_2',
>        'ankle_acceleration_16_3', 'ankle_gyroscope_rad_7',
>        'ankle_gyroscope_rad_8', 'ankle_gyroscope_rad_9',
>        'ankle_magnetometer_μT_10', 'ankle_magnetometer_μT_11',
>        'ankle_magnetometer_μT_12', 'Intensity'],
>       dtype='object')

First 5 rows:

timestamp   activity_ID heart_rate  IMU_hand_temp   hand_acceleration_16_1  hand_acceleration_16_2  hand_acceleration_16_3  hand_gyroscope_rad_7    hand_gyroscope_rad_8    hand_gyroscope_rad_9    ... ankle_acceleration_16_1 ankle_acceleration_16_2 ankle_acceleration_16_3 ankle_gyroscope_rad_7   ankle_gyroscope_rad_8   ankle_gyroscope_rad_9   ankle_magnetometer_μT_10    ankle_magnetometer_μT_11    ankle_magnetometer_μT_12    Intensity
2928    37.66   lying   100.0   30.375  2.21530 8.27915 5.58753 -0.004750   0.037579    -0.011145   ... 9.73855 -1.84761    0.095156    0.002908    -0.027714   0.001752    -61.1081    -36.8636    -58.3696    low
2929    37.67   lying   100.0   30.375  2.29196 7.67288 5.74467 -0.171710   0.025479    -0.009538   ... 9.69762 -1.88438    -0.020804   0.020882    0.000945    0.006007    -60.8916    -36.3197    -58.3656    low
2930    37.68   lying   100.0   30.375  2.29090 7.14240 5.82342 -0.238241   0.011214    0.000831    ... 9.69633 -1.92203    -0.059173   -0.035392   -0.052422   -0.004882   -60.3407    -35.7842    -58.6119    low
2931    37.69   lying   100.0   30.375  2.21800 7.14365 5.89930 -0.192912   0.019053    0.013374    ... 9.66370 -1.84714    0.094385    -0.032514   -0.018844   0.026950    -60.7646    -37.1028    -57.8799    low
2932    37.70   lying   100.0   30.375  2.30106 7.25857 6.09259 -0.069961   -0.018328   0.004582    ... 9.77578 -1.88582    0.095775    0.001351    -0.048878   -0.006328   -60.2040    -37.1225    -57.8847    low

If you check the timestamp attribute, you will see that the data is sampled at millisecond-scale intervals (the rows above are 0.01 s apart), so it might be a good idea to take the data from this DataFrame only every 2-5 seconds and train the model on that.

Also, as an option, I want to use one of these models for this task: linear, polynomial, or multiple linear regression, agglomerative clustering, or k-means clustering.
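
One way to do the 2-5 second reduction described above is to average each numeric column over fixed time windows. The following is only a sketch under stated assumptions: the helper name `downsample`, its `window_s` parameter, and the synthetic `demo` frame are made up for illustration, and it assumes `timestamp` is measured in seconds.

```python
import numpy as np
import pandas as pd


def downsample(frame: pd.DataFrame, window_s: float = 2.0) -> pd.DataFrame:
    """Average the numeric columns over consecutive windows of `window_s` seconds.

    Hypothetical helper: assumes a `timestamp` column expressed in seconds.
    """
    # Integer window index per row, e.g. 37.66 s with a 2 s window falls in window 18
    window = np.floor(frame["timestamp"] / window_s).astype(int).rename("window")
    # String columns such as activity_ID cannot be averaged, so keep numeric ones only
    numeric = frame.select_dtypes(include="number")
    return numeric.groupby(window).mean().reset_index(drop=True)


# Synthetic stand-in for the real recording: 10 s of data sampled every 0.01 s
rng = np.random.default_rng(0)
n = 1000
demo = pd.DataFrame({
    "timestamp": np.arange(n) * 0.01,
    "heart_rate": 100.0 + rng.normal(0.0, 1.0, n),
    "hand_acceleration_16_1": rng.normal(2.2, 0.1, n),
    "activity_ID": ["lying"] * n,
})

reduced = downsample(demo, window_s=2.0)
print(reduced.shape)
```

Averaging keeps some information from every sample in the window; simply taking every N-th row (for example `demo.iloc[::200]`) would be an alternative if raw readings are preferred.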

My code:

target = subject1.DataFrame(data.target, columns=["heart_rate"])
X = df
y = target["heart_rate"]
lm = linear_model.LinearRegression()
model = lm.fit(X,y)
predictions = lm.predict(X)
print(predictions)[0:5]

Error:

AttributeError                            Traceback (most recent call last)
<ipython-input-93-b0c3faad3a98> in <module>()
      3 #heart_rate
      4 # Put the target (housing value -- MEDV) in another DataFrame
----> 5 target = subject1.DataFrame(data.target, columns=["heart_rate"])

c:\python36\lib\site-packages\pandas\core\generic.py in __getattr__(self, name)
   5177             if self._info_axis._can_hold_identifiers_and_holds_name(name):
   5178                 return self[name]
-> 5179             return object.__getattribute__(self, name)
   5180 
   5181     def __setattr__(self, name, value):

AttributeError: 'DataFrame' object has no attribute 'DataFrame'

To fix the error I tried:

subject1.columns = subject1.columns.str.strip()

but it still did not work.

Thank you, and sorry if I was not precise enough.

Try this:

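from scipy.stats import zscore
from sklearn import linear_model
from sklearn.model_selection import train_test_split

# Keep only numeric feature columns; activity_ID and Intensity hold strings
X = df.drop("heart_rate", axis=1).select_dtypes(include="number")
y = df[["heart_rate"]]
X = X.apply(zscore)  # standardize each feature column
test_size = 0.30
seed = 7
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=test_size, random_state=seed)
lm = linear_model.LinearRegression()
model = lm.fit(X_train, y_train)  # fit on the training split only
predictions = lm.predict(X_test)  # predict on the held-out rows
print(predictions[0:5])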
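
To check how well such a model generalizes, the held-out split can be scored. The sketch below is self-contained and makes several assumptions: the `subject1` frame is synthetic stand-in data (not the PAMAP2 recording), only three of the feature columns are mimicked, and `StandardScaler` fitted on the training rows is swapped in for `zscore` so that no test-set statistics leak into training. It also shows the target being selected by column name, since `subject1.DataFrame(...)` fails because a DataFrame instance has no `DataFrame` attribute.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_absolute_error, r2_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for subject1 (assumption: the real frame has many more columns)
rng = np.random.default_rng(7)
n = 500
subject1 = pd.DataFrame({
    "IMU_hand_temp": rng.normal(30.0, 1.0, n),
    "hand_acceleration_16_1": rng.normal(2.0, 1.0, n),
    "chest_acceleration_16_1": rng.normal(0.0, 1.0, n),
})
subject1["heart_rate"] = (
    100.0
    + 5.0 * subject1["hand_acceleration_16_1"]
    + 3.0 * subject1["chest_acceleration_16_1"]
    + rng.normal(0.0, 1.0, n)
)

# Select the target by column name instead of calling subject1.DataFrame(...)
y = subject1["heart_rate"]
X = subject1.drop(columns="heart_rate")

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.30, random_state=7
)

# Fit the scaler on training rows only, then apply it to both splits
scaler = StandardScaler().fit(X_train)
model = LinearRegression().fit(scaler.transform(X_train), y_train)
predictions = model.predict(scaler.transform(X_test))

r2 = r2_score(y_test, predictions)
mae = mean_absolute_error(y_test, predictions)
print(f"R^2 on held-out rows: {r2:.3f}, MAE: {mae:.3f}")
```

On the real data the scores will differ, and because consecutive sensor rows are highly correlated, a random split may look more optimistic than a split by time or by participant.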


