简体   繁体   English

用于时间序列数据预测的线性回归模型选择

[英]Linear regression model selection for time series data prediction

I have a signal and want to predict y which present Number of requests, using regression models. 我有一个信号,想使用回归模型预测y表示的请求数。 Currently, I am using OLS regression model to predict y. 目前,我正在使用OLS回归模型来预测y。 But the prediction error is very high, as my signal has a lot of variations (ups and downs) as shown below. 但是预测误差非常高,因为我的信号有很多变化(上下波动),如下所示。

I noticed that my model most of the time overestimate y (Number of Requests), especially if the points to be predicted is preceded by large value of y's. 我注意到我的模型在大多数情况下都高估了y(请求数),尤其是在要预测的点之前带有较大y值的情况下。 As indicated below in the yellow and red circle. 如下面黄色和红色圆圈所示。

在此处输入图片说明

So I am not sure if there's a robust regression models to accommodate this problem of having a lot of variations in my datasets. 因此,我不确定是否有健壮的回归模型来解决这个问题,因为我的数据集中有很多变化。 Also is there any way to segment out these large values by adapting the window size such that it doesn't include these values? 还可以通过调整窗口大小以使其不包括这些值来分割出这些大值吗?

Could you please advise 您能否提一些建议

From the visualization of the error I would say a linear model is not appropriate and you should consider using something that handles periodic data as well as moving average - your data appears to have periodic elements, and a moving average element that goes beyond something "linear". 从错误的可视化中,我会说线性模型是不合适的,您应该考虑使用处理周期数据以及移动平均值的东西-您的数据似乎具有周期元素,而移动平均值元素超出了“线性”范围”。 Consider something like ARIMA. 考虑类似ARIMA之类的东西。 Here's a link to a tutorial on ARIMA: https://machinelearningmastery.com/arima-for-time-series-forecasting-with-python/ Please post the results :) 这是ARIMA教程的链接: https : //machinelearningmastery.com/arima-for-time-series-forecasting-with-python/请发布结果:)

Vishaal Vishaal

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM