Residual plot for residual vs predicted value in Python

Question

I have run a KNN model. Now i want to plot the residual vs predicted value plot. Every example from different websites shows that i have to first run a linear regression model. But i couldn't understand how to do this. Can anyone help? Thanks in advance. Here is my model-

train, validate, test = np.split(df.sample(frac=1), [int(.6*len(df)), int(.8*len(df))])
x_train = train.iloc[:,[2,5]].values
y_train = train.iloc[:,4].values
x_validate = validate.iloc[:,[2,5]].values
y_validate = validate.iloc[:,4].values
x_test = test.iloc[:,[2,5]].values
y_test = test.iloc[:,4].values
clf=neighbors.KNeighborsRegressor(n_neighbors = 6)
clf.fit(x_train, y_train)
y_pred = clf.predict(x_validate)

Answer 1

Residuals are nothing but how much your predicted values differ from actual values. So, it's calculated as actual values-predicted values. In your case, it's residuals = y_test-y_pred . Now for the plot, just use this;

import matplotlib.pyplot as plt

plt.scatter(residuals,y_pred)

plt.show()

Answer 2

What is the question? The residuals are simply y_test-y_pred . Now use seaborn's regplot.

Residual plot for residual vs predicted value in Python

Question

2 answers

solution1
3 ACCPTED 2020-07-01 19:00:43

solution2
1 2020-07-01 16:47:57

Residual plot for residual vs predicted value in Python

Question

2 answers

solution1 3 ACCPTED 2020-07-01 19:00:43

solution2 1 2020-07-01 16:47:57

solution1
3 ACCPTED 2020-07-01 19:00:43

solution2
1 2020-07-01 16:47:57