sklearn PLSRegression - Variance of X explained by latent vectors

I performed a partial least squares regression using Python's sklearn.cross_decomposition.PLSRegression.

Is there a way to retrieve the fraction of explained variance of X, i.e. R²(X), for each PLS component? I'm looking for something similar to the explvar() function from the R pls package, but I'd also appreciate any suggestions on how to compute it myself.

There is a similar question with an answer that explains how to get the explained variance of Y. I assume "variance in Y" was what was asked for there, which is why I opened a new question. I hope that's OK.

I managed to find a solution for the problem. The following gives the fraction of variance in X explained by each latent vector after PLS regression:

import numpy as np
from sklearn import cross_decomposition

# X is a numpy ndarray with samples in rows and predictor variables in columns
# y is a one-dimensional ndarray containing the response variable

# total variance in X, summed over all predictor columns (a scalar)
total_variance_in_x = np.sum(np.var(X, axis=0))

# scale=False keeps the scores in the units of X; with the default
# scale=True, sklearn standardizes X internally, so the total variance
# would have to be computed on the standardized X instead
pls1 = cross_decomposition.PLSRegression(n_components=5, scale=False)
pls1.fit(X, y)

# variance of the scores (the transformed X data) for each latent vector:
variance_in_x = np.var(pls1.x_scores_, axis=0)

# normalize by the total variance to get per-component fractions:
fractions_of_explained_variance = variance_in_x / total_variance_in_x
