The Y axis lists feature names. Each row of the scatter plot represents one feature.
The X axis shows SHAP values. Every row has the same number of points, namely the number of data points in your data set. Each point in a row depicts the SHAP value that the feature received for that particular data point. Clustering of points means these feature values tend to produce similar SHAP values (due to insensitivity of the output or low dispersion of the feature values themselves).
The color of each point encodes the feature value in original units.
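To make the two causes of clustering concrete, here is a minimal sketch that does not call the shap library at all. It assumes a hypothetical linear model with independent features, for which the SHAP value of feature j on a data point is `w_j * (x_j - mean(x_j))`. The feature names, weights and spreads are made up for illustration.

```python
import random
import statistics

random.seed(0)
n_samples = 200

# Hypothetical data set: three features with different dispersion.
data = {
    "wide_strong": [random.gauss(0, 5.0) for _ in range(n_samples)],
    "wide_weak": [random.gauss(0, 5.0) for _ in range(n_samples)],
    "narrow_strong": [random.gauss(0, 0.1) for _ in range(n_samples)],
}
# Hypothetical linear model weights: how sensitive the output is to each feature.
weights = {"wide_strong": 2.0, "wide_weak": 0.01, "narrow_strong": 2.0}

# Assumption: linear model, independent features, so
# SHAP(feature j, sample i) = w_j * (x_ij - mean(x_j)).
shap_values = {}
for name, column in data.items():
    mean_x = statistics.fmean(column)
    shap_values[name] = [weights[name] * (x - mean_x) for x in column]

# One row per feature, ordered by mean |SHAP| (most important first).
importance = {
    name: statistics.fmean(abs(v) for v in values)
    for name, values in shap_values.items()
}
rows = sorted(importance, key=importance.get, reverse=True)

for name in rows:
    values = shap_values[name]
    # Every row holds exactly n_samples points; only their spread differs.
    print(f"{name:14s} points={len(values)} spread={statistics.pstdev(values):.3f}")
```

Both `wide_weak` (output insensitive to the feature) and `narrow_strong` (feature values barely vary) end up as tight clusters around zero, while `wide_strong` spreads out along the X axis.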
So putting it all together, one might state for the bottom row:
Note: whenever I say "insensitive", I mean "the feature's average marginal contribution is low over all possible coalitions".
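The phrase "average marginal contribution over all possible coalitions" can be illustrated with a brute-force Shapley computation. This is only a sketch on a hypothetical three-player toy game (`toy_value` and the player names are invented here); the shap library uses much faster approximations rather than this exhaustive enumeration.

```python
from itertools import combinations
from math import factorial


def shapley_values(value, players):
    """Exact Shapley values by enumerating every coalition (exponential cost)."""
    n = len(players)
    phi = {}
    for p in players:
        others = [q for q in players if q != p]
        total = 0.0
        for k in range(n):
            # Shapley weight for coalitions of size k that exclude p.
            weight = factorial(k) * factorial(n - k - 1) / factorial(n)
            for coalition in combinations(others, k):
                s = frozenset(coalition)
                # Marginal contribution of p when it joins coalition s.
                total += weight * (value(s | {p}) - value(s))
        phi[p] = total
    return phi


def toy_value(coalition):
    # Hypothetical payoff: "a" matters a lot, "b" barely, "c" only together with "a".
    v = 0.0
    if "a" in coalition:
        v += 10.0
    if "b" in coalition:
        v += 0.1
    if "a" in coalition and "c" in coalition:
        v += 2.0
    return v


phi = shapley_values(toy_value, ["a", "b", "c"])
print(phi)
```

Here `"b"` is the "insensitive" player: its marginal contribution is small in every coalition, so its weighted average is small as well.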