I want to split my dataset to training and validation sets I don't know whether I should split the dataset before applying pca dimensionality reduction or after pca to avoid leakage of data.
Any help would be appreciated.
The dataset should split before applying PCA to avoid leakage of data
The technical post webpages of this site follow the CC BY-SA 4.0 protocol. If you need to reprint, please indicate the site URL or the original address.Any question please contact:yoyou2525@163.com.