How to split dataset into (X_train, y_train), (X_test, y_test)?
How can I overcome this problem?
X_train, X_test, Y_train, Y_test = train_test_split(X, Y, test_size=0.25, stratify=Y, random_state=2)
ValueError: The least populated class in y has only 1 member, which is too few.
The minimum number of groups for any class cannot be less than 2.
This can happen when Y is sorted by value.
Try:
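import numpy as np
from sklearn.model_selection import train_test_split
# Assumes X and Y are NumPy arrays of the same length.
# Reseeding with the same value before each shuffle is what keeps X and Y aligned.
np.random.seed(42)
np.random.shuffle(X)
np.random.seed(42)
np.random.shuffle(Y)
X_train, X_test, Y_train, Y_test = train_test_split(X, Y, test_size=0.25, stratify=Y, random_state=2)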
The random seed can be any value, but it must be the same for X and Y.
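As a hedged alternative to reseeding twice, a single permutation of row indices can be drawn once and applied to both arrays, which makes the pairing explicit. The arrays below are made-up illustration data, and the names rng, perm, X_shuffled and Y_shuffled are hypothetical.

```python
import numpy as np

# Made-up data: row i of X is [2*i, 2*i + 1], and rows 3..5 belong to class 1.
X = np.arange(12).reshape(6, 2)
Y = np.array([0, 0, 0, 1, 1, 1])

# Draw one permutation of the row indices and reuse it for both arrays,
# so every row of X stays paired with its original label in Y.
rng = np.random.default_rng(42)
perm = rng.permutation(len(X))
X_shuffled = X[perm]
Y_shuffled = Y[perm]

print(X_shuffled)
print(Y_shuffled)
```

Note that train_test_split already shuffles by default (shuffle=True), and that stratification depends on how many members each class has, which shuffling does not change.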
Another possible problem is that Y contains only one class.
Try running:
print(np.unique(Y))
to see how many classes there are.
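Since the error message refers to the least populated class having only 1 member, it may be more informative to print how many samples each class has. The sketch below uses made-up data; dropping classes with fewer than 2 members is one possible workaround and an assumption here, not something the original answer proposes. The names rare_classes, keep, X_kept and Y_kept are hypothetical.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Made-up data: class 2 has a single member, which stratify=Y cannot handle.
X = np.arange(18).reshape(9, 2)
Y = np.array([0, 0, 0, 0, 1, 1, 1, 1, 2])

# Count the members of each class.
classes, counts = np.unique(Y, return_counts=True)
for label, count in zip(classes, counts):
    print(f"class {label}: {count} sample(s)")

# Possible workaround (assumption): keep only rows whose class has >= 2 members.
rare_classes = classes[counts < 2]
keep = ~np.isin(Y, rare_classes)
X_kept, Y_kept = X[keep], Y[keep]

X_train, X_test, Y_train, Y_test = train_test_split(
    X_kept, Y_kept, test_size=0.25, stratify=Y_kept, random_state=2
)
print(X_train.shape, X_test.shape)
```

Other options, depending on the data, would be to collect more samples for the rare classes or to split without stratify.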