[英]Random seed on SVM sklearn produces different results
當我運行SVM時,即使使用固定的random_state=42
,我也會得到不同的結果。
我有10個類別和200個示例的數據集。 我的數據集的尺寸dim_dataset=(200,2048)
這是我的代碼:
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.svm import LinearSVC
from sklearn import svm
import random
random.seed(42)
def shuffle_data(x,y):
idx = np.random.permutation(len(x))
x_data= x[idx]
y_labels=y[idx]
return x_data,y_labels
d,l=shuffle_data(dataset,true_labels) # dim_d=(200,2048) , dim_l=(200,)
X_train, X_test, y_train, y_test = train_test_split(d, l, test_size=0.30, random_state=42)
# hist intersection kernel
gramMatrix = histogramIntersection(X_train, X_train)
clf_gram = svm.SVC(kernel='precomputed', random_state=42).fit(gramMatrix, y_train)
predictMatrix = histogramIntersection(X_test, X_train)
SVMResults = clf_gram.predict(predictMatrix)
correct = sum(1.0 * (SVMResults == y_test))
accuracy = correct / len(y_test)
print("SVM (Histogram Intersection): " + str(accuracy) + " (" + str(int(correct)) + "/" + str(len(y_test)) + ")")
# libsvm linear kernel
clf_linear_kernel = svm.SVC(kernel='linear', random_state=42).fit(X_train, y_train)
predicted_linear = clf_linear_kernel.predict(X_test)
correct_linear_libsvm = sum(1.0 * (predicted_linear == y_test))
accuracy_linear_libsvm = correct_linear_libsvm / len(y_test)
print("SVM (linear kernel libsvm): " + str(accuracy_linear_libsvm) + " (" + str(int(correct_linear_libsvm)) + "/" + str(len(y_test)) + ")")
# liblinear linear kernel
clf_linear_kernel_liblinear = LinearSVC(random_state=42).fit(X_train, y_train)
predicted_linear_liblinear = clf_linear_kernel_liblinear.predict(X_test)
correct_linear_liblinear = sum(1.0 * (predicted_linear_liblinear == y_test))
accuracy_linear_liblinear = correct_linear_liblinear / len(y_test)
print("SVM (linear kernel liblinear): " + str(accuracy_linear_liblinear) + " (" + str(
int(correct_linear_liblinear)) + "/" + str(len(y_test)) + ")")
我的代碼有什么問題?
使用numpy.random.seed()
代替簡單的random.seed
如下所示:
np.random.seed(42)
Scikit內部使用numpy生成隨機數,因此僅執行random.seed不會影響numpy仍然是隨機的行為。
請查看以下鏈接以獲得更好的理解:
聲明:本站的技術帖子網頁,遵循CC BY-SA 4.0協議,如果您需要轉載,請注明本站網址或者原文地址。任何問題請咨詢:yoyou2525@163.com.