SVC sigmoid kernel is not working properly #18955

marmor97 · 2020-12-02T09:36:47Z

Describe the bug

I am testing an SVM with a sigmoid kernel on the iris data. Its performance is extremely poor with an accuracy of 25 %. I'm using exactly the same code and normalizing the features as https://towardsdatascience.com/a-guide-to-svm-parameter-tuning-8bfe6b8a452c (sigmoid section) which should increase performance substantially. However, I am not able to reproduce his results and the accuracy only increases to 33 %.

Using other kernels (e.g linear kernel) produces good results (accuracy of 82 %).
Could there be an issue within the SVC(kernel = 'sigmoid') function?

Steps/Code to Reproduce

##sigmoid iris example
from sklearn import datasets 
iris = datasets.load_iris()
from sklearn.svm import SVC 

sepal_length = iris.data[:,0] 
sepal_width = iris.data[:,1]

#assessing performance of sigmoid SVM
clf = SVC(kernel='sigmoid') 
clf.fit(np.c_[sepal_length, sepal_width], iris.target) 
pr=clf.predict(np.c_[sepal_length, sepal_width])
pd.DataFrame(classification_report(iris.target, pr, output_dict=True))

from sklearn.metrics.pairwise import sigmoid_kernel 
sigmoid_kernel(np.c_[sepal_length, sepal_width]) 

#normalizing features
from sklearn.preprocessing import normalize 
sepal_length_norm = normalize(sepal_length.reshape(1, -1))[0] 
sepal_width_norm = normalize(sepal_width.reshape(1, -1))[0] 
clf.fit(np.c_[sepal_length_norm, sepal_width_norm], iris.target) 
sigmoid_kernel(np.c_[sepal_length_norm, sepal_width_norm]) 

#assessing perfomance of sigmoid SVM with normalized features
pr_norm=clf.predict(np.c_[sepal_length_norm, sepal_width_norm])
pd.DataFrame(classification_report(iris.target, pr_norm, output_dict=True))

Versions

System:
python: 3.8.6 (default, Oct 8 2020, 14:06:32) [Clang 12.0.0 (clang-1200.0.32.2)]
executable: /Users/Marie/Desktop/5 semester/Bachelor/Bachelor_vcode/hello/.venv/bin/python
machine: macOS-10.16-x86_64-i386-64bit

Python dependencies:
pip: 20.2.1
setuptools: 49.2.1
sklearn: 0.23.2
numpy: 1.19.2
scipy: 1.5.3
Cython: None
pandas: 1.1.3
matplotlib: 3.3.2
joblib: 0.17.0
threadpoolctl: 2.1.0

Built with OpenMP: True

jeremiedbb · 2020-12-22T13:26:35Z

The default value of the gamma parameter changed in version 0.22. The blog post you linked was written before 0.22 so gamma="auto" by default. You're on version 0.23.2 so ```gamma="scale"`` by default. Changing gamma to "auto" retreive the same scores as in the blog post. Besides if you want to tune the gamma parameter, you should do it by cross-validation, not just by looking for the best score on the training set.

cmarmo · 2021-01-18T15:15:04Z

Hi @marmor97 , it seems to me that your question was answered and you are happy with this answer... :)
I'm closing this issue. Feel free to reopen if you think something still need to be solved.

marmor97 added the Bug: triage Reported bugs that are not confirmed label Dec 2, 2020

marmor97 closed this as completed Dec 2, 2020

marmor97 reopened this Dec 2, 2020

cmarmo closed this as completed Jan 18, 2021

Feb	MAR	Apr
	26
2022	2023	2024

SVC sigmoid kernel is not working properly #18955

SVC sigmoid kernel is not working properly #18955

marmor97 commented Dec 2, 2020 •

edited

jeremiedbb commented Dec 22, 2020 •

edited

cmarmo commented Jan 18, 2021

SVC sigmoid kernel is not working properly #18955

SVC sigmoid kernel is not working properly #18955

Comments

marmor97 commented Dec 2, 2020 • edited

Describe the bug

Steps/Code to Reproduce

Versions

jeremiedbb commented Dec 22, 2020 • edited

cmarmo commented Jan 18, 2021

marmor97 commented Dec 2, 2020 •

edited

jeremiedbb commented Dec 22, 2020 •

edited