Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
How to select the C parameter for linear SVMs?
Hi,
I want to use the GridOptimization operator for parameter optimization but in what range should I search the C value for a SVM with linear kernel. Does the C depend on the count of the examples? I sometimes see values between 0 and 30000.
Thanks
I want to use the GridOptimization operator for parameter optimization but in what range should I search the C value for a SVM with linear kernel. Does the C depend on the count of the examples? I sometimes see values between 0 and 30000.
Thanks
0
Answers
no general answer here but trying, sorry. It totally depends on your data. You could have a look into my Gecco 2007 paper to see how much C sometimes might vary for different data sets. as a rule of thumb, you could try (without parameter optimization first) a low value like and 0.0001 and higher values like 1000, 10000, and even 100000. Check if the runtime is ok for you and have a look at the predictive power. Then start with a rough grid parameter optimization to find a good range and start (at least) a second one to further optimize it.
Cheers,
Ingo
no, there are no differences in principle. The LibSVM, however, is already able to work on more than 2 classes in a classification setting. For the JMySVM, you have to wrap a Polynominal2Binominal meta learner around the SVM. Another thing: the kernel parameter sigma (JMySVM) corresponds to the parameter gamma (LibSVM). Please note that sigma is 1 / gamma.
Cheers,
Ingo