The RapidMiner community is on read-only mode until further notice. Technical support via cases will continue to work as is. For any urgent licensing related requests from Students/Faculty members, please use the Altair academic forum here.

[Solved] Data splitting for SVM parameters selection vs Neural Networks

njasajnjasaj Member Posts: 18 Contributor II
edited June 2019 in Help
Hi,
I have a question about SVM parameter selection and data spliting. I need to  know that is it sufficient and efficient to split the whole data set into just 2 part (train set and test set) then use cross validation on train set and select the C and gamma which to lead the best performance. Some people split data to 3 sets( train, cross-validation, test) for neural networks and select the parameters when the performance of cross-validation start to reduce and train performance increase.Is split the data set into just 2 parts and cross validate on train set good enough and acceptable for  modeling with support vector machine or the same procedure(split into 3 set) should be done for SVM? Is splitting into 2 part procedure applicable to Neural Network?

Thanks.
Tagged:

Answers

  • MariusHelfMariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    Hi,

    since the cross validation already performs the split into training and test set it should be sufficient to use *only* the cross validation and not split the data manual at all. If you are unsure, just increase the number of folds in the X-Validation.

    Best regards,
    Marius
  • njasajnjasaj Member Posts: 18 Contributor II
    Thank you Marius.
Sign In or Register to comment.