Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

Speeding up Relevance Vector Mahine

edwinanto2003edwinanto2003 Member Posts: 1 Learner III
edited November 2018 in Help
Hello,
I am relatively new to Rapidminer and am learning through video tutorials. I am trying to perform a text classification on the movie-review dataset which has about 1000 positive and negative reviews.
I wanted to know if there is a way I can speed up the process. I tried running the algorithm on the whole dataset and it keeps running out of memory. I referred to earlier problems and increased the MAX_JAVA_MEMORY in
the script files to about 4 gb and and tried running a subset of 100 files. The process has been running for 2 days now. Please let me know if there is a way I can speed up the process or if it would be even possible to
classify on the whole dataset (2000 files - positive and negative reviews).
Any help would be greatly appreciated. Thank you very much .

Answers

  • MariusHelfMariusHelf RapidMiner Certified Expert, Member Posts: 1,869 Unicorn
    Hi,

    did you try an SVM with linear kernel instead? For an SVM it should be no problem to handle 2000 examples. Just keep in mind that for good results with the SVM you have to optimize the C parameter. A good range you should try is 1e-6 to 1 or 10 on a logarithmic scale.


    Best regards,
    Marius
Sign In or Register to comment.