Creating a new clustering algorithm with Java and RapidMiner
Hi, I'm new to RapidMiner. I have a project that consists of building a new clustering algorithm in Java. For example, my model starts by applying K-means and then adds other techniques on top of it. I don't know how to approach this. I searched the net and I still don't understand: must I integrate RapidMiner into my Java code and use the algorithms implemented in RapidMiner, ideally still being able to see the same visualization as in RapidMiner (in my case the clustered data) so that I can compare my algorithm with existing ones? Or must I build the new model in RapidMiner and then add it to my Java application?
I have integrated RapidMiner in Eclipse as described at http://rapid-i.com/content/view/25/48/lang,en/.
I didn't understand what this is useful for. Can I, for example, use the K-means algorithm from RapidMiner and retrieve the results in my Java code?
I am confused. Can you help me figure out how to start?
Best regards.
Answers
I had the same problem when I started. Fortunately, all of RapidMiner is written in Java, so you can integrate it into your application by using its source code. The easiest way is to create the process in RapidMiner, run it from your application, and print the results to the console. Everything you do in the RapidMiner GUI you can also do from Eclipse.
This page may help with understanding the basics: http://rapid-i.com/wiki/index.php?title=Integrating_RapidMiner_into_your_application
Did that help a bit?
Thank you for the answer. I have integrated RapidMiner in Eclipse. My problem is that I cannot find the starting point. I must create a new clustering algorithm. First, I want to apply k-means directly in RapidMiner (since I can now start RapidMiner from Eclipse) and then retrieve the results (the clusters and their contents) in my Java code, so that I can continue from there and apply other techniques. I don't know whether this is feasible or how to do it, since I am also a beginner in Java. I looked for documentation but found nothing; RapidMiner's documentation is lacking.
Thank you
I would suggest what I always suggest:
Create the process(es) you need in the RapidMiner GUI, then execute the process on your data from Java and continue to work with the results. To see how this is done, see for example here.
If you want to create your own operator, have a look at the existing operators (check the OperatorsCore.xml file to see the classes behind the RM GUI operators) and go from there.
Regards,
Marco
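The "continue to work with the results" step is ordinary Java once the cluster assignments are available as plain data. The sketch below illustrates that shape only. To stay runnable without RapidMiner on the classpath it uses a tiny hand-written Lloyd's k-means, not RapidMiner's operator, and every name in it (`KMeansThenRefine`, `kMeans`, `groupByCluster`) is hypothetical. In a real integration, the assignments would come from the executed RapidMiner process instead.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

public class KMeansThenRefine {

    /** Plain Lloyd's k-means. Initial centroids are the first k points, so runs are deterministic. */
    public static int[] kMeans(double[][] points, int k, int maxIterations) {
        int dim = points[0].length;
        double[][] centroids = new double[k][];
        for (int c = 0; c < k; c++) {
            centroids[c] = points[c].clone();
        }
        int[] assignment = new int[points.length];
        Arrays.fill(assignment, -1);

        for (int iter = 0; iter < maxIterations; iter++) {
            // Assignment step: attach each point to its nearest centroid.
            boolean changed = false;
            for (int i = 0; i < points.length; i++) {
                int best = 0;
                double bestDist = Double.MAX_VALUE;
                for (int c = 0; c < k; c++) {
                    double dist = 0.0;
                    for (int j = 0; j < dim; j++) {
                        double diff = points[i][j] - centroids[c][j];
                        dist += diff * diff;
                    }
                    if (dist < bestDist) {
                        bestDist = dist;
                        best = c;
                    }
                }
                if (assignment[i] != best) {
                    assignment[i] = best;
                    changed = true;
                }
            }
            if (!changed) {
                break; // converged: no point switched cluster
            }
            // Update step: move each centroid to the mean of its points.
            double[][] sums = new double[k][dim];
            int[] counts = new int[k];
            for (int i = 0; i < points.length; i++) {
                counts[assignment[i]]++;
                for (int j = 0; j < dim; j++) {
                    sums[assignment[i]][j] += points[i][j];
                }
            }
            for (int c = 0; c < k; c++) {
                if (counts[c] > 0) {
                    for (int j = 0; j < dim; j++) {
                        centroids[c][j] = sums[c][j] / counts[c];
                    }
                }
            }
        }
        return assignment;
    }

    /** Turns per-row assignments into "cluster id -> row indices", the input for a follow-up step. */
    public static Map<Integer, List<Integer>> groupByCluster(int[] assignment) {
        Map<Integer, List<Integer>> clusters = new TreeMap<>();
        for (int i = 0; i < assignment.length; i++) {
            clusters.computeIfAbsent(assignment[i], key -> new ArrayList<>()).add(i);
        }
        return clusters;
    }

    public static void main(String[] args) {
        double[][] data = { {1.0, 1.0}, {9.0, 9.0}, {1.2, 0.8}, {8.8, 9.1}, {0.9, 1.1} };
        Map<Integer, List<Integer>> clusters = groupByCluster(kMeans(data, 2, 100));
        // A custom refinement step (merging, splitting, re-scoring) would start from this map.
        for (Map.Entry<Integer, List<Integer>> entry : clusters.entrySet()) {
            System.out.println("cluster " + entry.getKey() + " -> rows " + entry.getValue());
        }
    }
}
```

Keeping the clusters in a plain `Map` means the custom part of the algorithm does not depend on where the initial k-means result came from, which also makes it easier to compare against other starting clusterings.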
Do you use an IDE? If not, please consider using one, for example Eclipse, and press Ctrl+Shift+O while in your Java class. That will organize the imports. Your class is missing required imports.
Regards,
Marco
I use Eclipse. It does not add any imports and does not ask about missing ones. It shows me three errors:
1) It does not recognize command_line: "command_line cannot be resolved or is not a field". When I looked at the options Eclipse proposes, shouldn't it be COMMAND_LINE? What does it mean here?
2) "Cannot instantiate the Process type" for the statement that creates the process.
3) "The method run(IOContainer) is undefined for the Process type" for the statement that runs it. When I looked at the methods available on myProcess, I did not find a run() method.
I could not solve these three problems.
Best Regards
I solved the first problem by rewriting that statement, but the last two problems are still open.
Thanks
As stated in the Development FAQ, integrating RM is not recommended if you're a Java beginner.
The errors show that your imports are messed up.
Regards,
Marco
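The poster's code is not shown in the thread, so the following is only one plausible reading of errors 2) and 3). If a class has no import for RapidMiner's `com.rapidminer.Process`, the simple name `Process` resolves to the JDK's `java.lang.Process`, which is abstract and has no `run` method. The sketch below checks those two JDK facts with reflection. The class and method names (`ProcessNameClash`, `isAbstract`, `hasPublicMethod`) are hypothetical.

```java
import java.lang.reflect.Method;
import java.lang.reflect.Modifier;

public class ProcessNameClash {

    /** True if the type is abstract, i.e. "new Type()" is rejected by the compiler. */
    public static boolean isAbstract(Class<?> type) {
        return Modifier.isAbstract(type.getModifiers());
    }

    /** True if the type exposes at least one public method with the given name. */
    public static boolean hasPublicMethod(Class<?> type, String name) {
        for (Method method : type.getMethods()) {
            if (method.getName().equals(name)) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // No import here, so the simple name Process means java.lang.Process.
        Class<?> resolved = Process.class;
        System.out.println(resolved.getName());                          // java.lang.Process
        System.out.println("abstract: " + isAbstract(resolved));         // abstract: true
        System.out.println("has run: " + hasPublicMethod(resolved, "run")); // has run: false
    }
}
```

If this is the cause, adding `import com.rapidminer.Process;` to the class (or letting Ctrl+Shift+O pick that class rather than the JDK one) would make the simple name refer to RapidMiner's type.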
Thank you, I have corrected the errors. But when I run the program, it shows me an error message. Can you help me?
Please read the available documentation, especially the manual and the "How to extend RapidMiner" whitepaper, before proceeding. You are trying to access a repository that does not exist.
Regards,
Marco