🎉 🎉   RAPIDMINER 9.5 BETA IS OUT!!!   🎉 🎉
GRAB THE HOTTEST NEW BETA OF RAPIDMINER STUDIO, SERVER, AND RADOOP. LET US KNOW WHAT YOU THINK!
🦉 🎤   RapidMiner Wisdom 2020 - CALL FOR SPEAKERS   🦉 🎤
We are inviting all community members to submit proposals to speak at Wisdom 2020 in Boston.
Whether it's a cool RapidMiner trick or a use case implementation, we want to see what you have.
Form link is below and deadline for submissions is November 15. See you in Boston!
Difference between result of Rapid miner and Excel removing duplicates function
I am new in using RM.
I need to remove duplicates from my dataset within preprocessing step.
I have 7621 examples as original set.
I used "remove duplicates' function of excel and got 6830 rows ( examples) as a result.
Since, I` m runing the project in RM , I need to clean my data via its operator. Thus, I used "Remove Duplicates operator" , I have choosen "Project name" attribute and run process. As an outcome I got 6854 examples.
My question is why do I have difference between the resulting examples ( 6854 via RM & 6830 via Excel).
I attached my process to this message and asking support for dealing with this problem, please.
Thank you in advance.