Design a model to do data cleaning
Best Answers
lionelderkrikor Moderator, RapidMiner Certified Analyst, Member Posts: 1,195
Unicorn
Hi @JoeJoe,
Do you have access to Turbo Prep in RapidMiner?
If so, you can go to CLEANSE --> AUTO CLEANSING.
Hope this helps,
Regards,
Lionel
IngoRM Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Community Manager, RMResearcher, Member, University Professor Posts: 1,751
RM Founder
Hi,
Probably neither of those two settings would be best. However, association rules need binary input data, so you should first clean the data (without those two settings) and then discretize all numerical attributes into binary bins. Finally, you may need to perform one-hot encoding for nominal attributes with more than two values. The cut-off points for discretization, and which value counts as positive vs. negative, will depend on the business problem you want to solve.
Best,
Ingo
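For readers who want to see the two preparation steps outside the RapidMiner GUI, here is a minimal plain-Python sketch of binary discretization followed by one-hot encoding. It is only an illustration, not RapidMiner's implementation: the helper names, the sample rows, and the cut-off of 30 are all assumptions made up for the example, and the right cut-off depends on the problem being solved.

```python
def binarize_numeric(values, cutoff):
    """Map each number to True if it is at or above the cutoff, else False."""
    return [v >= cutoff for v in values]


def one_hot(values):
    """Turn one nominal column into one binary column per distinct value."""
    categories = sorted(set(values))
    return {cat: [v == cat for v in values] for cat in categories}


def to_binary_table(rows, numeric_cutoffs, nominal_columns):
    """Build a dict of binary columns from a list of row dicts."""
    table = {}
    # Numerical attributes: one binary column per (hypothetical) cut-off.
    for col, cutoff in numeric_cutoffs.items():
        table[f"{col}>={cutoff}"] = binarize_numeric([r[col] for r in rows], cutoff)
    # Nominal attributes: one binary column per distinct value.
    for col in nominal_columns:
        for cat, flags in one_hot([r[col] for r in rows]).items():
            table[f"{col}={cat}"] = flags
    return table


# Made-up sample data, already cleaned (no missing values).
rows = [
    {"age": 23, "color": "red"},
    {"age": 41, "color": "blue"},
    {"age": 35, "color": "green"},
]
binary = to_binary_table(rows, {"age": 30}, ["color"])
```

Each key in `binary` is then a true/false item that an association rule learner can consume.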
Answers