RapidMiner

How to recode dummy coded variables to useful data?

Contributor

How to recode dummy coded variables to useful data?

Dear Data Mining and Rapid Miner Experts,

 

I have a dataset that contains both numerical and nominal data and I want to do linear regression with it. The thing is, I am not sure how to handle nominal data in this case.

 

I have tried to convert nominal data to numerical data but how can I recode the dummy coded variables to useful data so that I can use it in my Linear Regression operator (as in combine with the process that I have done with numerical data) to generate performance?

1 REPLY
Highlighted
RM Certified Expert

Re: How to recode dummy coded variables to useful data?

If you have more information on what the dummy values originally corresponded to, you can use the Map operator or Replace operator or Generate Attributes operator to repopulate the data with numerical values.  Of course, that only applies if the original nominal categories corresponded to ranges of a numeric variable, for instance.  If they are truly nominal in nature (e.g., unordered categories) then the dummy attribute coding for each category is the best form for them in a regression framework.

 

 

Brian T., Lindon Ventures - www.lindonventures.com
Analytics Consulting by Certified RapidMiner Analysts