# Can we encode categorical data to numerical and then find the correlation in RapidMiner

Can we encode categorical data as numerical and then compute correlations in RapidMiner? If so, please let me know the process.

## Answers

For example, if the data is truly nominal, meaning it has no inherent order (think of colors or names), then a simple numerical replacement, where each category is assigned a successive integer, is misleading: it imposes an order and spacing that the categories do not have. That kind of conversion is only appropriate when the categories correspond to an ordered scale (such as a Likert scale). For other nominal data, use dummy coding, which turns each nominal value into its own zero/one attribute (called a dummy code). You can then run a correlation analysis on those attributes.
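To make the distinction concrete, here is a minimal sketch in Python with pandas rather than a RapidMiner process. The column names and values are made up for illustration, and the assumption that `rating` is an ordered scale is mine. It maps the ordered attribute to integers, dummy codes the unordered one, and then computes Pearson correlations on the result.

```python
import pandas as pd

# Hypothetical toy data: "color" is unordered, "rating" is assumed ordered.
df = pd.DataFrame({
    "color": ["red", "green", "blue", "red", "green", "blue"],
    "rating": ["low", "medium", "high", "high", "low", "medium"],
    "price": [10.0, 12.5, 9.0, 15.0, 11.0, 13.0],
})

# Ordered categories: integer coding that follows the scale is reasonable.
rating_order = {"low": 1, "medium": 2, "high": 3}
df["rating_num"] = df["rating"].map(rating_order)

# Unordered categories: one zero/one column per value (dummy coding).
dummies = pd.get_dummies(df["color"], prefix="color", dtype=int)

# Combine the numeric columns and compute the correlation matrix.
encoded = pd.concat([df[["price", "rating_num"]], dummies], axis=1)
corr = encoded.corr()
print(corr.round(2))
```

Each `color_*` column gets its own row and column in the matrix, so you read the relationship per category instead of through an arbitrary integer code.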

Lindon Ventures

Data Science Consulting from Certified RapidMiner Experts

BTW, this is what the correlation matrix in RapidMiner's Auto Model does. You can open the process and see how it is done on your data. #noblackboxes

Best,

Ingo