# [SOLVED] Nominal To Number Operator

Hi All

I'm trying to massage data in order to run a clustering algorithm on it.

The dataset I have has many nominal attributes, and I wish to convert them to numbers so that the clustering algorithm works correctly.

I have used the nominal to number operator, but I am having problems: it produces dummy values instead of replacing each nominal value with a number.

What I would like is something like the table below, where each number actually represents a value.

I am unable to get this working at present. Can anyone help me out? It's a bit of a show stopper at present.

| Old Value | Converted Value |
|-----------|-----------------|
| CH        | 1               |
| IE        | 2               |
| CH        | 1               |
| DE        | 3               |
| IL        | 4               |

## Answers

1,869 Unicorn

Best regards,

Marius

28 Contributor II

Thank you for the quick response.

This was originally what I was doing. I had a Read Database operator linked to a Select Attributes operator, which selected a column that had only 3 available nominal values.

I then connected up the Nominal To Numeric operator.

What I was expecting was:

| Value   | Converted Value |
|---------|-----------------|
| Value_1 | 1               |
| Value_2 | 2               |
| Value_2 | 2               |

What I got instead was:

|       | Value_1 | Value_2 |
|-------|---------|---------|
| row 1 | 1       | 0       |
| row 2 | 0       | 1       |
| row 3 | 0       | 1       |

This looks more like what I would expect from a nominal to binomial operator.
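The output described above is what dummy coding produces: one 0/1 column per distinct nominal value. As a rough illustration outside RapidMiner, here is a minimal Python sketch of that idea. The function name `dummy_code` is made up for this example and is not part of any RapidMiner API.

```python
def dummy_code(values):
    """Return (column_names, rows) with one 0/1 column per distinct value.

    Hypothetical helper for illustration only; it is not how RapidMiner
    implements its operator.
    """
    # Keep the distinct values in order of first appearance.
    categories = list(dict.fromkeys(values))
    # Each row gets a 1 in the column matching its value and 0 elsewhere.
    rows = [[1 if value == category else 0 for category in categories]
            for value in values]
    return categories, rows


columns, rows = dummy_code(["Value_1", "Value_2", "Value_2"])
print(columns)
for row in rows:
    print(row)
```

For the three-row column in this post, the sketch yields the columns `Value_1` and `Value_2` with the rows `[1, 0]`, `[0, 1]`, `[0, 1]`, which matches the table above.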

Again, thanks for your time.

It is a very frustrating problem.

458 Unicorn

I reckon the coding type should be "unique integers" as in the following

regards

Andrew
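To make the "unique integers" idea concrete, here is a minimal Python sketch that assigns one integer to each distinct nominal value. It is only an illustration under assumptions: the function name `unique_integer_code` and the `start` parameter are hypothetical, and numbering starts at 1 purely to match the table in the question. The numbers RapidMiner itself assigns may differ.

```python
def unique_integer_code(values, start=1):
    """Map each distinct value to one integer, in order of first appearance.

    Hypothetical helper for illustration; `start=1` is an assumption chosen
    to match the example table in the question.
    """
    mapping = {}
    for value in values:
        if value not in mapping:
            # The next unused integer goes to each newly seen value.
            mapping[value] = start + len(mapping)
    return [mapping[value] for value in values], mapping


codes, mapping = unique_integer_code(["CH", "IE", "CH", "DE", "IL"])
print(mapping)
print(codes)
```

With the country codes from the question, this maps `CH` to 1, `IE` to 2, `DE` to 3 and `IL` to 4, so the column becomes `[1, 2, 1, 3, 4]`.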

1,869 Unicorn

The dummy coding overcomes this and is the method of choice if you want to apply clustering, linear regression or any other algorithm that depends only on numerical values.

Best regards,

Marius
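One likely reading of the reply above is that integer codes give nominal values an order and spacing they do not really have, which distance-based algorithms such as clustering then treat as meaningful. The Python sketch below compares Euclidean distances under both codings for the country codes from the question. The integer assignment (CH=1, IE=2, DE=3, IL=4) is taken from the question's table, and all names in the code are illustrative.

```python
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length numeric vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


countries = ["CH", "IE", "DE", "IL"]

# Unique-integer coding: a single numeric column with values 1..4.
int_code = {c: [i + 1] for i, c in enumerate(countries)}

# Dummy coding: one 0/1 column per country.
one_hot = {c: [1 if c == other else 0 for other in countries]
           for c in countries}

# With integers, IL looks three times as far from CH as IE does.
int_ch_ie = euclidean(int_code["CH"], int_code["IE"])
int_ch_il = euclidean(int_code["CH"], int_code["IL"])

# With dummy coding, every pair of different countries is equally far apart.
hot_ch_ie = euclidean(one_hot["CH"], one_hot["IE"])
hot_ch_il = euclidean(one_hot["CH"], one_hot["IL"])

print(int_ch_ie, int_ch_il)
print(hot_ch_ie, hot_ch_il)
```

Under the integer coding, the CH to IL distance is 3 while CH to IE is 1, although nothing in the data says those countries are more or less alike. Under dummy coding, both distances are equal (the square root of 2), so no artificial ordering leaks into the clustering.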

458 Unicorn

regards

Andrew

28 Contributor II

What you say makes a lot of sense. It appears I was heading down this road, which would for sure have given me poor output from the model.

Thank you very much for your time.