"declared id attributes used in clustering?"

pep · May 2011

Hi, can anyone help with the following questions pls?
Is a (numeric) attribute whose role is declared to be id, used by default by the software in building clusters (by the attribute participating in the computation of distances. etc)? What about building a supervised learning model as a:
- decision tree - does the implemented algorithm compute by default the gain ratio for an id attribute?
- naive bayes classifiers - does the algorithm compute conditional probabilities (and implicitly sample means and standard deviations) in the case of the declared id attribute?

cheers

IngoRM · May 2011

Hi,

whose role is declared to be id, used by default by the software in building clusters (by the attribute participating in the computation of distances. etc)?

no, in general, attributes with the role "id" are only used for identification purposes like in the plotters but never for data mining schemes. For modeling, usually only the regular attributes (i.e. no specific role), the label, and sometimes the weight are used.

What about building a supervised learning model as a:

Same here as for clustering.

- decision tree - does the implemented algorithm compute by default the gain ratio for an id attribute?

No, this will not happen.

- naive bayes classifiers - does the algorithm compute conditional probabilities (and implicitly sample means and standard deviations) in the case of the declared id attribute?

Dito.

Cheers,
Ingo

Howdy, Stranger!

Quick Links

Categories

Altair RapidMiner Community

GET HELP. LEARN BEST PRACTICES. NETWORK WITH YOUR PEERS.

"declared id attributes used in clustering?"

Answers