RapidMiner

Which stopping criterion in Discretize by Entropy?

pep
Contributor II

Hi
The specification of the discretize by entropy operator says: "The discretization is performed by selecting a bin boundary minimizing the entropy in the induced partitions. The method is then applied recursively for both new partitions until the stopping criterion is reached."
Which stopping criterion does this implementation use (an entropy threshold, or a maximum number of bins?), and how can its parameter be set, please?
Cheers.
1 REPLY
Elite

Re: Which stopping criterion in Discretize by Entropy?

Hi,
Actually, I don't know myself, but the source code says you should refer to

a) Multi-Interval Discretization of Continuous-Valued Attributes for Classification Learning (Fayyad, Irani) and
b) Supervised and Unsupervised Discretization of Continuous Features (Dougherty, Kohavi, Sahami).
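
For reference, the stopping rule proposed in a) is based on the minimum description length principle: a candidate boundary is accepted only if its information gain exceeds a threshold computed from the partition itself, so there is no user-set entropy threshold or bin count in the paper's version. The sketch below is plain Python, not RapidMiner code. All function names are hypothetical, and whether the operator follows the paper exactly is an assumption that would need checking against the source.

```python
import math
from collections import Counter

def entropy(labels):
    """Class entropy (in bits) of a list of labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())

def best_split(values, labels):
    """Index i minimizing the weighted entropy of labels[:i] and labels[i:].
    Expects values sorted ascending; returns None if no boundary exists."""
    n = len(values)
    best = None
    for i in range(1, n):
        if values[i] == values[i - 1]:
            continue  # a boundary must lie between two distinct values
        e = (i / n) * entropy(labels[:i]) + ((n - i) / n) * entropy(labels[i:])
        if best is None or e < best[0]:
            best = (e, i)
    return None if best is None else best[1]

def mdlp_accepts(labels, i):
    """MDL-based acceptance test from Fayyad and Irani for a split at index i."""
    n = len(labels)
    left, right = labels[:i], labels[i:]
    ent, ent_l, ent_r = entropy(labels), entropy(left), entropy(right)
    gain = ent - (len(left) / n) * ent_l - (len(right) / n) * ent_r
    k, k1, k2 = len(set(labels)), len(set(left)), len(set(right))
    delta = math.log2(3 ** k - 2) - (k * ent - k1 * ent_l - k2 * ent_r)
    return gain > (math.log2(n - 1) + delta) / n

def discretize(values, labels):
    """Return sorted cut points found by recursive entropy-minimizing splits."""
    pairs = sorted(zip(values, labels), key=lambda p: p[0])
    vals = [v for v, _ in pairs]
    labs = [l for _, l in pairs]
    cuts = []

    def recurse(lo, hi):
        i = best_split(vals[lo:hi], labs[lo:hi])
        if i is None or not mdlp_accepts(labs[lo:hi], i):
            return  # stopping criterion reached for this partition
        cuts.append((vals[lo + i - 1] + vals[lo + i]) / 2)
        recurse(lo, lo + i)
        recurse(lo + i, hi)

    recurse(0, len(vals))
    return sorted(cuts)

# Two cleanly separated classes yield one boundary between them.
print(discretize(list(range(1, 11)), ["a"] * 5 + ["b"] * 5))
```

With alternating class labels no boundary passes the test, so the attribute stays in a single bin.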

Tell me if you know...

Greetings,
  Sebastian