Discretize by Density
michaelhecht
Member Posts: 89 Maven
In the Bayesian network software GeNIe there is a discretisation method where you give the number of bins and the bins are placed around the densest areas of an attribute. For example, if your attribute contains two or three separable Gaussian distributions and you define three bins, the clusters are hierarchical, i.e. density-based, with one placed around each Gaussian.
It would be nice to have this also in RapidMiner.
It seems that entropy-based discretisation is comparable, but the number of bins cannot be preselected.
Answers
Using hierarchical clustering on a data set containing only the attribute to be discretized should yield the desired result. Simply flatten the cluster model afterwards, specifying the number of discrete values you would like to obtain. Please find attached a process that shows how it works:
Best,
Tobias
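For readers who want to try the idea outside RapidMiner, here is a minimal Python sketch of the approach described above. It is not the attached process. It assumes single-linkage clustering, for which cutting the hierarchy of a single attribute into k clusters amounts to splitting the sorted values at the k - 1 widest gaps. The function name `discretize_by_gaps` and the sample data are made up for illustration.

```python
import bisect


def discretize_by_gaps(values, n_bins):
    """Split a 1-D attribute into n_bins at its widest gaps.

    Assumption: single-linkage hierarchical clustering. On one attribute,
    flattening that hierarchy to n_bins clusters is the same as cutting
    the sorted values at the n_bins - 1 largest gaps.
    Returns (cut_points, labels), where labels[i] is the bin of values[i].
    """
    ordered = sorted(set(values))
    if not 1 <= n_bins <= len(ordered):
        raise ValueError("n_bins must be between 1 and the number of distinct values")

    # Gap between each pair of neighbouring distinct values, with its position.
    gaps = [(ordered[i + 1] - ordered[i], i) for i in range(len(ordered) - 1)]

    # Keep the n_bins - 1 widest gaps and place a cut in the middle of each.
    widest = sorted(gaps, reverse=True)[: n_bins - 1]
    cut_points = sorted((ordered[i] + ordered[i + 1]) / 2 for _, i in widest)

    # A value's bin index is the number of cut points lying below it.
    labels = [bisect.bisect_left(cut_points, v) for v in values]
    return cut_points, labels


# Hypothetical attribute with three well-separated groups of values.
values = [1.0, 1.2, 0.9, 5.1, 5.0, 4.8, 9.7, 10.1, 10.0]
cut_points, labels = discretize_by_gaps(values, n_bins=3)
print(cut_points)  # two cut points, near 3.0 and 7.4
print(labels)      # [0, 0, 0, 1, 1, 1, 2, 2, 2]
```

Single linkage reacts strongly to outliers, because one far-away value can claim a bin of its own. A different linkage criterion may behave better on noisy data, but it would not reduce to this simple gap rule.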