Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.

# Twoing Algorithm

Member Posts: 1 Learner III
edited November 2018 in Help

Hi Gurus,

You have to help me. We have a projekt for CART(Classification and Regression Trees) (Twoing Algortihm or Gini Algorithm). But ıf we could look for the twoing algorithm in RapidMiner Studion we found nothing. which algorithm is the best to them.

We thought that the bests are Classification by Regression or 'Classification and Decision Trees'. Could you recommend one of them or anything else

best regards

Eser Baybek

Tagged:

• Administrator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 364 RM Data Scientist

We have Decision Tree, Random Tree operators for CART.

When you grow the tree based on some splitting criteria or impurity measurements, you can choose from

• information_gain: The entropy of all the attributes is calculated. The attribute with minimum entropy is selected for split. This method has a bias towards selecting attributes with a large number of values.
• gain_ratio: It is a variant of information gain. It adjusts the information gain for each attribute to allow the breadth and uniformity of the attribute values.
• gini_index: This is a measure of impurity of an ExampleSet. Splitting on a chosen attribute gives a reduction in the average gini index of the resulting subsets.
• accuracy: Such an attribute is selected for split that maximizes the accuracy of the whole Tree.

According to the definition by IBM ftp://public.dhe.ibm.com/software/analytics/spss/support/Stats/Docs/Statistics/Algorithms/13.0/TREE-CART.pdf

When the number of classes in your example, C=2, twoing is equivalent to the usual impurity index. A quick check on the classic R library RPART (Recursive Partitioning and Regression Trees) , and twoing is neither a part of RPART. But this paper about rpartOrdinal

offer some twoing options in R. http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2899711/

And as you may know it is quite easy to integrate your R codes in rapidminer.

Hope this hepls.