Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
LDA (Latent Dirichlet Allocation) Operator & Gradient Boost
CraigBostonUSA
Employee, Member Posts: 34 RM Team Member
I am trying to perform topic modelling , it would be great to have a native LDA (latent dirichlet allocation) operator and also a Gradient Boost Operator.
Tagged:
4
Comments
rapidminer 7.2 added Gradient Boosted Trees as a new learner
Quoted from the operator help
A gradient boosted model is an ensemble of either regression or classification tree models. Both are forward-learning ensemble methods that obtain predictive results through gradually improved estimations. Boosting is a flexible nonlinear regression procedure that helps improving the accuracy of trees. By sequentially applying weak classification algorithms to the incrementally changed data, a series of decision trees are created that produce an ensemble of weak prediction models. While boosting trees increases their accuracy, it also decreases speed and human interpretability. The gradient boosting method generalizes tree boosting to minimize these issues
There is an LDA operator in the KobRA-Projekt extension at http://kobra.tu-dortmund.de/mediawiki/index.php?title=Software. The documentation is in German.
There's a corpus linguistics plugin 1.1.1 in Marketplace that should offer LDA functionality.
LDA is an extension but not updated (see here); Gradient Boosted Trees released in RM 7.2.