
Patent and Scientific Papers Analysis

IngoRM (RM Founder)



Thank you. :-)

I downloaded this software because it is open source, unlike SPSS.

I would like to do patent and scientific paper analysis with this software, using data downloaded from PatStat (EPO Global Patent MySQL Database) and from Google Scholar (through Publish or Perish), WOS, or Scopus.

I wonder how good the text mining capabilities of this software are, and whether I will be able to do analyses like these:


Thanks for letting me know. For all types of text analytics, you will need the Text Mining extension for RapidMiner, which you can download for free from our Marketplace. To find it, open “Extensions” – “Marketplace” in the menu and type “Text” in the search box. There are also many more extensions on our Marketplace, so make sure that you check them out.


There is a community member who created a nice set of tutorials for text analysis with RapidMiner:


From the links you posted, it looks like you are mainly interested in relationships such as who cited whom or which terms are frequently used together. RM can definitely be used to create all the necessary data sets for this.
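
To make the "terms frequently used together" idea concrete, here is a minimal sketch in plain Python of what such a data set could look like. It is not a RapidMiner process and not how the Text Mining extension works internally. The sample abstracts, the stopword list, and the function names are all made up for illustration; real input would come from your PatStat, WOS, or Scopus exports.

```python
from collections import Counter
from itertools import combinations
import re

# Hypothetical sample abstracts (assumption: real data would be exported
# from PatStat, WOS, or Scopus and loaded here instead).
ABSTRACTS = [
    "Text mining of patent data",
    "Patent citation analysis with text mining",
    "Citation networks in scientific papers",
]

# Tiny illustrative stopword list; a real analysis would use a fuller one.
STOPWORDS = {"of", "with", "in", "the", "and", "a"}


def tokenize(text):
    """Lowercase the text and return its set of distinct non-stopword terms."""
    return {t for t in re.findall(r"[a-z]+", text.lower()) if t not in STOPWORDS}


def cooccurrence_counts(documents):
    """Count, per unordered term pair, how many documents contain both terms."""
    counts = Counter()
    for doc in documents:
        # Sorting makes each pair appear in one canonical (alphabetical) order.
        terms = sorted(tokenize(doc))
        counts.update(combinations(terms, 2))
    return counts


if __name__ == "__main__":
    # Print the five most frequent term pairs with their document counts.
    for pair, n in cooccurrence_counts(ABSTRACTS).most_common(5):
        print(pair, n)
```

The resulting pair counts can be read as a weighted edge list (term, term, weight), which is the usual input format for network-style visualizations. A "who cited whom" table would have the same shape, with citing and cited document IDs in place of terms.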


On your two questions:


  1. Yes, you can create your own blocks (or “operators” as we call them).  You can find more information on our doc server:
    In addition to creating your own extensions in Java, you can also invoke command-line calls or embed scripts written in R, Python, or Groovy directly in the process.
  2. No, there is currently no RAM limitation on the free version.




