Patent and Scientific Papers Analysis
Thank you. :-)
I downloaded this software because it is open source, unlike SPSS.
I would like to analyze patents and scientific papers with this software, using data downloaded from PatStat (EPO Global Patent MySQL Database) and from Google Scholar (through Publish or Perish), Web of Science (WOS), or Scopus.
I wonder how good the text mining capabilities of this software are, and whether I will be able to do analyses like these:
Thanks for letting me know. For all types of text analytics, you will need the Text Mining extension for RapidMiner which you can download for free from our Marketplace. You can find it in the menu “Extensions” – “Marketplace” and type “Text” in the search box (here is also a link directly to our marketplace: https://marketplace.rapidminer.com/UpdateServer/faces/product_details.xhtml?productId=rmx_text). There are also many more extensions on our Marketplace so make sure that you check them out…
There is a community member who created a nice set of tutorials for text analysis with RapidMiner: http://vancouverdata.blogspot.com/2010/11/text-analytics-with-rapidminer-loading.html
From the links you posted, it looks like you are mainly interested in relationships such as who cited whom or which terms are frequently used together. RapidMiner can definitely be used to create all the necessary data sets for this.
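To make the "terms frequently used together" idea concrete, here is a minimal plain-Python sketch of document-level co-occurrence counting. It is not how the Text Mining extension works internally; the function names, the stopword set, and the sample abstracts are all made up for illustration.

```python
from collections import Counter
from itertools import combinations
import re


def tokenize(text):
    """Lowercase a text and split it into alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cooccurrence_counts(documents, stopwords=frozenset()):
    """Count, for each unordered term pair, how many documents contain both terms."""
    counts = Counter()
    for doc in documents:
        # Use a set so a term repeated within one document is counted once,
        # and sort so each pair always appears in the same (alphabetical) order.
        terms = sorted(set(tokenize(doc)) - stopwords)
        counts.update(combinations(terms, 2))
    return counts


# Hypothetical abstracts standing in for real patent or paper records.
abstracts = [
    "Solar cell efficiency improved with perovskite layers",
    "Perovskite solar cell stability under humidity",
    "Battery electrode stability",
]
pairs = cooccurrence_counts(abstracts, stopwords=frozenset({"with", "under"}))

for pair, n in pairs.most_common(3):
    print(pair, n)
```

The resulting pair counts can be written out as a three-column table (term A, term B, count), which is the usual input shape for network or clustering tools.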
On your two questions:
- Yes, you can create your own blocks (or “operators” as we call them). You can find more information on our doc server: http://docs.rapidminer.com/developers/
In addition to creating your own extensions in Java, you can also invoke command-line calls or embed scripts written in R, Python, or Groovy directly in the process.
- No, there is currently no RAM limitation on the free version.
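As a rough illustration of the embedded-script option mentioned above, the sketch below builds a "who cited whom" edge list in Python. It assumes the script runs inside the Python scripting operator, where the entry point is a function named `rm_main` that receives and returns pandas DataFrames; please check the operator's documentation for your version. The column names `id` and `cited_ids` and the semicolon separator are hypothetical and depend on how your export is laid out.

```python
def citation_edges(records):
    """Turn (citing_id, 'cited1;cited2') records into (citing, cited) edges."""
    edges = []
    for citing_id, cited_field in records:
        for cited_id in cited_field.split(";"):
            cited_id = cited_id.strip()
            if cited_id:  # skip empty entries, e.g. records with no citations
                edges.append((citing_id, cited_id))
    return edges


def rm_main(data):
    # Assumption: `data` is a pandas DataFrame with the hypothetical columns
    # "id" (the citing document) and "cited_ids" (semicolon-separated IDs).
    import pandas as pd  # imported here so the helper above works without pandas

    records = zip(data["id"], data["cited_ids"].fillna(""))
    return pd.DataFrame(citation_edges(records), columns=["citing", "cited"])


# Stand-alone check of the helper with made-up document IDs.
sample = [("P1", "P2; P3"), ("P2", ""), ("P3", "P2")]
print(citation_edges(sample))
```

Keeping the core logic in a plain function makes it easy to test outside the process before wiring it into the operator.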