Using RapidMiner with Qlik

by RMStaff on ‎06-22-2016 11:24 AM - edited on ‎01-04-2017 04:35 PM by RMStaff

Question

Hi Ingo,

 

thank you for your message. I am happy to be part of the RapidMiner Community and looking forward to discover and test all different features. Currently I am looking for different ways to enhance QlikView applications with statistical data discovery techniques/tools. Therefore I would appreciate it to get resources about possible integrations of QlikView and RapidMiner, especially in an enterprise domain. Are there any whitepapers or documentations available?

 

Thank you in advance for any information.

Answer

Glad to hear from you.  Here is some information around RapidMiner and Qlik:

 

 

In case of any questions, please feel free to reach out to our community at http://community.rapidminer.com

Season's Greetings from RapidMiner

by Community Manager ‎12-20-2016 04:47 AM - edited ‎12-20-2016 05:40 AM

 

 

We made a little holiday video for you. Greetings from our offices in Boston, Dortmund, Budapest and London

 

 

 

 

 

 

 

 

Beta Mode - Get a Sneak Peak at Future Features in RapidMIner

by Community Manager ‎12-15-2016 04:36 AM - edited ‎12-15-2016 04:38 AM

As part of our RapidMiner Studio 7.3 release, we added an in-product Beta Mode that allows you to take a sneak peek at experimental features that are currently in beta testing for upcoming product releases, with the click of a button. As opposed to a dedicated beta release, the Beta Mode allows you to interactively test features which are more substantial in nature and require broader and more extensive testing. To kick-off our new Beta Mode feature, we are introducing an initial, experimental implementation of a new core data management for the RapidMiner platform for you to try!

 

This page outlines describes how to activate the beta mode the data core and future features. 

 

 

 

Deep Learning Starter

by Community Manager ‎11-11-2016 11:14 AM - edited ‎11-14-2016 06:01 AM

@ingomierswa looks at contrasts between Deep Learning and other machine learning methods and how easy it can be to start with deep learning in RapidMiner.

 

Also, discusses common myths and use cases.

 

VILT Classes Schedule

by Community Manager on ‎10-04-2016 06:21 AM

VILT Classes Schedule

 
DELIVERY STYLE COURSE DATES REGISTER
Weekly Lecture RapidMiner Basics Part 1 Oct 24 – Nov 14 in USD / EUR / GBP
2 Day Class RapidMiner Basics Part 1 Nov 14-15 in USD / EUR / GBP
2 Day Class RapidMiner Basics Part 2 Nov 16-17 in USD / EUR / GBP
Analyst Bootcamp RapidMiner Basics + Certification Nov 14-18 in USD / EUR / GBP
Weekly Lecture RapidMiner Basics Part 2 Nov 28 – Dec 19 in USD / EUR / GBP
Weekly Lecture RapidMiner Basics Part 1 Jan 23 – Feb 13 in USD / EUR / GBP
2 Day Class RapidMiner Basics Part 1 Feb 13-14 in USD / EUR / GBP
2 Day Class RapidMiner Basics Part 2 Feb 15-16 in USD / EUR / GBP
Analyst Bootcamp RapidMiner Basics + Certification Feb 13-17 in USD / EUR / GBP
Weekly Lecture RapidMiner Basics Part 2 Feb 27 – Mar 20 in USD / EUR / GBP

Live-Online Training Program

by Community Manager on ‎09-26-2016 11:23 AM

 

 

To help meet the demand for training on RapidMiner, we are rolling out a series of "Live-Online" courses. This means you can study without having to travel to one of our classroom sessions, while getting the guidance from a real-life instructor (Virtual Instructor-Led Training or VILT). So wherever you are in the world, you can get up to speed and get Certified status on RapidMIner.

 

We appreciate that not all people will want to learn in the say way, so have set up three class types:

 

2 Day Classes

This version consist of 4 hours live online training on two consecutive days (4 hrs. each). The live portion is amended by an additional two times 4 hours of self-study and solving exercises.

 

Weekly Lecture

In this delivery mode you are attending four 2-hour live-training sessions over a period of 4 to 5 weeks (depending on holiday weeks). In between the live session you will have time to work through exercises and preform self-study in preparation for the next session

 

RapidMiner Analyst Bootcamp

In this bundle you attend the “2-day” classes of RapidMiner in one week and then you are granted access to complete the RapidMiner Analyst Certification exam during the following 60 days.

 

More details of these Learning options  and the all the schedules can be found here:  

 

https://rapidminer.com/training/#live

Integration of Rapidminer with Business Intelligence and Dashboard Applications

by RMStaff ‎07-09-2016 03:26 PM - edited ‎07-25-2016 09:29 AM

RapidMiner is a product used by quarter million users for their data science needs. In most datascience projects the final mile is delivering the results to a broader audience, the audience which can act on based on predictions. In RapidMiner we call this operationalization of Predictive analytics. As part of operationalization a very common practice is delivering the results via Dashboards, Reports or Visualizations. The following article describes some of the ways The following article describes some of the ways RapidMiner can provide highly effective solutions for these needs RapidMiner can provide highly effective solutions for these needs

 

1) Rapidminer Web Apps

RapidMiner server provides dashboard, reporting and business Intelligence capabilities that can be tied with RapidMiner workflows to create a really powerful solution. Advanced analytics solutions can be delivered in browser based rich HTML5 based solution. All of this is done using configurations and also can be customized with advanced Javascript and CSS control. Details about how to use RapidMiner Webapps can be found here

 

Web apps example.gif

 

 

2) Integration with Tableau

Rapidminer provides following three ways to integrate with Tableau.

a) Tableau Data Extract One way is use the "Tableau Data Writer" operator, this method allows creating Tableau Data extract file using RapidMiner  workflows. Such processes can then be scheduled to push updates to Tableau data files thereby updating the Dashboards. Details about how to use this method are available here

You can download the extension from our marketplace or here

 

b) Odata Service : Tableau provides ability to read data from an OData webservice. RapidMiner server provides ability to expose RapidMiner workflows as a webservice that can provide data using OData format. Details about creating webservices are available here and here. Ensure that when you create webservice chase the output format as Odata.

 

c) Tableau Web Data Connector : Tableau provides extensibility with with Web Data Connector. If this data source is preferred you can still use the webservices exposed from RapidMiner , by handling the data with web data connector API.Details about creating webservices are available here and here

 

3) Integration with Qlikview & QlikSense

RapidMiner  provides atleast two ways to integrate with Qlikview.

a) QVX (Qlikview Extract) -  This methods allows writing data out to Qlikview extract formats that can be used in both Qlikview & Qliksense applications.Such process can then be scheduled to push updates to Qlik Extract files, thereby uploading the dashboards.

You will need to download the extension from here or from marketplace

Details are available here http://docs.rapidminer.com/server/advanced-topics/qlikview-integration/

 

b) Web Service - RapidMiner provides ability to expose any processes as a webservice. Each webservices can return data in various formats including XML, JSON, HTML table etc. Using this technique we can pull data directly from RapidMiner server based process into a Qlikview or QlikSense dashboard. Details about this method are available here

4) Other Applications

If webservices are not possible to be invoked from your BI application, you can still rely on RapidMiner to write its output to databases or files as needed. Your BI Applications can then get the data from these intermediate stores as needed.

Documentation for RapidMiner

by Community Manager on ‎07-13-2016 11:35 AM

Question

Where can one find Installation Guides and system documentation?

Answer

Visit docs.rapidminer.com for installation guides, reference guides, tutorials and more—tools you can use to get started and broaden your knowledge of the RapidMiner platform. 

 

 

 

Documentation is publicly accessible.

 

Beyond the 14 day trial

by RMStaff on ‎06-22-2016 11:47 AM - edited on ‎06-23-2016 07:02 AM by Community Manager

Question

Hi Ingo

 

Thank you for the invitation to the community.

I’m a masters student in Data Analytics at National University of xxxxxxxx and as a part of our course we build and test different prediction models using different data science tools.

I’m a first time user of rapidminer and so far enjoyed working on it. However the trial version of 14 days is too short to get a feel of the product.

 

I would really appreciate if you can provide the tool available to students for free at least for a year so that we could explore the potential of rapidminer.

Answer

Glad to hear that your first experiences have been positive.  Don’t worry about the 14 days trial: you can continue to use after the first 14 days although some features will be limited then.  But there is also an academic program for RapidMiner.  Check it out here:  https://rapidminer.com/academia/

 

Much success with your studies,

Unwanted Columns

by RMStaff on ‎06-22-2016 11:45 AM

Question

I am trying to import a cvs file and what operator do I use to eliminate unwanted columns?

Answer

The data import can be done via two different ways: using the import wizard into the repository (preferred way, check out this: http://docs.rapidminer.com/studio/getting-started/2-importing-data.html) or by using the operator Read CSV (http://docs.rapidminer.com/studio/getting-started/3-data-loading.html).  Both videos show examples for Excel but CSV works in principle in the same way.

 

For removing columns, you have again two options: Select Attributes (http://docs.rapidminer.com/studio/operators/blending/attributes/selection/select_attributes.html) can be used to manually select which columns (called “attributes” in RapidMiner).  You can keep (or remove) single columns or subsets which is defined in the parameters of the operator.  The second option is Remove Useless Attributes (http://docs.rapidminer.com/studio/operators/blending/attributes/selection/remove_useless_attributes....) which automatically removes useless attributes for you, i.e. for example columns which only have one single value.

 

In general the following video series might be helpful: http://docs.rapidminer.com/studio/getting-started/

 

You also should definitely check out the built-in tutorials which can be found under “Questions” in the top right corner of the product.

 

Last but not least: these are the type of questions our user community is more than happy to help you with as well.  You can find it at http://community.rapidminer.com

Embedding R script in RapidMiner

by RMStaff on ‎06-22-2016 11:43 AM

Question

Thank you Dr Ingo,

 

Good day.

I had a go through all the "getting start video", it is very helpful and easy to follow up.

After complete all the basic tutorial, i start to fall in love with Rapidminer studio. 

I am going to re-try all my previous assignments/analysis using RapidMiner to master the skillset on using RapidMiner Studio.

 

However is hard to find resourceful video tutorial on RapidMiner with R extension. 

While the documentation in RapidMiner with R extension also very limited. 

I will try more active in the community on this R extension related topic discussion.

Answer

I am glad to hear that you make some good progress with RapidMiner and that you like it Smiley Happy

 

I think the following video from one of our recent office hours might be helpful for you:  https://www.youtube.com/watch?v=95v4clxg9mE

 

It covers the embedding of R scripting into RapidMiner processes.  That is probably the best video I am currently aware of on this topic.

 

Hope that helps & that you continue to enjoy RapidMiner,

Ingo

Looking for Government Data Sets

by RMStaff on ‎06-22-2016 11:39 AM

Question

Hi Ingo, i'm  cascading Business Analytics in different universities.

Currently im doing Data Warehousing and Predictive Modeling training in different regions. Can you help

me to find data sets (government agencies) that produce and/or consolidate data with examples and model 

to be run in rapidminer software.

 

Thanks hope to hear from.

Answer

Nice to hear from you and thanks for sharing this with me.  With respect to data sets, those are two good sources:

 

http://www.kdnuggets.com/datasets/index.html

http://mlr.cs.umass.edu/ml/datasets.html

 

We also have built some new sections recently on our community portal:

 

http://community.rapidminer.com/t5/Data-Sources/tkb-p/Data

http://community.rapidminer.com/t5/Networking/ct-p/GenaralS

http://community.rapidminer.com/t5/Examples-Inspiration/ct-p/verticals

 

You might want to post your request there – some other community members might have data and RapidMiner processes they could share with you.

Getting from Analytics to Advanced Analytics

by RMStaff on ‎06-22-2016 11:36 AM

Question

Hi Ingo,

I heard other teams use RapidMiner at my company, so I’m trying it out. In the end I’m trying to up our analytical skills from something like “creating a chart in Tableau and seeing a vague relationship” to statistically significant findings.

Thanks

Answer

This sounds like a good mission to me Smiley Happy   And we built RapidMiner exactly with the idea in mind to empower people to easily (well, relatively easy compared to coding everything in some programming language) find meaningful patterns in their data.  Please make sure to check out our built-in tutorials (under “Questions” in the top right corner) and also our Getting Started video guides at http://docs.rapidminer.com/studio/getting-started/ - this should help to get a better feeling about the software.

 

Since you mentioned Tableau, another helpful resource could be the following Webinar which we did some weeks ago: https://rapidminer.com/resource/datavisualization/.  This one features Qlik instead of Tableau but the main concepts are the same.

 

Let us know if you need any support or check out our community portal at http://community.rapidminer.com if you want to discuss data science topics or use cases with other RapidMiner users.

 

Much success & fun,

Ingo

What happens after the 14 day trial license expires?

by RMStaff on ‎06-22-2016 11:31 AM

Question

Hi Ingo,

Our group director asked us to download your software prior to join our next meeting on June, the 10th.

 

Just a quick question about the license : I understood this is for a free 14 days trial, but I was told there was no license needed…

 

 

Answer

Thanks for writing back.  After the 14-day trial the product continues to work but some of the features which are only available to our commercial customers stop working then.  So you can continue to work with the free, open source version or might decide to become a customer to continue to experience all the awesomeness of the full RapidMiner experience – up to you ;-)

 

Have fun,

Ingo

Patent and Scientific Papers Analysis

by RMStaff on ‎06-22-2016 11:28 AM

Question

Hi!

Thank you. :-)

I downloaded this software for being open source, in contrast with SPSS.

I would like to do Patent and Scientific Papers Analysis with this software, downloading Data from PatStat (EPO Global Patent MySQL Database); and Google Academic (Through Publish and Perish), WOS or Scopus.

I wonder how the Text Mining capabilities of this software are, and if I will be able to do analysis as these:
http://www.vosviewer.com/
http://www.citnetexplorer.nl/

Answer

Thanks for letting me know.  For all types of text analytics, you will need the Text Mining extension for RapidMiner which you can download for free from our Marketplace.  You can find it in the menu “Extensions” – “Marketplace” and type “Text” in the search box (here is also a link directly to our marketplace: https://marketplace.rapidminer.com/UpdateServer/faces/product_details.xhtml?productId=rmx_text).  There are also many more extensions on our Marketplace so make sure that you check them out…

 

There is a community member who created a nice set of tutorials for text analysis with RapidMiner: http://vancouverdata.blogspot.com/2010/11/text-analytics-with-rapidminer-loading.html

 

From the links you posted it looks like you are mainly interested in relationships like who cited whom or what terms are frequently used together etc. – RM can definitely be used to create all the necessary data sets for this.

 

On your two questions:

 

  1. Yes, you can create your own blocks (or “operators” as we call them).  You can find more information on our doc server: http://docs.rapidminer.com/developers/
    In addition to create own extensions in Java, you can also invoke command line calls or embed scripts written in R, Python, or Groovy directly in the process.
  2. No, there is currently no RAM limitation on the free version.

 

Best,

Ingo

 

Missing Values, Inconsistent Values

by RMStaff on ‎06-22-2016 11:21 AM

Question

Hello,

I would like to solve the kaggle 2013 Expedia competition, but unfortunately rapidminer can't handle the dataset.
Any tips how to work this out?

Answer

Nice to hear from you.  I think the problem with this data set is that it uses the word “NULL” to indicate missing values.  I was successfully loading the data with the following two changes:

 

  1. De-select “Skip Comments” (might not be necessary for this data set actually but can cause problems if there are “#” symbols in the data)
  2. When you go through the input wizard, change the type of the column “prop_review_score” to “Polynominal”

 

Reason: RM guesses the type of the columns based on a sample from the data loaded at the beginning of the wizard.  This column is completely numerical in the data from the sample but has the value NULL later on.  This causes the problem since NULL is not a number RM can parse.  You can import the data first (might take 10 or so minutes) and fix it later with the operator “Declare Missing Value”.

 

In case of any problems like that, also feel free to ask our user community .

 

And of course this is exactly what our customer success team is more than happy to help you with ;-)

 

Have fun,

Ingo

Keyword Frequency in Text Mining

by RMStaff on ‎06-22-2016 11:15 AM

Question

Hi dear Dr. Mierswa

I am M.A student and using your software to conducting my thesis. My project is going to search some keywords which has more frequency in some articles that I downloaded. Unfortunately I worked with your software and tried hard but I didn't access to the frequencies I was looking for. If you can help me and guide me how to do such a thing, I will be very grateful.
I already appreciate your help.
I am looking forward to hear from you.
Your sincerely

Answer

Great to hear from you!

 

For all text analytics you will need the Text Mining extension for RapidMiner which you can download for free from our Marketplace.  You can find it in the menu “Extensions” – “Marketplace” and type “Text” in the search box (here is also a link directly to our marketplace: https://marketplace.rapidminer.com/UpdateServer/faces/product_details.xhtml?productId=rmx_text).  There are also many more extensions on our Marketplace so make sure that you check them out…

 

There is a community member who created a nice set of tutorials for text analysis with RapidMiner: http://vancouverdata.blogspot.com/2010/11/text-analytics-with-rapidminer-loading.html

 

In your case it might be sufficient to load in the text, tokenize it, and calculate the term frequencies.  This can all be done with 3 operators in RM.

 

Much success with your thesis,

Ingo

Using sample data sets such as Bloomberg

by RMStaff on ‎06-22-2016 11:10 AM

Question

Hi 

I started RapidMiner for studying data mining at graduate school of logistics.

 

I want you ask how i can get and bring the data from site like bloomberg etc.

and i want get useful raw data like famouse exercise (boston housing, flight delays...)

but when i researched it, it was very difficult to get because there are datas contain just one..two attributes only

so i can't run various modelling. could you let me know how get useful data for exercising?

 

 

have a nice day! i hope you're good day !

Answer

You are welcome – hope you will have a lot of fun using RapidMiner.

 

For Bloomberg and other financial data, one of our community members offers a RapidMiner extension for accessing those data source.  You can find it in the menu “Extensions” – “Marketplace” and type “Finance” in the search box (here is also a link directly to our marketplace: https://marketplace.rapidminer.com/UpdateServer/faces/product_details.xhtml?productId=rmx_quantx1).   There are also many more extensions on our Marketplace so make sure that you check them out…

 

One of the best sources for data sets for trying out things is the UCI Machine Learning repository: http://archive.ics.uci.edu/ml/

 

There are also many data sets delivered in RapidMiner’s Sample repository – try those as well.  I also recommend to run through the tutorials which you find in the “Questions?” button in the top right.  It introduces some of the data sets as well.

Using RapidMiner to teach Data Science

by RMStaff on ‎06-22-2016 11:05 AM

Question

Dr. Mireswa.

 

I have been doing some data analytics work using Python. Recently when I tried using Excel for data analytics, I found it inconvenient. Looking for an alternative, I stumbled upon RapidMiner and want to try it out. I would use it for a little experimenting and teaching data analysis for non-computer science professionals.

 

I plan to start a 120 hours certification course on Data Analytics for people without programming background, using a tool like RapidMiner.

Answer

This is great to hear – thanks for sharing!  If you know Python you might also want to check out our Python extension for RapidMiner.  You can find it in the menu “Extensions” – “Marketplace” and type “Python” in the search box (here is also a link directly to our marketplace: https://marketplace.rapidminer.com/UpdateServer/faces/product_details.xhtml?productId=rmx_python_scr...).  You can then also use Python scripts as part of RM processes.  There are also many more extensions on our Marketplace so make sure that you check them out…

Welcome to the Getting Started Knowledge Board

by RMStaff ‎06-22-2016 10:56 AM - edited ‎06-22-2016 10:58 AM

From tme to time people ask us (well, me) questions about getting started using RapidMiner. 

 

Here is a selection of those questions and our (my) answers. 

 

They are all real, only they have been anonymized. I hope they help you get started faster with RapidMIner.