RapidMiner

Moderator ‎04-24-2017 08:50 AM
346 Views
0 Comments

30 Years of Pain?

So will there be 30 years of pain? Maybe. Will automation kill a lot of jobs, probably. Will the smart companies keep highly skilled workers in the next 30 years? Yes. In this new knowledge economy, brain power is key.

Read more...

Moderator ‎04-20-2017 08:35 AM
226 Views
0 Comments

The Sudden Interest in Data Science Platforms

For years we've hearing how Big Data will unlock all kinds of insights in a corporation's data. Everyone raced to stand up clusters, jam all kinds of data into them, and then stumble when extracting insight. The cluster became hard to tame, hard to use, and seemed like a big waste of money.

Read more...

Moderator ‎04-17-2017 08:08 AM
229 Views
0 Comments

Let's talk Marketplace Extensions

One of the cool things about RapidMiner is the extension ecoystem. The default installation of RapidMiner Studio has a complete suite to do 90% of any ETL, Modeling, and Testing that you need to do on a daily basis. Sometimes you'll need that extra 10% to do something special, like Text Mining!

Read more...

Moderator ‎04-13-2017 08:02 AM
381 Views
0 Comments

Write for Us program at the Community!

We're getting ready unveil and new program at the RapidMiner Community where our members can take part in the growth and influence this place has. We'll be rolling out in phases a new "Write for Us" program where Community members can submit guest blog articles, knowledge base articles, and building blocks for cash and swag.

Read more...

Moderator ‎04-10-2017 12:42 PM
1898 Views
0 Comments

Great News! RapidMiner has a new Data Core!

RapidMiner’s new data core is a big thing. It improves data and memory management and it allows you to work with much bigger data sets keeping your memory demand at bay.

 

 

Read more...

Moderator ‎04-06-2017 09:50 AM
252 Views
0 Comments

Data Science Link Roundup

Greetings Community! Here's a quick interesting link roundup for your Data Science needs!

Read more...

Moderator ‎04-03-2017 08:00 AM
652 Views
0 Comments

PDF Table Extraction Extension Released!

In my last post, I introduced the ‘Web Table Extraction’ extension, which provides a convenient way to retrieve data tables from Wiki-like HTML pages. In this post, I will introduce you to the ‘PDF Table Extraction’ - another extension developed at RapidMiner Research, as part of the Data Search for Data Mining (DS4DM project, http://ds4dm.de) and released today. So let us see how this extension adds value to RapidMiner processes.

Read more...

Moderator ‎03-30-2017 08:11 AM
1092 Views
4 Comments

The Web Table Extraction Operator

Data scientists are often confronted with a situation where data must be read from web pages. For instance, there are a lot of data tables available on Wikipedia, which can be utilized but the fine-grained data scraping approaches get complicated for ordinary users as they often require regular expressions based parsing and extraction of data from a web page’s content.

Read more...

‎03-29-2017 03:39 PM
954 Views
4 Comments

Advanced Reporting Extension published by Old World Computing

Today Old World Computing is happy to announce the Advanced Reporting Extension for RapidMiner. With it's three operators, it looks tiny in comparison to some of the more bulky extensions out there, but it adds operators in a blind spot of RapidMiner and is designed to take away some worries from the common data scientist.

The idea is to use the capabilities of RapidMiner to automate any regular reporting task that results in an Excel sheet. There have been many projects and data science departments that simply drown in these kind of request, consuming all resource before you can get to the really fun part of data science. Now you can simply start at the beginning to create a nearly zero overhead reporting, even if you don't have or can't use real business intelligence tools like tableau or qlik.

Read more...

Moderator ‎03-28-2017 09:19 AM
310 Views
0 Comments

New Online Training Dates Scheduled

This Spring, join us for the new online training season. We are introducing new options for our Data Science courses RapidMiner Basics Part 1 and RapidMiner Basics Part 2. For the first time, you can also enhance your data science skills in Text and Web mining with RapidMiner online. 

Read more...

Moderator ‎03-27-2017 07:43 AM
458 Views
0 Comments

Operator Toolbox 0.2.0 and Converters 0.2.0 are out!

A few weeks ago the RapidMiner Research Team published two new extensions to the Marketplace that are making a splash, the Operator Toolbox and Converters! We didn't stop there! Today I'm happy to announce the release of the version 0.2.0 for both extensions!  Here's a quick preview of the new enhancements you'll find!

Read more...

Moderator ‎03-23-2017 08:40 AM
223 Views
0 Comments

Data Science Link Roundup

Just some interesting community and data science links I've come across this week. Enjoy!

Read more...

Moderator ‎03-17-2017 09:13 AM
350 Views
0 Comments

RapidMiner at the General Online Research conference 2017 in Berlin

The General Online Research Conference is annually organized by the German Society for Online Research in cooperation with a local partner. In 2017 the GOR conference will take place in Berlin, Germany, with the HTW Hochschule für Technik und Wirtschaft Berlin/University of Applied Sciences being the local organizer.

Read more...

RMStaff ‎03-14-2017 05:38 PM
239 Views
0 Comments

We are looking for YOU!!!

This is Ingo, the founder of RapidMiner.  Today we are here to look for a new team member for our Boston office.  Are you a data scientist with some experience in RapidMiner?  Then the following might be of interest to you!

Read more...

Moderator ‎03-13-2017 08:13 AM
704 Views
0 Comments

Gartner Data Analytics Summit 2017

Last week I attended the Gartner Data Analytics (DA) summit in Grapevine Texas. It was quite an event, filled with great exhibition booths and presentations. I did booth duty, along with my colleagues, but managed to attend a few great presentations.

Read more...

Moderator ‎03-06-2017 08:30 AM
1077 Views
0 Comments

Exploring Model Management in RapidMiner

I recently reached a point in my daily work for the PRESED project (Predictive Sensor Data mining for Product Quality Improvement, www.presed.eu) within the funded R&D Research Team at RapidMiner, which probably sounds familiar for many data scientists.

Read more...

Moderator ‎02-27-2017 09:32 AM
1103 Views
3 Comments

Introducing the Operator Toolbox Extension

As RapidMiner users we are used to one operator solutions. Want to add a PCA? Add the operator. Want to do an ensemble? Add the operator. Over time the RapidMiner ecosystem evolved in a way that most tasks are easy to handle like this. However, doing data science every day, I experienced a few things where RapidMiner has no one operator solution. How do we solve that?

Read more...

Moderator ‎02-20-2017 08:31 AM
1267 Views
0 Comments

Spark RM - What is it?

SparkRM is a new Radoop operator - but not just any new operator to be added to the 70+ collection that the Radoop extension includes - it’s an operator that opens a wealth of new use cases for exploiting and analyzing Hadoop data with RapidMiner.

Read more...

stevefarr ‎09-06-2016 11:34 AM
392 Views
0 Comments

Goats are Monogomous

IMG_8990.JPG

 

Getting those data scientists young. 

Read more...

RMStaff ‎07-28-2016 03:03 PM
552 Views
0 Comments

Gradient Boosted Trees? Deep Learning? In less than 5 minutes? You Bet!

As most of you are already aware, RapidMiner is a kick-ass platform offering pretty much everything you need for doing data science in a very efficient way.  But what you don’t know is that …

 

RapidMiner Studio just got even more awesome!

 

Wait… is this even possible?  Well, it was no easy task – but we have done it: Introducing RapidMiner Studio 7.2. Let’s take a look at some of the new features.

Read more...

RMStaff ‎07-28-2016 04:58 AM
722 Views
1 Comment

How to Filter Attributes Based on the Minimum Value

For my recent blog post i needed to filter out all attributes having at least one value above a threshold. Traditionally i did this with Transpose, Filter Examples, Transpose again.

I realized that there is a way nicer way which i would like to share with you.

Read more...

stevefarr ‎07-27-2016 06:36 AM
485 Views
0 Comments

Some Changes to the Community

We have moved some things - to help you!

 

rapidminer_logoC5_RGB_v1.png

Read more...

stevefarr ‎07-19-2016 08:47 AM
292 Views
0 Comments

Feedback on Marketplace Extensions

What is the best way of giving feedback on extensions that you use? If you are a developer, would you like a more formal feedback mechanism. Jan Czogalla @jczogalla at Dortmund Uni wants your feedback on his two entensions.

Extensions.PNG

 

Read more...

RMStaff ‎07-04-2016 04:35 AM
573 Views
0 Comments

RapidMiner at the Data Science Meetup in Bonn

Join us at the Bonn Data Science Meetup on the  14th of July.

 

meetup_logo.jpg

Read more...

stevefarr ‎06-28-2016 09:13 AM
551 Views
0 Comments

Advanced and Generic Joins

Our data scientist Balázs describes an extension of the join concept. The built-in Join operator only supports equality comparisons, but some problems can be better solved with different kinds of operations, just like in SQL.

Some joins are not straightforewardSome joins are not straightforeward

Read more...

stevefarr ‎05-27-2016 08:52 AM
1336 Views
0 Comments

Monte Carlo or Bust

Monte Carlo simulation in a casino?

 

th.jpg

Read more...

RMStaff ‎05-26-2016 04:10 PM
951 Views
0 Comments

Incognito Career Day - Maastricht

Eight rounds of speed-dating where each company had five minutes to talk to a group of students.

maas.jpg

Read more...

RMStaff ‎05-26-2016 01:15 PM
1233 Views
0 Comments

Creative Misuse of RapidMiner

Using RapidMiner, that serious data science tool to Create the Lyrics of "99 Bottles of Beer on the Wall"

bb.jpg

Read more...

LouJordano ‎05-26-2016 01:03 PM
1002 Views
0 Comments

What You Don’t Know About Your Data Can Jeopardize Analysis

 

Data analyses and predictions are reliable only to the extent that the data being analyzed is understood and of acceptable quality.

loupic.jpg

Read more...