RapidMiner

RapidMiner Hadoop Configuration For apache distribution

Contributor

Hi All,

I am new to RapidMiner. I have successfully installed and configured RapidMiner 7.3 and it works fine. The problem is that when I try to configure the Hadoop connection, it gives the error below:

[Nov 23, 2016 12:39:42 PM] SEVERE: java.util.concurrent.TimeoutException
[Nov 23, 2016 12:39:42 PM] SEVERE: Hive server 2 connection test timed out. Please check that the server/daemon runs and is accessible on the address and port you specified.
[Nov 23, 2016 12:39:42 PM] SEVERE: Test failed: Hive connection
[Nov 23, 2016 12:39:42 PM] SEVERE: Connection test for 'Hadoop' failed.

 

My Hadoop distribution is Apache,

my Hive version is 1.2.1,

and all my ports are the default ports. If anybody knows what is wrong, please help me.

 

Thanks in advance,

Praveen G

3 REPLIES
RMStaff

Re: RapidMiner Hadoop Configuration For apache distribution

Hi Praveen,

 

It is difficult to pinpoint the problem from this information alone, but here are a couple of hints:

The error simply states that there was no response within the time limit from the HiveServer2 instance (specified by either the Master Address or the Hive Server Address field, together with the Hive Port).

I would try the following:

  • Check the Hive log on the cluster. Does the SHOW TABLES command that the test sends appear in the log? (It can take several seconds on the first try.) If it does, Hive is reachable, but it may be taking longer to respond than the timeout allows.
  • If the log shows that the command was sent to Hive, then you can increase the timeout in Studio: go to Preferences -> Radoop, and increase the Connection timeout and Hive command timeout values to, let's say, 30. (These timeouts are used for detecting connection problems.)
  • If there is nothing in the log, then I would make sure that the specified address and port can be reached from the machine that runs Studio. If that works, I would check the health of Hive on the cluster from Beeline, for example.
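The reachability check in the last point can be sketched in a few lines of Python, run on the machine that hosts Studio. This is only a rough sketch: the function name can_reach is made up for illustration, and the port 10000 mentioned in the comment is the HiveServer2 default, so substitute your own Hive Server Address and Hive Port.

```python
import socket


def can_reach(host: str, port: int, timeout_s: float = 5.0) -> bool:
    """Return True if a TCP connection to host:port opens within timeout_s.

    can_reach is a hypothetical helper name, not part of Studio or Hive.
    """
    try:
        # create_connection resolves the host name and attempts a TCP connect.
        with socket.create_connection((host, port), timeout=timeout_s):
            return True
    except OSError:
        # Covers refused connections, timeouts and DNS failures alike.
        return False


# Example call (assumed values): can_reach("localhost", 10000)
# 10000 is the default HiveServer2 port; use the values from your connection.
```

A True result only shows that something is listening on that port. It does not prove that HiveServer2 answers queries, which is why the Beeline check is still worth doing.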

 

Best,

Peter

Contributor

Re: RapidMiner Hadoop Configuration For apache distribution

Hi Peter,

 

Thanks for the reply.

I have verified the Hive cluster through Beeline and it is working fine. Now, when I try to connect, I get the error below.

I have attached screenshots of Hive working in Beeline and of the Hadoop connection settings in RapidMiner.

If you know what is wrong, please help me out. Anyway, thanks for your reply.

Regards,

Praveen 

 

 

[Nov 25, 2016 9:43:25 AM] SEVERE: java.lang.RuntimeException: java.lang.NoClassDefFoundError: org/apache/hadoop/hive/metastore/api/MetaException
[Nov 25, 2016 9:43:25 AM] SEVERE: Hive server 2 connection test failed. Please check that the server/daemon runs and is accessible on the address and port you specified.
[Nov 25, 2016 9:43:25 AM] SEVERE: Test failed: Hive connection
[Nov 25, 2016 9:43:25 AM] SEVERE: Connection test for 'Hadoop' failed.

Attachments: Hive Using Beeline.PNG, RapidMiner Hadoop settings.PNG

RMStaff

Re: RapidMiner Hadoop Configuration For apache distribution

Hi Praveen,

 

The screenshots help.

 

First, the JDBC URL Postfix field is only there for an additional, custom postfix (the URL itself is constructed automatically), so it should be empty in your case. That alone could solve the connection problem.
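To illustrate why the postfix should normally stay empty, here is a rough sketch of the general shape of a HiveServer2 JDBC URL. This is not the code Studio actually uses: the function hive2_jdbc_url, the "default" database and the way the postfix is joined with a semicolon are assumptions made for the example.

```python
def hive2_jdbc_url(host: str, port: int = 10000,
                   database: str = "default", postfix: str = "") -> str:
    """Build a HiveServer2-style JDBC URL (illustrative helper, hypothetical name)."""
    # Host, port and database already form a complete URL on their own.
    url = f"jdbc:hive2://{host}:{port}/{database}"
    if postfix:
        # Assumption: extra settings are appended after a semicolon.
        url += ";" + postfix.lstrip(";")
    return url


print(hive2_jdbc_url("localhost"))
```

With an empty postfix the URL is already complete. Anything typed into the postfix field ends up as extra text on that URL, so a value that does not belong there can break the connection.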

 

However, the java.lang.NoClassDefFoundError suggests that temporary files or folders of Studio (usually under /tmp/ on Linux) may have been deleted after the software was started. Is that possible? If you keep getting this error, restarting Studio should help.

 

The address in the connection is localhost. Does that mean that you are running Studio on the master node? (Beeline, of course, runs on the Hadoop node, but Studio may not.)

I would also make sure that 54310 is the correct port for the NameNode. If you navigate to localhost:50070, is that the port that the Overview page shows?
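Besides the Overview page, the NameNode address can be read from the cluster's core-site.xml, where it is stored under fs.defaultFS (or the older, deprecated name fs.default.name). A minimal sketch, assuming you paste or load the file contents into a string; the function name namenode_endpoint and the sample XML are made up for the example:

```python
import xml.etree.ElementTree as ET
from urllib.parse import urlparse


def namenode_endpoint(core_site_xml: str):
    """Return (host, port) from fs.defaultFS, or None if the property is missing.

    namenode_endpoint is a hypothetical helper name used for illustration.
    """
    root = ET.fromstring(core_site_xml)
    # Collect all <property><name>..</name><value>..</value></property> pairs.
    props = {p.findtext("name"): p.findtext("value")
             for p in root.iter("property")}
    # fs.default.name is the deprecated predecessor of fs.defaultFS.
    uri = props.get("fs.defaultFS") or props.get("fs.default.name")
    if uri is None:
        return None
    parsed = urlparse(uri.strip())
    return parsed.hostname, parsed.port


# Sample content only; your real core-site.xml will differ.
sample = """
<configuration>
  <property>
    <name>fs.defaultFS</name>
    <value>hdfs://localhost:54310</value>
  </property>
</configuration>
"""
print(namenode_endpoint(sample))
```

The port returned here should match the NameNode Port entered in the connection settings.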

 

Best,

Peter