"FPGrowth in spark and docker"

HelloLolaHelloLola Member Posts: 18 Contributor I
edited June 2019 in Help

Hello :)

 

I created a spark cluster on my pc via docker image, and I want to apply the FPGrowth on my data.

 

Can you give information about how to apply this algorithme from Rapidminer on my spark cluster, without using cloudera?

 

Thank you in advance.

Answers

  • phellingerphellinger Employee, Member Posts: 103 RM Engineering

    Hi,

     

    Spark is only supported when it is running on YARN. Radoop requires a Hadoop cluster that has YARN, HDFS and Hive on it.

     

    But you can start such a cluster via docker similarly, you just need to find a Hadoop docker image that has Hive.

     

    Best,

    Peter

Sign In or Register to comment.