Due to recent updates, all users are required to create an Altair One account to login to the RapidMiner community. Click the Register button to create your account using the same email that you have previously used to login to the RapidMiner community. This will ensure that any previously created content will be synced to your Altair One account. Once you login, you will be asked to provide a username that identifies you to other Community users. Email us at Community with questions.
Radoop Open File Operator
Jugi
RapidMiner Certified Analyst, Member Posts: 12 Contributor II
It would be handsome to have an operator that can read files from HDFS without the definition of a schema in hive.
It should then provide the file as the Open File operator does for local files, URL and Repository Blob Entries.
HDFS security features like user and kerberos should be used in this new operator.
One application would be the processing of XML or JSON files from a cluster.
This would be usefull for the process pushdown because various file types could be processed inside the cluster.
Tagged:
9
Comments
Not just simple JSON or XML, but also for image files.
Hi,
I agree, that would make RM much more useful and a real centerpiece in the architecture. Of course a Write File (HDFS) operator should be added, too
Greetings,
Sebastian