It would be handsome to have an operator that can read files from HDFS without the definition of a schema in hive.
It should then provide the file as the Open File operator does for local files, URL and Repository Blob Entries.
HDFS security features like user and kerberos should be used in this new operator.
One application would be the processing of XML or JSON files from a cluster.
This would be usefull for the process pushdown because various file types could be processed inside the cluster.
... View more