The Altair Community is migrating to a new platform to provide a better experience for you. The RapidMiner Community will merge with the Altair Community at the same time. In preparation for the migration, both communities are on read-only mode from July 15th - July 24th, 2024. Technical support via cases will continue to work as is. For any urgent requests from Students/Faculty members, please submit the form linked here.
Options

How can I set tblproperties on hive table?

cesar_ortizcesar_ortiz Member Posts: 10 Contributor I
I want to create a hive table in PARQUET format but also compressed with SNAPPY. Can't find the way with the Store in Hive operator.
Any help would be appreciated 
Tagged:

Answers

  • Options
    Pavithra_RaoPavithra_Rao Administrator, Moderator, Employee, RapidMiner Certified Analyst, RapidMiner Certified Expert, Member Posts: 123 RM Data Scientist
    @cesar_ortiz
    Store in Hive has a parameter 'custom storage' If selected, a new parameter called 'impala file format' there you could select 'PARQUET '.

    Hope this helps.
  • Options
    cesar_ortizcesar_ortiz Member Posts: 10 Contributor I
    @Pavithra_Rao
    The parameter 'hive file format' also has the option for PARQUET, ORC, AVRO,… but I wanted to change the compression algorithm.

    After experimenting with hive CLI, I realized the Store in Hive operator creates the table in Parquet with the default compression algorithm, in this case, Snappy. Good enough for the moment but would be nice to have the option to change to gzip or lzo.

    Thanks for your help
Sign In or Register to comment.