hive-advantages

Hive Advantages
Hive Advantages



  • Hive is Data Warehousing Tool built on Hadoop Ecosystem which is Highly scalable as it uses HDFS for storage and Map Reduce for processing
  • Useful for SQL scripting rather than programming approach, Provides great support for Analysts to run Adhoc queries, Data analysis & summarization
  • No need to write complex Map-Reduce jobs for processing & extracting data, HQL queries internally transforms to Map-Reduce jobs
  • No Programming expertise needed
  • Supports external tables which make it possible to process data without actually storing in HDFS
  • Hive internally contains optimizer for logical plans optimizations to improve performance
  • External Metastore makes lookup very easy and stable
  • External tables avoids data loss if metastore was dropped mistakenly
  • Hive process both Structured and Semi-Structered data
  • Hive supports many compression codecs which optimises storage and network bandwidth
  • Bucketing in Hive allows us quick sampling and reporting
  • Complex HQL functions are available (Eg. Windowing functions)
  • Hive allows us to incorporate custom logic as UDFs
  • Hive uses Swift processing for speed up of query execution

Comments

Popular posts from this blog

hadoop-installation-ubuntu

jenv-tool

hive-installation-in-ubuntu