IMPALA
Impala is an open-source and native analytics database for Hadoop. It is a Massive Parallel Processing (MPP) SQL query engine that processes massive amounts of data stored in a Hadoop cluster. Impala provides high performance and low latency compared to other SQL engines for Apache Hadoop, such as Hive.mpala provides support for: Most common SQL-92 features of Hive Query Language (HiveQL) including SELECT, joins, and aggregate functions. HDFS, HBase, and Amazon Simple Storage System (S3) storage, including: HDFS file formats: delimited text files, Parquet, Avro, SequenceFile, and RCFile.