Intro to Apache Hive and Impala
Mark Grover
This class gives an introduction to Apache Hive and Impala, both of which are open source SQL engines for Hadoop. Both of these engines enable Hadoop users to move away from writing MapReduce jobs and instead write familiar SQL like queries to query data in Hadoop. We will go into architectural details and use cases for both of the engines. We will also detail which use cases are suited for which SQL engine and conclude with a feature and performance comparison of the two engines.

Level : Overview