Boris Lublinsky and Michael Segel
We will start with a short introduction of different approaches to using Hadoop in the real-time environment, including real-time queries, streaming and real-time data processing and delivery. Then we'll describe the most common use cases for real-time queries and products implementing these capabilities. We will also describe the role of streaming, common use cases, and products in the space.
The majority of time will be dedicated to the usage of HBase as a foundation for the real-time data process. We describe several architectures for such implementation, and a high-level design and implementation for two examples: system for storing and retrieving images, and using HBase as a back end for Lucene.
Level : Intermediate