Application Architectures with Hadoop: Putting the Pieces Together Through Example
Mark Grover and Jonathan Seidman
Many companies are exploring how Hadoop can optimize their data management and processing and help them cope with ever-growing data volumes. Although there is plenty of material on how to use Hadoop and the related components in its ecosystem, there is little information on how to tie those components together into complete applications, or on how to integrate Hadoop with existing data management systems, which is a requirement for making Hadoop truly useful in enterprise environments.

This class will help developers and architects who are already familiar with Hadoop put the pieces together by walking through a real-world example architecture: an end-to-end application implemented with Hadoop and components in the Hadoop ecosystem. The example will illustrate important topics such as:
  • Modeling data in Hadoop and selecting optimal storage formats for that data
  • Moving data between Hadoop and external systems such as relational databases and logs
  • Accessing and processing data in Hadoop
  • Orchestrating and scheduling workflows on Hadoop
Throughout the example, we will cover best practices and considerations for architecting applications on Hadoop. This class will be valuable for developers who already know Hadoop and now want more insight into how to use it to implement real-world applications.

Note: You will need an understanding of Hadoop concepts and the components in the Hadoop ecosystem, an understanding of traditional data management systems (e.g., relational databases), and knowledge of programming languages and concepts.

Level: Advanced