Map/Reduce Tips and Tricks Has code image
Boris Lublinsky
This class will start with a short Map/Reduce architectural refresher, showing how it executes in Hadoop and what the main Map/Reduce components and classes are, which can be used for customizing an execution. We will then describe the most common possible Map/Reduce customizations and the reasons for their implementation.

The majority of time will be dedicated to going through the code examples, showing how to design and implement custom input/output formats, readers and writers, and partitioners. We will also show what to expect out of any customization.

Time permitting, we will also talk about high-level Map/Reduce frameworks, namely Apache Crunch.

Level : Intermediate