Hadoop Programming with Scalding
Scalding is a Scala API for writing advanced data workflows for Hadoop. Unlike low-level APIs, it provides intuitive pipes and filters idioms, while hiding the complexities of MapReduce programming. Scalding wraps the Java-based Cascading framework in Functional Programming concepts that are ideal for data problems, especially the mathematical algorithms for machine learning.
This class will use examples to demonstrate these points. Developers will see that Scalding is an ideal tool when they need a more full-featured and flexible tool set for Big Data applications, beyond what Hive or Pig can provide.
Level : Intermediate