Hadoop Programming with Scalding
Scalding is a Scala-language API for writing advanced data workflows for Hadoop. Unlike low-level APIs, it provides intuitive pipes and filters
idioms, while hiding the complexities of MapReduce programming. Scalding wraps the Java-based Cascading framework in Functional
Programming concepts that are ideal for data problems, especially the mathematical algorithms for machine learning. Scalding code is very
concise compared to comparable Java code in Cascading or the low-level Hadoop API, providing far greater productive. Even non-developer
data analysts could learn Scalding.
In this hands-on tutorial aimed at advanced Java developers and data scientists, we will work through exercises that demonstrate these points. You will see that Scalding is an ideal tool when a full-featured and flexible toolset for Big Data applications is needed beyond what Hive or Pig can provide.
Level : Advanced