How to Build Enterprise Data Apps with Cascading
Cascading is a popular Java-based application development framework for building Big Data applications on Apache Hadoop. This open source, enterprise development framework allows developers to leverage their existing skill sets such as Java and SQL to create enterprise-grade applications without having to think in MapReduce. This comprehensive framework separates business logic from integration logic so that developers can quickly build and test data applications locally on their laptop and then deploy them on Hadoop. While typical enterprise data applications must cross through multiple departments and frameworks, Cascading allows multiple departments to seamlessly integrate their application components into one single data processing application.
In this class, you will get a brief introduction to Cascading, see how it works, and then dive into building applications with Cascading. You will discover what types of use cases exist for data-driven businesses and how to approach them with Cascading and its vast ecosystem (Lingual, Pattern, Scalding, Cascalog, etc.). You will also learn how to get started in building enterprise-grade applications with Cascading, all without having to think in MapReduce. You will be guided through the Cascading framework, seeing code, examples, and best practices for Cascading application development.
This class is highly beneficial for application developers and data scientists looking to build data-oriented applications on top of Hadoop. Because Cascading is a Java library, Java skills will be helpful in understanding the code and examples. Come and see how companies like Twitter, eBay, Etsy, and other data-driven companies are taking advantage of the Cascading framework and how Cascading is changing the business of Big Data in the enterprise!
Level : Advanced