The digital revolution has radically transformed communication media. Numerous types of data are now available to cater to the ever-increasing demands of users, and such data is applied across many domains. Traditional data processing systems were unable to store and process such huge, distributed, and multidimensional data, which led to the emergence of a new field called Big Data. This field comprises a set of tools to store, process, and visualize big data. This course explains the Big Data framework, from its basics to advanced data analytics methods.
Unit I explains the need for a Big Data platform, the challenges in conventional data processing systems, and the methods to overcome them. It also details the various types of data available and the tools to manipulate them. Unit II introduces the concept of streams, in which data is generated continuously for industry applications. It explains how to sample and filter the needed data to generate meaningful information using Real Time Analytics Platform applications.
Unit III presents the history of Hadoop and its building blocks. It also details the MapReduce algorithm along with its anatomy and features, and covers how jobs are scheduled and tasks are executed during MapReduce. Unit IV introduces additional Big Data tools such as Pig, Hive, HBase, ZooKeeper, InfoSphere BigInsights, and Streams. Unit V demonstrates predictive analytics and visualization using Big Data. It explains the structure of simple linear regression and multiple linear regression. This unit finally highlights various visual data analytics and interaction techniques along with sample applications.
The course will help learners understand the basics of Big Data, the tools currently available, and the methods to visualize data in order to generate useful information for business-critical decisions. Students will be able to comprehend the different types of data available and use them effectively to formulate strategic initiatives.