Subject Syllabus
303105361 - Big Data Analytics
Course: BTech Semester: 7
Prerequisite: Database Management System, SQL
Course Objective: Big data analytics is the often complex process of examining big data to uncover information such as hidden
patterns, correlations, market trends, and customer preferences that can help organizations make informed business decisions.
Teaching and Examination Scheme
Teaching Scheme                                    Examination Scheme
Lecture   Tutorial  Lab                    Credit  Internal Marks    External Marks  Total
Hrs/Week  Hrs/Week  Hrs/Week  Hrs/Week             T     CE    P     T       P
3         0         0         0            3       20    20    -     60      -       100
SEE - Semester End Examination, T - Theory, P - Practical
Course Content
W - Weightage (%), T - Teaching hours
Sr. Topics W T
1 Introduction: 20 9
What is in Store?, Classification of Digital Data: Structured, Semi-Structured and Unstructured, Evolution of Big Data,
Definition of Big Data: Volume, Velocity, Variety, Challenges of Big Data, Why Big Data?, Traditional Business
Intelligence (BI) versus Big Data, Industry Examples of Big Data, What is Big Data Analytics?, Data Science
2 NoSQL Data Management: 20 9
Introduction to NoSQL, Types of NoSQL, Why NoSQL?, Advantages of NoSQL, Comparison of SQL, NoSQL and
NewSQL, aggregates, key-value and document data models, graph databases, MapReduce, partitioning and
combining
3 Basics of Hadoop: 40 18
What is Hadoop?, Brief History of Hadoop, Why Hadoop?, RDBMS versus Hadoop, Hadoop Components, High
Level Architecture of Hadoop, Key Advantages & Features of Hadoop, Data Format, Hadoop Distributed File System
(HDFS), Processing Data with Hadoop. MapReduce Interface: Overview of MapReduce, MapReduce workflows,
anatomy of a MapReduce job run, shuffle and sort, task execution, input formats, output formats.
4 Hadoop Related Tools: 20 9
Overview of HBase, Pig introduction, Pig data model, Hive, data types and file formats, HiveQL data definition,
HiveQL data manipulation, HiveQL queries, Pig Latin Overview, Pig versus Hive, Using JSON, Overview of Cassandra,
Jasper Reports.
Reference Books
1. Hadoop: The Definitive Guide, Third Edition (TextBook)
By Tom White | O'Reilly
2. Understanding Big Data
By Chris Eaton, Dirk deRoos et al. | McGraw Hill, Pub. Year 2012
3. Hadoop Operations
By Eric Sammer | O'Reilly, Pub. Year 2012
4. Big Data Analytics with R and Hadoop
By Vignesh Prajapati | SPD
5. Big Data and Analytics
By Seema Acharya and Subhashini C | Wiley India
6. Programming Hive
By E. Capriolo, D. Wampler, and J. Rutherglen | O'Reilly
7. MongoDB in Action
By Kyle Banker, Peter Bakkum, Shaun Verch | Dreamtech Press
8. HBase: The Definitive Guide
By Lars George | O'Reilly