COURSE PLAN
Regulation –2021
Subject Code : CCS334 Degree : [Link]
Subject Name : BIG DATA ANALYTICS Year/Sem/Sec : III Year / VII Sem
Credits : 4 credits Academic Year : 2025-26 ODD
Faculty : [Link] Total No. of Hours given in Syllabus
Sem. : ODD Lecture: 30 Tutorials: - Practical: 30 Total: 60
Department : [Link]
COURSE OBJECTIVES:
The student should be made to:
To understand big data.
To learn and use NoSQL big data management.
To learn MapReduce analytics using Hadoop and related tools.
To work with MapReduce applications.
To understand the usage of Hadoop-related tools for big data analytics.
Hour | Topic | Text/Reference Books | Text/Ref Book Page Nos. | Course Outcome | Teaching Methodology | Date
UNIT I UNDERSTANDING BIG DATA
1 Introduction to big data E-Source CO1 Black Board
2 To explain the convergence of key trends E-Source CO1 Black Board
3 To describe unstructured, semi-structured, and structured data E-Source CO1 Black Board
4 To explain industry examples of big data E-Source CO1 Group Discussion
5 To classify web analytics and technologies E-Source CO1 PPT
6 Big data applications and introduction to Hadoop E-Source CO1 PPT
7 Open source technologies with examples E-Source CO1 Black Board
8 How cloud and big data are utilized; mobile business intelligence E-Source CO1 Black Board
9 How to implement crowdsourcing analytics and inter- and trans-firewall analytics E-Source CO1 Black Board
UNIT II NOSQL DATA MANAGEMENT
10 To introduce NoSQL and explain aggregate data models, key-value and document data models E-Source CO2 Black Board
11 To explain relationships for data models, define graph databases, and classify schemaless databases E-Source CO2 PPT
12 To explain the concept of materialized views and the distribution models E-Source CO2 PPT
13 To explain master-slave replication and consistency E-Source CO2 Black Board
14 To define Cassandra, create a Cassandra data model, work through Cassandra examples, and access Cassandra clients E-Source CO2 Black Board
UNIT III MAP REDUCE APPLICATIONS
15 To explain the concept of MapReduce workflows E-Source CO3 Black Board
16 To formulate unit tests with MRUnit, test data, and local tests E-Source CO3 PPT
17 To express the basic anatomy of a MapReduce job run and classic MapReduce E-Source CO3 PPT
18 To introduce YARN and to examine failures in classic MapReduce and YARN E-Source CO3 Black Board
19 How to achieve job scheduling, shuffle and sort, and task execution E-Source CO3 Black Board
20 MapReduce types and the various input formats and output formats E-Source CO3 PPT
21 Scaling out, Hadoop streaming, and Hadoop pipes E-Source CO3 PPT
22 To explain the design of the Hadoop distributed file system (HDFS) and how to implement HDFS concepts in Hadoop E-Source CO3 PPT
UNIT IV BASICS OF HADOOP
23 To explain the basics of data formats in Hadoop and how to analyze data with Hadoop E-Source CO4 Black Board
24 How to implement the scaling-out technique, Hadoop streaming, and Hadoop pipes E-Source CO4 PPT
25 How to create Java interface programs, integrate them with data flow, and implement Hadoop I/O operations E-Source CO4 Seminar
26 How to implement data compression and serialization E-Source CO4 Group Discussion
27 To introduce Avro and file-based data structures E-Source CO4 Black Board
28 How to implement the Cassandra-Hadoop integration with examples E-Source CO4 Seminar
UNIT V HADOOP RELATED TOOLS
29 To explain the HBase data model and implementations E-Source CO5 PPT
38 To explain implementations of HBase clients, HBase examples, and praxis; Pig and Grunt E-Source CO5 PPT
39 To explain the Pig data model, Pig Latin, and developing and testing Pig Latin scripts E-Source CO5 PPT
40 To explain Hive data types and file formats, HiveQL data definition, and HiveQL data manipulation E-Source CO5 Seminar
41 To explain the concept of HiveQL queries E-Source CO5 Group Discussion
CONTENT BEYOND SYLLABUS:
S. No Topic Mode Date
1. To explain the concept of various cloud architectures and distributed clusters PPT
2. To explain the MongoDB architecture and its working principles PPT
3. How data can be stored at various data centres in data mining PPT
COURSE OUTCOMES: After the completion of this course, students will be able to:
CO1: Describe big data and use cases from selected business domains.
CO2: Explain NoSQL big data management.
CO3: Install, configure, and run Hadoop and HDFS.
CO4: Perform map-reduce analytics using Hadoop.
CO5: Use Hadoop-related tools such as HBase, Cassandra, Pig, and Hive for big data analytics.
CO PO Correlation:
COs PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2 PSO3
CO1 3 3 3 3 3 3 2 2 3 2 3 3 3 3 1
CO2 2 2 3 2 3 2 1 3 3 3 3 3 3 2 1
CO3 3 2 3 3 3 2 3 3 3 3 3 3 3 2 1
CO3 3 3 3 2 3 3 2 2 3 2 3 3 3 2 3
CO4 2 3 3 2 3 3 2 2 1 3 2 2 3 3 3
CO5 3 2 3 3 3 3 2 1 3 3 3 2 1 3 1
1 - Low, 2 - Medium, 3 - High, '-' - No correlation
REFERENCES:
1. Gerson and Gerson - Technical Communication: Process and Product, 7th Edition, Prentice Hall (2012)
2. Virendra K. Pamecha - Guide to Project Reports, Project Appraisals and Project Finance (2012)
3. Daniel Riordan - Technical Report Writing Today (1998)
4. Darla-Jean Weatherford - Technical Writing for Engineering Professionals, Penwell Publishers (2016)
FACULTY HOD IQAC PRINCIPAL