0% found this document useful (0 votes)
32 views4 pages

Big Data Analytics Course Overview

The document outlines a Big Data Analytics course with a focus on understanding big data platforms, managing big data using NoSQL and Hadoop, and applying analytics methods. It includes course objectives, outcomes, and a detailed module breakdown covering topics such as Hadoop, Map Reduce, MongoDB, and Spark. The course is designed for students to develop practical skills in big data analytics and is supported by specified textbooks and reference materials.

Uploaded by

darkaspector.ukn
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
32 views4 pages

Big Data Analytics Course Overview

The document outlines a Big Data Analytics course with a focus on understanding big data platforms, managing big data using NoSQL and Hadoop, and applying analytics methods. It includes course objectives, outcomes, and a detailed module breakdown covering topics such as Hadoop, Map Reduce, MongoDB, and Spark. The course is designed for students to develop practical skills in big data analytics and is supported by specified textbooks and reference materials.

Uploaded by

darkaspector.ukn
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

BIG DATA ANALYTICS

Course code: 22IS61 Credits: 03


Prerequisites: Machine Learning ,Operating System CIE Marks: 50
Teaching Hours / Week (L : T : P) [Link] SEE Marks: 50

Total Hours: Exam Hours: 03


40

Course Objectives:
1. Understand the big data platform and its use cases.
2. Explore the techniques of managing big data using NoSQL, Hadoop.
3. Use ETL tools for developing business case studies in big data analytics.
4. Develop the process of map-reduce analytics using Hadoop and related tools.

Course Outcomes: At the end of the course, student will be able to:
CO1 Apply Big Data concepts, tools and applications in engineering and societal
problems.
CO2 Apply appropriate solutions for the applications using Hadoop tools.
CO3 Analyze the given data set and identify deep insights from the data set.
CO4 Design and apply appropriate analytics methods based on the nature of the problem,
the characteristics of the data, and the desired outcomes.

Mapping of Course outcomes to Program outcomes:


PO1 PO2 PO3 PO4 PO5 PO6 PO7 PO8 PO9 PO10 PO11 PO12 PSO1 PSO2 PSO3

CO1 3
CO2 3
CO3 3
CO4 2 2 3 2 3 2 2 2 3 3 3 2
Module Module Contents Hours
Classification of data, Characteristics, Evolution and definition of Big
data, What is Big data, Why Big data, Traditional Business Intelligence Vs
1. 08
Big Data, Typical data warehouse and Hadoop environment.
Big Data Analytics: What is Big data Analytics, Classification of
Analytics, Importance of Big Data Analytics, Technologies used in Big
data Environments, Few Top Analytical Tools, NoSQL
Introduction to Hadoop: Introducing hadoop, why hadoop, why not
2. RDBMS, RDBMS Vs Hadoop, History of Hadoop, Hadoop overview, use 08
case of Hadoop, HDFS (Hadoop Distributed File System), Processing data
with Hadoop, Managing resources and applications with Hadoop YARN
(Yet Another Resource Negotiator).
Introduction to Map Reduce Programming: Introduction, Mapper,
Reducer, Combiner, Partitioner, Searching, Sorting, Compression.
Introduction to MongoDB: What is MongoDB, Why MongoDB, Terms
used in RDBMS and MongoDB, Data Types in MongoDB, MongoDB Query
3. 08
Language. Introduction to Cassandra: Apache cassandra- An
Introduction, Features, CQL Datatypes, CQLSH, Keyspaces, CRUD,
Collections
Introduction to Hive: What is Hive, Hive Architecture, Hive data types, Hive
file formats, Hive Query Language (HQL), RC File implementation, User
Defined Function (UDF).
4. 08
Introduction to Data Analysis with Spark: What Is Apache Spark? A
Unified Stack, Programming with RDDs: RDD Basics, Creating RDDs, RDD
Operations, Transformations, Passing Functions to Spark-Python.
Machine Learning with MLlib: Overview, Machine Learning Basics, Data
Types, Algorithms - Feature Extraction, Statistics, Classification and
5. Regression, Clustering, Collaborative Filtering and Recommendation, 08
Dimensionality Reduction, Model Evaluation

Text Books:
1. Seema Acharya and Subhashini Chellappan, “Big data and Analytics”, Wiley India
Publishers, 2nd Edition, 2019. (Ch 1: 1.1, Ch2: 2.1-2.5,2.7,2.9-2.11, Ch3: 3.2,3.5,3.8,3.12,
Ch4: 4.1,4.2, Ch 5: 5.1-,5.8, 5.10-5.12, Ch 6: 6.1-6.5, Ch 7: 7.1-7.7, Ch 8: 8.1 - 8.8, Ch 9: 9.1-
9.6,9.8)
2. Holden Karau, Andy Konwinski, Patrick Wendell, Matei Zaharia, “Learning Spark: Lightning-
Fast Big Data Analysis”, O'Reilly, 2015. (Ch 1, Ch 3, Ch 11)
Reference Books:
1. Tom White, Hadoop: The Definitive Guide, ISBN-13: 978- 9352130672, 4th Edition, O‟Reilly
Media, 2015.
2. Arshdeep Bahga, Vijay Madisetti, Big Data Analytics: A Hands-On Approach, ISBN-13:
978-
st
0996025577, 1 Edition, VPT Publications, 2018.
3. Eric Sammer, Hadoop Operations: A Guide for Developers and Administrators, ISBN-13:
st
978-9350239261, 1 edition, O'Reilly Media, 2012.

You might also like