APEEJAY STYA UNIVERSITY
School of Engineering and Technology
VALUE ADDED COURSE
COURSE CODE: VACE 101
DATA ANALYSIS AND MACHINE LEARNING
Data Analysis involves extracting actionable insights from raw data. Various scientific
methods, algorithms, and processes are used to extract insights from vast amounts of data. Data
Science provides a vast array of tools for working with data coming from many different
sources, such as financial logs, multimedia files, marketing forms, sensors, and text files. An
important aspect of Data Science is the preparation of data for analysis, including cleaning,
aggregating, and manipulating it to perform advanced analysis. Data scientists are among the
most sought-after professionals globally, and almost all industries are actively hiring them,
so a Data Science certification is of great value. This Data Science training course is
designed for both beginners and professionals. After completing it, students will have an
in-demand set of skills that are critical to today's career opportunities, such as those of
data scientists and risk analysts. The Machine Learning
Specialization is a foundational online program created in collaboration with AI. This
Specialization is taught by an expert in the School of Engineering and provides a broad
introduction to modern machine learning, including supervised learning (multiple linear
regression, logistic regression, neural networks, and decision trees), unsupervised learning
(clustering, dimensionality reduction, recommender systems), and some of the best practices
used in Silicon Valley for artificial intelligence and machine learning innovation (evaluating and
tuning models, taking a data-centric approach to improving performance, and more).
COURSE OUTCOMES
By the end of this course, the student will have mastered the following key concepts and
gained the practical know-how:
To quickly and powerfully apply machine learning to challenging real-world problems.
Coding techniques of MLE
COURSE CONTENT
Syllabus
Module     Course Content                                        Hours   Schedule
Module 1   1. Overview and introduction to data science          8       1st and 2nd week
           2. The Shape of Data
Module 2   1. Overview and introduction to data science          8       3rd and 4th week
           2. The Shape of Data
Module 3   1. Working with Data                                  8       5th and 6th week
           2. Linear Regression
Module 4   1. Classification                                     8       7th and 8th week
           2. Non-linear models and tree-based methods
Module 5   1. Resampling methods, model selection and            8       9th and 10th week
              regularization
           2. Supervised and unsupervised learning and
              dimensionality reduction
           3. Text Modelling
Module 6   1. Text analysis                                      8       11th and 12th week
           2. Text classification and scaling
           3. Data from the Web
READING MATERIAL
1. Garrett Grolemund and Hadley Wickham (2016). R for Data Science: Import, Tidy, Transform,
Visualize, and Model Data. Sebastopol, CA: O’Reilly Media
2. Peter Lake (2013). Concise Guide to Databases: A Practical Introduction. Springer
3. David Blei (2012). “Probabilistic topic models.” Communications of the ACM, 55(4):
77-84
4. Lazer et al. (2014). “The Parable of Google Flu: Traps in Big Data Analysis.” Science,
343: 1203-1205
EVALUATION
Component Distribution (in %)
Lab Skills 20
Assignment 20
Presentation 30
End Term 30
TIME SLOT
Given the nature of the course, and to enable cross-faculty or interdisciplinary learning,
slots for the Value Added Course may vary during the semester and will be communicated by the
course faculty in advance.
COURSE DATE AND DURATION
The course will commence on 1 Sep 2017 and end on 24 Nov 2017. It will run for 4 hours
every week. Students shall register by 25 Aug 2017. The duration of this value-added course is
48 hours.
ATTENDANCE AND ASSESSMENT
The course faculty will maintain an Attendance and Assessment Record for the participants.
The Record shall contain details of each student’s attendance and the marks obtained in the
Continuous Internal Assessment (CIA) Tests, Assignments, Role Plays and Seminars. Each student
must have a minimum of 75% attendance in the course; failing this, he or she will not be
eligible for the final Examination, and the Certificate of Participation will not be awarded.
ELIGIBILITY
The course is open for admission to students from all Schools for the academic year 2017-2018.
INSTRUCTOR ACADEMIC PROFILE
Dr. Manpreet Singh Sehgal is an Assistant Professor in the
Computer Science and Engineering Department of the School of Engineering
and Technology, Apeejay Stya University (ASU). His doctorate is in the field
of deep web data extraction, and he holds an M.Tech (Information
Technology) from YMCAIE, Faridabad, and a B.Tech (Computer Science and
Engineering) from PTU, Jalandhar.