Data Science & Gen-AI Program (GetSetCode)
Course Overview
This is a 6-month intensive Data Science & Gen-AI program designed to provide practical,
industry-ready skills in Python and Data Science & Gen-AI through hands-on learning and
real-world projects.
Course Duration & Structure
● Total Duration: 6 Months
● In-Class Training: 4 Months
○ Daily practical-oriented sessions
○ Concept + implementation approach
● Hands-On Project Phase: 2 Months
○ Real-world project execution
○ Internship-style learning environment
Students will be actively coding and implementing concepts every day, not just theory.
Internship Details
● Students receive a 6-Month Internship Certificate from GetSetCode
● Internship runs parallel with the course
● Focus is on real-world problem solving and project development
Certifications (Total: 3)
Students will receive three recognized certifications upon successful completion:
1. 6-Month Internship Certificate
○ Issued by GetSetCode
2. Course Completion Certificate
○ Issued by GetSetCode (USA-registered parent company)
3. Udemy Certification
○ For completing the same Generative AI course on Udemy
Curriculum & Skills Covered
0.1 What is Data Science?
● What is Data Science?
● Data Science Lifecycle
● Real-world Data Science use cases
● Career roles: Data Analyst, Data Scientist, ML Engineer, AI Engineer
0.2 Tools & Environment
● VS Code, Jupyter, PyCharm
● Anaconda vs Miniconda
● Conda environments (best practices)
● Jupyter Notebook & JupyterLab
● Google Colab (cloud workflows)
PHASE 1: PYTHON PROGRAMMING FOR DATA
SCIENCE
1.1 Python Basics
● Installation & setup
● Comments, variables
● Data types
● Type conversion
● Input / Output
● Operators & precedence
● If–Else, match-case
● Loops (for, while)
● Loop-based problem solving
● Mini logic games
1.2 Functions & Core Python
● Functions
● Lambda functions
● List comprehensions
● Exception handling
● File handling (CSV, TXT, JSON)
● File handling mini-project
1.3 Data Structures
● List
● Tuple
● Set
● Dictionary
● Practical use-cases
1.4 Object-Oriented Programming (OOPs)
● Classes & Objects
● Constructors
● Inheritance
● Encapsulation
● Real-world OOP design
PHASE 2: DATA SCIENCE MATHEMATICS
2.1 Statistics Foundations
● Population vs Sample
● Descriptive vs Inferential Statistics
● Mean, Median, Mode
● Variance & Standard Deviation
● Percentiles & Quartiles
2.2 Probability
● Probability basics
● Conditional probability
● Bayes’ Theorem
● Probability distributions:
○ Uniform
○ Binomial
○ Normal
● Central Limit Theorem (CLT)
2.3 Advanced Statistics
● Covariance
● Correlation
● Hypothesis Testing
● p-value, confidence intervals
PHASE 3: DATA ANALYSIS & EDA
3.1 NumPy
● NumPy arrays
● Indexing & slicing
● Broadcasting
● Mathematical functions
● Multidimensional arrays
3.2 Pandas
● Series & DataFrames
● Data loading (CSV, Excel)
● Data cleaning
● Handling missing values
● Filtering & transformation
● GroupBy & aggregation
● Merge, join & concat
● Pivot & melt
3.3 Data Visualization
● Matplotlib:
○ Line, bar, pie, histogram, scatter
● Seaborn:
○ Statistical plots
● Visual storytelling for insights
PHASE 4: DATA COLLECTION & DATABASES
4.1 Data Collection
● Web scraping basics
● HTML for scraping
● Requests module
● BeautifulSoup
● Ethics & legal considerations
4.2 SQL for Data Science
● MySQL setup
● CRUD operations
● Constraints & keys
● Joins & subqueries
● GROUP BY
● Views & indexes
● Stored procedures
● SQL for analytics
PHASE 5: MACHINE LEARNING – CORE
5.1 Machine Learning Foundations
● What is ML?
● Types of ML
● ML workflow
● Bias–variance tradeoff
5.2 Data Preprocessing (Critical)
● Missing values
● Encoding:
○ One-hot
○ Label
○ Ordinal
● Feature scaling:
○ Standardization
○ Normalization
● Outlier detection
● Handling duplicates
● Data pipelines (sklearn)
PHASE 6: SUPERVISED LEARNING
6.1 Regression
● Train-test split
● Linear regression
● Multiple linear regression
● Polynomial regression
● Cost functions
● R² & Adjusted R²
● RMSE & MAE
6.2 Classification
● Logistic regression:
○ Binary
○ Multiclass
○ Polynomial features
● Confusion matrix
● Precision, recall, F1
● Imbalanced datasets
● Naive Bayes
PHASE 7: ADVANCED ML ALGORITHMS
7.1 Non-Linear Models
● Decision Trees (Regression & Classification)
● KNN
● Hyperparameter tuning
● Cross-validation
7.2 Unsupervised Learning
● K-means
● Hierarchical clustering
● DBSCAN
● Silhouette score
7.3 Association Rule Learning
● Apriori
● FP-Growth
● Market basket analysis
7.4 Ensemble Learning
● Voting classifiers
● Bagging
● Random Forest
● Model comparison
PHASE 8: DEEP LEARNING & AI
8.1 Deep Learning Fundamentals
● Neural networks
● Neurons & layers
● Perceptrons
● ANN architecture
8.2 Training Neural Networks
● Forward propagation
● Backpropagation
● Activation functions
● Loss functions
● Optimizers
8.3 Frameworks
● TensorFlow
● Keras
● PyTorch (overview)
● MNIST implementation
PHASE 9: LLMs & MODERN AI
● What are LLMs
● How LLMs work
● RAG (Retrieval-Augmented Generation)
● Embeddings
● Vector databases (conceptual)
● AI tools for data scientists
PHASE 10: SOFTWARE & DEPLOYMENT SKILLS
10.1 Git & GitHub
● Git workflow
● Branching
● Merge conflicts
● GitHub projects
● Portfolio readiness
10.2 Web & APIs for Data Scientists
● Flask
● APIs
● Frontend basics
● Model serving concepts
PHASE 11: REAL-WORLD PROJECTS (ONGOING)
Mandatory Projects
1. Data Cleaning & EDA Project
2. ML Regression Project
3. ML Classification Project
4. RAG-based AI Assistant
5. End-to-end ML Deployment (Optional)
Projects are industry-relevant and designed to build a strong portfolio.
Teaching Methodology
● 100% Practical-Oriented Training
● Daily hands-on implementation
● Learning by building real projects
● Continuous evaluation through practical work
About Trainers
● All trainers are working professionals
● Each trainer has 5+ years of industry experience
● Students learn real industry practices, not just academic concepts
Support System
● Live doubt-solving support
● Students can directly ask questions during or after sessions
● Continuous guidance throughout the course
Career Opportunities After Completion
After completing the program, students can explore:
● Freelancing opportunities
● Paid internship opportunities
● Full-time job opportunities
● Strong project portfolio to showcase skills
Key Points
● No prerequisites required
● Learn from industry professionals
● 30+ real-world projects
● 3 certifications included
● Internship + career opportunities
● Practical, job-focused training
After completing the course, you’ll be AI job-ready with strong practical skills and
real-world projects added to your portfolio and resume.