0% found this document useful (0 votes)

8 views17 pages

Mlshit

The document outlines multiple experiments focused on implementing various machine learning techniques including Linear Regression, Ensemble Learning, Multivariate Linear Regression, Support Vector Machines (SVM), Graph-Based Clustering, CART, Linear Discriminant Analysis (LDA), and PCA/SVD. Each experiment includes an aim, theoretical background, implementation steps, and code examples, demonstrating practical applications of these algorithms in Python. The document serves as a tutorial for understanding and applying these machine learning methods.

Uploaded by

0447.ankit

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views17 pages

Mlshit

Uploaded by

0447.ankit

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Date:

Experiment No. 1

Aim:
To implement Linear Regression.
Theory:
Linear regression uses the relationship between the data-points to draw a straight line through all
them.
This line can be used to predict future values.

CODE:

1. import numpy as nmp

2. import [Link] as mtplt
50. plot_regression_line(p, q, b)
51.
52. if __name__ == "__main__":
53. main()

OUTPUT:

CONCLUSION:

Linear regression is a statistical technique to describe relationships between dependent variables

with a number of independent variables. This tutorial will discuss the basic concepts of linear
regression as well as its application within Python.
In order to give an understanding of the basics of the concept of linear regression, we begin with
the most basic form of linear regression, i.e., "Simple linear regression".

**
OUTPUT:

ACCURACY : 95.61%

LOGISTIC REGRESSION MODEL : 2.63%

CONCLUSION:
We learned how to implement your custom binary logistic regression model in Python while
understanding the underlying math. You saw how similar the logistic regression model can be
to a simple neural network.

**
Date:
Experiment No. 3

Aim:
To implement Ensemble learning (bagging/boosting)

Theory:
Bootstrap Aggregating, also known as bagging, is a machine learning ensemble meta-algorithm designed
to improve the stability and accuracy of machine learning algorithms used in statistical classification and
regression. It decreases the variance and helps to avoid overfitting. It is usually applied to decision tree
methods. Bagging is a special case of the model averaging approach.

Implementation Steps of Bagging

 Step 1: Multiple subsets are created from the original data set with equal tuples, selecting
observations with replacement.
 Step 2: A base model is created on each of these subsets.
 Step 3: Each model is learned in parallel with each training set and independent of each other.
 Step 4: The final predictions are determined by combining the predictions from all the models.
CODE:
# evaluate bagging algorithm for classification

from numpy import mean

from numpy import std
from [Link] import make_classification
from sklearn.model_selection import cross_val_score
from sklearn.model_selection import RepeatedStratifiedKFold
from [Link] import BaggingClassifier
# define dataset
X, y = make_classification(n_samples=1000, n_features=20, n_informative=15, n_redundant=5,
random_state=5)
# define the model
model = BaggingClassifier()
# evaluate the model
cv = RepeatedStratifiedKFold(n_splits=10, n_repeats=3, random_state=1)
n_scores = cross_val_score(model, X, y, scoring='accuracy', cv=cv, n_jobs=-1, error_score='raise')
# report performance
print('Accuracy: %.3f (%.3f)' % (mean(n_scores), std(n_scores)))

OUTPUT:
RAISE, SCORES

**
Date:
Experiment No. 4

Aim:
To implement multivariate Linear Regression
Theory:
Step 1: Select the features. First, you need to select that one feature that drives the
multivariate regression. ...
Step 2: Normalize the feature. ...
Step 3: Select loss function and formulate a hypothesis. ...
Step 4: Minimize the cost and loss function. ...
Step 5: Test the hypothesis
CODE:
import numpy as np
import [Link] as sm

y = [1,2,3,4,3,4,5,4,5,5,4,5,4,5,4,5,6,5,4,5,4,3,4]

x=[
[4,2,3,4,5,4,5,6,7,4,8,9,8,8,6,6,5,5,5,5,5,5,5],
[4,1,2,3,4,5,6,7,5,8,7,8,7,8,7,8,7,7,7,7,7,6,5],
[4,1,2,5,6,7,8,9,7,8,7,8,7,7,7,7,7,7,6,6,4,4,4]
]

def reg_m(y, x):

ones = [Link](len(x[0]))
X = sm.add_constant(np.column_stack((x[0], ones)))
for ele in x[1:]:
X = sm.add_constant(np.column_stack((ele, X)))
results = [Link](y, X).fit()
return results

**
Date:
Experiment No. 5

Aim:
To implement SVM
Theory:
Support Vector Machine (SVM) is a powerful machine learning algorithm used for linear or
nonlinear classification, regression, and even outlier detection tasks. SVMs can be used for a
variety of tasks, such as text classification, image classification, spam detection, handwriting
identification, gene expression analysis, face detection, and anomaly detection. SVMs are
adaptable and efficient in a variety of applications because they can manage high-dimensional
data and nonlinear relationships.
SVM algorithms are very effective as we try to find the maximum separating hyperplane
between the different classes available in the target feature.

**
Date:
EXPERIMENT NO. 06

AIM:
To implement Graph Based Clustering
THEORY:
1. Click the counts data node.
2. Click the Exploratory analysis section of the toolbox.
3. Click Graph-based clustering.
4. Configure the parameters.
5. Click Finish to run.

CODE:

import networkx as nx
G = [Link]()
G.add_edges_from([('A', 'B'), ('A', 'K'), ('B', 'K'), ('A', 'C'),
('B', 'C'), ('C', 'F'), ('F', 'G'), ('C', 'E'),
('E', 'F'), ('E', 'D'), ('E', 'H'), ('I', 'J')])
print([Link](G, 'C'))

**
[Link](figsize =(9, 9))
[Link](X_principal['P1'], X_principal['P2'], c = cvec)
[Link]((r, g, b, c, y, m, k),
('Label 0', 'Label 1', 'Label 2', 'Label 3 'Label 4',
'Label 5', 'Label -1'),
scatterpoints = 1,
loc ='upper left',
ncol = 3,
fontsize = 8)
[Link]()

OUTPUT:
BEFORE
AFTER

CONCLUSION:

Density Based Spatial Clustering of Applications with Noise(DBCSAN) is a clustering

algorithm which was proposed in 1996. In 2014, the algorithm was awarded the ‘Test of
Time’ award at the leading Data Mining conference, KDD.
Dataset – Credit Card

**
Date:
EXPERIMENT NO. 08
AIM:
To implement CART
THEORY:
CART( Classification And Regression Trees) is a variation of the decision tree
algorithm. It can handle both classification and regression tasks. Scikit-Learn uses the
Classification And Regression Tree (CART) algorithm to train Decision Trees (also called
“growing” trees). CART was first produced by Leo Breiman, Jerome Friedman, Richard
Olshen, and Charles Stone in 1984.
CART(Classification And Regression Tree) for Decision Tree
CART is a predictive algorithm used in Machine learning and it explains how the target
variable’s values can be predicted based on other matters. It is a decision tree where each fork
is split into a predictor variable and each node has a prediction for the target variable at the end.
The term CART serves as a generic term for the following categories of decision trees:
 Classification Trees: The tree is used to determine which “class” the target variable is most
likely to fall into when it is continuous.
 Regression trees: These are used to predict a continuous variable’s value.
CODE:
import numpy as np
import pandas as pd
iris = load_iris()
X = [Link]
y = [Link]
iris_df = [Link](data=[Link], columns=iris.feature_names)
iris_df['species'] = iris.target_names[[Link]]
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)
clf = DecisionTreeClassifier(criterion='gini', random_state=42)
[Link](X_train, y_train)
y_pred = [Link](X_test)
print("Accuracy Score:", accuracy_score(y_test, y_pred))
print("Classification Report:")
print(classification_report(y_test,y_pred, target_names=iris.target_names))
[Link](figsize=(20,10))
tree.plot_tree(clf, feature_names=iris.feature_names, class_names=iris.target_names,
filled=True)
[Link]()

**
Date:
EXPERIMENT NO. 09

AIM:
To implement LDA
THEORY:
Linear Discriminant Analysis (LDA), also known as Normal Discriminant Analysis or
Discriminant Function Analysis, is a dimensionality reduction technique primarily utilized
in supervised classification problems. It facilitates the modeling of distinctions between
groups, effectively separating two or more classes. LDA operates by projecting features
from a higher-dimensional space into a lower-dimensional one. In machine learning, LDA
serves as a supervised learning algorithm specifically designed for classification tasks,
aiming to identify a linear combination of features that optimally segregates classes within
a dataset.
For example, we have two classes and we need to separate them efficiently. Classes can
have multiple features. Using only a single feature to classify them may result in some
overlapping as shown in the below figure. So, we will keep on increasing the number of
features for proper classification.
CONCLUSION:
Linear discriminant analysis (LDA), also known as normal discriminant analysis (NDA) or
discriminant function analysis (DFA), builds on Fisher's linear discriminant, a statistical approach
pioneered by Sir Ronald Fisher. It is a dimensionality reduction technique that is used
in supervised machine learning.
The primary function of LDA is to project high-dimensional data on to a lower-dimensional space
while retaining the data's inherent class separability. LDA can be applied to enhance the operation
of classification algorithms such as a decision tree or random forest.

**
Date:

EXPERIMENT NO. 10
AIM:
To implement PCA/SVD/LDA
THEORY:
PCA is suitable for unsupervised dimensionality reduction, LDA is effective for supervised
problems with a focus on class separability, and SVD is versatile, catering to various
applications including collaborative filtering and matrix factorization.
CODE:
import pandas as pd
iris = datasets.load_iris()
df = [Link](iris['data'], columns = iris['feature_names'])
[Link]()
scalar = StandardScaler()
scaled_data = [Link](scalar.fit_transform(df)) #scaling the data
scaled_data
[Link](scaled_data.corr())
pca = PCA(n_components = 3)
[Link](scaled_data)
data_pca = [Link](scaled_data)
data_pca = [Link](data_pca,columns=['PC1','PC2','PC3'])
data_pca.head()
[Link](data_pca.corr())

Mtech Lab Manual
No ratings yet
Mtech Lab Manual
11 pages
Labpdf
No ratings yet
Labpdf
10 pages
AI Algorithms Lab Manual
No ratings yet
AI Algorithms Lab Manual
29 pages
Linear Regression and SVM Implementation
No ratings yet
Linear Regression and SVM Implementation
25 pages
Scikit-learn Machine Learning Guide
No ratings yet
Scikit-learn Machine Learning Guide
20 pages
Experiments in Machine Learning Models
No ratings yet
Experiments in Machine Learning Models
21 pages
K-Nearest Neighbour Iris Classification
No ratings yet
K-Nearest Neighbour Iris Classification
16 pages
ML 1
No ratings yet
ML 1
22 pages
Complete Machine Learning Algorithms Interview Guide
No ratings yet
Complete Machine Learning Algorithms Interview Guide
41 pages
Machine Learning Lab Exercises in Python
No ratings yet
Machine Learning Lab Exercises in Python
37 pages
Supervised vs Unsupervised ML Algorithms
No ratings yet
Supervised vs Unsupervised ML Algorithms
16 pages
Evidence-Based Analysis with Python
No ratings yet
Evidence-Based Analysis with Python
21 pages
Linear Regression and Classification Models
No ratings yet
Linear Regression and Classification Models
20 pages
Machine Learning Techniques Overview
No ratings yet
Machine Learning Techniques Overview
15 pages
Supervised vs Unsupervised Learning Guide
No ratings yet
Supervised vs Unsupervised Learning Guide
12 pages
Mushroom Classification Lab Report
No ratings yet
Mushroom Classification Lab Report
14 pages
AI Lab
No ratings yet
AI Lab
15 pages
IML - File Till 8
No ratings yet
IML - File Till 8
11 pages
Machine Learning Lab by Tanvi Wadhwa
No ratings yet
Machine Learning Lab by Tanvi Wadhwa
47 pages
8 Python Programs for Data Classification
No ratings yet
8 Python Programs for Data Classification
18 pages
Logistic Regression Analysis in Python
No ratings yet
Logistic Regression Analysis in Python
17 pages
Linear and Non-Linear Regression & Clustering
No ratings yet
Linear and Non-Linear Regression & Clustering
12 pages
MLT Theory
No ratings yet
MLT Theory
8 pages
Linear Regression and Classification Models
No ratings yet
Linear Regression and Classification Models
13 pages
Machine Learning Lab Certificate and Experiments
No ratings yet
Machine Learning Lab Certificate and Experiments
44 pages
Data Mining Experiments Overview
100% (2)
Data Mining Experiments Overview
43 pages
ML Full Lab Manual
No ratings yet
ML Full Lab Manual
19 pages
Machine Learning Model Training Guide
No ratings yet
Machine Learning Model Training Guide
25 pages
ML Lab
No ratings yet
ML Lab
39 pages
Machine Learning Classification Overview
No ratings yet
Machine Learning Classification Overview
214 pages
Ensemble Methods in Machine Learning
No ratings yet
Ensemble Methods in Machine Learning
24 pages
Python Simple Linear Regression Guide
No ratings yet
Python Simple Linear Regression Guide
56 pages
Cy-701 Machine Learning Lab Manual
No ratings yet
Cy-701 Machine Learning Lab Manual
31 pages
SK Krai Hardware Data Analysis Techniques
No ratings yet
SK Krai Hardware Data Analysis Techniques
38 pages
Gradient Boosting for Electricity Theft Detection
No ratings yet
Gradient Boosting for Electricity Theft Detection
10 pages
DMW Record
No ratings yet
DMW Record
28 pages
Understanding Classification in Machine Learning
No ratings yet
Understanding Classification in Machine Learning
10 pages
Lab Manual
No ratings yet
Lab Manual
23 pages
Machine Learning Lab ManuaL
No ratings yet
Machine Learning Lab ManuaL
39 pages
Random Forest Model Evaluation Guide
No ratings yet
Random Forest Model Evaluation Guide
7 pages
ML Lab: Experiments in Machine Learning
No ratings yet
ML Lab: Experiments in Machine Learning
36 pages
Scikit Learn Cross-Validation Guide
No ratings yet
Scikit Learn Cross-Validation Guide
141 pages
Linear Regression and KNN Algorithms Guide
100% (5)
Linear Regression and KNN Algorithms Guide
56 pages
Understanding Random Forest Algorithm
No ratings yet
Understanding Random Forest Algorithm
45 pages
Alogrithm and AI Lab (Mtech 2024-2025)
No ratings yet
Alogrithm and AI Lab (Mtech 2024-2025)
26 pages
Implementing Ensemble Learning in Python
No ratings yet
Implementing Ensemble Learning in Python
23 pages
Machine Learning for Breast Cancer Prediction
No ratings yet
Machine Learning for Breast Cancer Prediction
8 pages
BCA Part II Sem IV - Lab - Introduction To ML
No ratings yet
BCA Part II Sem IV - Lab - Introduction To ML
47 pages
Introduction to Machine Learning Types
No ratings yet
Introduction to Machine Learning Types
8 pages
Decision Tree Implementation Guide
No ratings yet
Decision Tree Implementation Guide
7 pages
Python Machine Learning Experiments
No ratings yet
Python Machine Learning Experiments
14 pages
Bayesian Classification and Clustering in Python
No ratings yet
Bayesian Classification and Clustering in Python
25 pages
Machine Learning Laboratory Record
No ratings yet
Machine Learning Laboratory Record
23 pages
Final Ai Lab 6 Programs 2024-25
No ratings yet
Final Ai Lab 6 Programs 2024-25
10 pages
Supervised Machine Learning Overview
No ratings yet
Supervised Machine Learning Overview
42 pages
Machine Learning Lab Manual for B.Tech AI
No ratings yet
Machine Learning Lab Manual for B.Tech AI
22 pages
06 Machine Learning Fundamentals
No ratings yet
06 Machine Learning Fundamentals
13 pages
15066/Pnvl GKP Express Sleeper Class (SL)
No ratings yet
15066/Pnvl GKP Express Sleeper Class (SL)
2 pages
Decentralized Chat App with Blockchain
No ratings yet
Decentralized Chat App with Blockchain
13 pages
Cse (Aiml) Sem 7 Blockchain Technologies
No ratings yet
Cse (Aiml) Sem 7 Blockchain Technologies
53 pages
Cyber Security Laws: Chapters 1 & 2 Overview
No ratings yet
Cyber Security Laws: Chapters 1 & 2 Overview
51 pages
Aadhaar Demographics Update Receipt
No ratings yet
Aadhaar Demographics Update Receipt
1 page
Secure P2P Communication via Blockchain
No ratings yet
Secure P2P Communication via Blockchain
5 pages
Decentralized Messaging via Blockchain
No ratings yet
Decentralized Messaging via Blockchain
7 pages
Indian Railways Booking Details
No ratings yet
Indian Railways Booking Details
2 pages
Class 3 Maths MCQ Test Paper
No ratings yet
Class 3 Maths MCQ Test Paper
5 pages
Bias in Bookmaker Probability Methods
No ratings yet
Bias in Bookmaker Probability Methods
15 pages
Momentum Investing Vs Index Base and Value Investing in India
No ratings yet
Momentum Investing Vs Index Base and Value Investing in India
15 pages
Autism Spectrum Disorder Diagnosis Framework
No ratings yet
Autism Spectrum Disorder Diagnosis Framework
50 pages
Social Capital and Climate Adaptation in Nigeria
No ratings yet
Social Capital and Climate Adaptation in Nigeria
15 pages
Predicting User Interaction on Microblogs
No ratings yet
Predicting User Interaction on Microblogs
10 pages
Machine Learning: Types & Algorithms
No ratings yet
Machine Learning: Types & Algorithms
9 pages
(Original PDF) Marketing Research An Applied Orientation 7th by Naresh K. Malhotra Full Digital Chapters
50% (2)
(Original PDF) Marketing Research An Applied Orientation 7th by Naresh K. Malhotra Full Digital Chapters
110 pages
Data Analysis with Python: Regression & Classification
No ratings yet
Data Analysis with Python: Regression & Classification
30 pages
Web Apps for Heart Disease & Diabetes Risk
No ratings yet
Web Apps for Heart Disease & Diabetes Risk
2 pages
Data Mining for Personal Credit Scoring
No ratings yet
Data Mining for Personal Credit Scoring
8 pages
Heavy vs. Light Smokers: Key Differences
No ratings yet
Heavy vs. Light Smokers: Key Differences
5 pages
Clickstream Data Analysis for Online Sales
No ratings yet
Clickstream Data Analysis for Online Sales
35 pages
Business Intelligence Course Overview
No ratings yet
Business Intelligence Course Overview
40 pages
Churn Prediction with Machine Learning
No ratings yet
Churn Prediction with Machine Learning
3 pages
Family Psychoeducation for Type 1 Diabetes
No ratings yet
Family Psychoeducation for Type 1 Diabetes
14 pages
Supervised Learning: Regression & Classification
No ratings yet
Supervised Learning: Regression & Classification
40 pages
IPL Match Outcome Prediction Using ML
No ratings yet
IPL Match Outcome Prediction Using ML
29 pages
Deep Learning Techniques in Genomics
No ratings yet
Deep Learning Techniques in Genomics
15 pages
Factors Influencing Financial Report Timeliness
No ratings yet
Factors Influencing Financial Report Timeliness
12 pages
Bioassay Data Analysis Guidelines
No ratings yet
Bioassay Data Analysis Guidelines
17 pages
Machine Learning Terms Glossary
No ratings yet
Machine Learning Terms Glossary
85 pages
Aggregator Model Article
No ratings yet
Aggregator Model Article
18 pages
Principles of Regression Analysis 2025
No ratings yet
Principles of Regression Analysis 2025
36 pages
Machine Learning in Insurance Claims Analysis
No ratings yet
Machine Learning in Insurance Claims Analysis
15 pages
Predictive Classification Analysis Template
No ratings yet
Predictive Classification Analysis Template
4 pages
Predictive Modeling in Construction Management
No ratings yet
Predictive Modeling in Construction Management
32 pages
(Ebook PDF) Using IBM? SPSS? Statistics For Research Methods and Social Science Statistics 7th Edition Updated 2025
No ratings yet
(Ebook PDF) Using IBM? SPSS? Statistics For Research Methods and Social Science Statistics 7th Edition Updated 2025
81 pages
CS3491 Unit 3: Supervised Learning Notes
No ratings yet
CS3491 Unit 3: Supervised Learning Notes
17 pages
Correlates of The Victim-Offender Relationship in Homicide
No ratings yet
Correlates of The Victim-Offender Relationship in Homicide
16 pages
Postdialysis Hypertension: Associated Factors, Patient Profiles, and Cardiovascular Mortality
No ratings yet
Postdialysis Hypertension: Associated Factors, Patient Profiles, and Cardiovascular Mortality
6 pages

Mlshit

Uploaded by

Mlshit

Uploaded by

Date:

1. import numpy as nmp

Linear regression is a statistical technique to describe relationships between dependent variables

LOGISTIC REGRESSION MODEL : 2.63%

Implementation Steps of Bagging

from numpy import mean

def reg_m(y, x):

Density Based Spatial Clustering of Applications with Noise(DBCSAN) is a clustering

You might also like