Unsupervised Learning and Clustering

The document discusses unsupervised learning, highlighting its difference from supervised learning, and emphasizes exploratory analysis to uncover patterns in data. It focuses on clustering techniques, particularly K-means clustering, detailing the process of assigning data points to clusters based on proximity to centroids. The document also includes code snippets for implementing K-means clustering using Python and visualizing the results.

Unsupervised - Jupyter Notebook [Link]

Unsupervised learning
• Supervised learning – Use the data to learn the output values

• Unsupervised learning – No output variables available

• Use the data to learn from the data
– Sometimes called exploratory analysis
– What to find in the data?
• Structure
• Regularities
• Hidden information
• Etc.

Clustering
• Divide data into groups (subsets/clusters) that are

1 of 10 3/27/2023, 12:33 PM
– Meaningful: Capture the natural structure of the data

– Useful: Depends on purpose

• Observations in the same cluster are similar in some sense

• Unsupervised classification

K-means clustering
Select K points as initial centroids
Repeat
– Form K clusters by assigning each point to its closest centroid using Euclidean distance
– Recompute the centroid of each cluster (mean of all objects in the cluster)
Until centroids do not change
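
The loop above can be sketched directly in NumPy. This is a minimal illustration rather than the implementation scikit-learn uses: the function name `kmeans_sketch`, the random choice of initial centroids, the `max_iter` cap, the tolerance-based stopping test, and the handling of empty clusters are all assumptions made for the example.

```python
import numpy as np

def kmeans_sketch(X, k, max_iter=100, seed=0):
    """Plain K-means following the steps listed above (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    # Select K points as initial centroids (assumption: K distinct rows of X at random).
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(max_iter):
        # Assign each point to its closest centroid using Euclidean distance.
        distances = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = distances.argmin(axis=1)
        # Recompute each centroid as the mean of its assigned points;
        # an empty cluster keeps its previous centroid (assumption of this sketch).
        new_centroids = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        # Stop when the centroids no longer change (up to floating-point tolerance).
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    return centroids, labels

# Toy data: two well-separated groups of three points each.
X_toy = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
                  [10.0, 10.0], [10.0, 11.0], [11.0, 10.0]])
centroids, labels = kmeans_sketch(X_toy, k=2)
print(centroids)
print(labels)
```

On data this well separated the two groups are recovered regardless of which points are picked initially; on harder data the result can depend on the initialization.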

In [10]: from sklearn.datasets import make_blobs

import numpy as np
import matplotlib as mpl
import matplotlib.pyplot as plt
from sklearn.cluster import KMeans

# Five blob centers, each with its own standard deviation
blob_centers = np.array(
    [[ 0.2, 2.3],
     [-1.5, 2.3],
     [-2.8, 1.8],
     [-2.8, 2.8],
     [-2.8, 1.3]])
blob_std = np.array([0.4, 0.3, 0.1, 0.1, 0.1])
X, y = make_blobs(n_samples=2000, centers=blob_centers,
                  cluster_std=blob_std, random_state=7)

def plot_clusters(X, y=None):
    # Scatter plot of the samples, optionally colored by label
    plt.scatter(X[:, 0], X[:, 1], c=y, s=1)
    plt.xlabel("$x_1$", fontsize=14)
    plt.ylabel("$x_2$", fontsize=14, rotation=0)

plt.figure(figsize=(8, 4))
plot_clusters(X)
plt.show()

<Figure size 576x288 with 0 Axes>

In [31]: k = 5
kmeans = KMeans(n_clusters=k, random_state=42)
y_pred = kmeans.fit_predict(X)
print(kmeans.cluster_centers_)
X_new = np.array([[-2.8, 1.7], [3, 2], [-3, 3], [-3, 2.5]])
kmeans.predict(X_new)
[[ 0.06154126 2.58026834]
[-2.80389616 1.80117999]
[-2.79290307 2.79641063]
[-1.47083264 2.28276928]
[ 0.32780688 1.98072917]
[-2.80037642 1.30082566]]

Out[31]: array([1, 4, 2, 2])
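
What `predict` does can be reproduced by hand: each new point receives the index of the centroid nearest to it in Euclidean distance. The sketch below assumes scikit-learn is installed and uses a small toy dataset; the names `X_toy`, `X_query`, and `km` are hypothetical and not part of the notebook, and `n_init=10` is set explicitly for the example.

```python
import numpy as np
from sklearn.cluster import KMeans

# Toy data: two well-separated groups (illustrative, not the notebook's blobs).
X_toy = np.array([[0.0, 0.0], [0.0, 1.0], [1.0, 0.0],
                  [10.0, 10.0], [10.0, 11.0], [11.0, 10.0]])
km = KMeans(n_clusters=2, n_init=10, random_state=42).fit(X_toy)

# Two new points, one near each group.
X_query = np.array([[0.5, 0.5], [9.0, 9.0]])

# Euclidean distance from every query point to every centroid, shape (2, 2).
distances = np.linalg.norm(
    X_query[:, None, :] - km.cluster_centers_[None, :, :], axis=2)
manual_labels = distances.argmin(axis=1)

# The manual nearest-centroid labels should match predict(),
# and the distance matrix should match transform().
print(manual_labels, km.predict(X_query))
```

The label numbers themselves are arbitrary: they index rows of `cluster_centers_` and carry no meaning beyond that.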

In [18]: def plot_data(X):
    # Plot the raw samples as small black dots
    plt.plot(X[:, 0], X[:, 1], 'k.', markersize=2)

def plot_centroids(centroids, weights=None, circle_color='w', cross_color='k'):
    # Optionally hide centroids whose weight is below a tenth of the largest weight
    if weights is not None:
        centroids = centroids[weights > weights.max() / 10]
    plt.scatter(centroids[:, 0], centroids[:, 1],
                marker='o', s=30, linewidths=8,
                color=circle_color, zorder=10, alpha=0.9)
    plt.scatter(centroids[:, 0], centroids[:, 1],
                marker='x', s=50, linewidths=50,
                color=cross_color, zorder=11, alpha=1)

def plot_decision_boundaries(clusterer, X, resolution=1000, show_centroids=True,
                             show_xlabels=True, show_ylabels=True):
    # Build a grid that covers the data with a small margin
    mins = X.min(axis=0) - 0.1
    maxs = X.max(axis=0) + 0.1
    xx, yy = np.meshgrid(np.linspace(mins[0], maxs[0], resolution),
                         np.linspace(mins[1], maxs[1], resolution))
    # Predict a cluster for every grid point and reshape back to the grid
    Z = clusterer.predict(np.c_[xx.ravel(), yy.ravel()])
    Z = Z.reshape(xx.shape)

    plt.contourf(Z, extent=(mins[0], maxs[0], mins[1], maxs[1]),
                 cmap="Pastel2")
    plt.contour(Z, extent=(mins[0], maxs[0], mins[1], maxs[1]),
                linewidths=1, colors='k')
    plot_data(X)
    if show_centroids:
        plot_centroids(clusterer.cluster_centers_)

    if show_xlabels:
        plt.xlabel("$x_1$", fontsize=14)
    else:
        plt.tick_params(labelbottom=False)
    if show_ylabels:
        plt.ylabel("$x_2$", fontsize=14, rotation=0)
    else:
        plt.tick_params(labelleft=False)

In [32]: plt.figure(figsize=(8, 4))

plot_decision_boundaries(kmeans, X)
plt.show()
kmeans.inertia_

Out[32]: 169.23715382893596
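
The `inertia_` attribute is the sum of squared Euclidean distances from each sample to its closest centroid, so it can be recomputed from `cluster_centers_` and `labels_`. The sketch below regenerates data with the same blob parameters as the notebook and fits its own model; the names `X_blobs`, `km`, `own_centroid`, and `manual_inertia` are hypothetical, and `n_init=10` is set explicitly here, so the value need not match the output shown above.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Same blob parameters as the notebook's data-generation cell.
blob_centers = np.array([[0.2, 2.3], [-1.5, 2.3], [-2.8, 1.8],
                         [-2.8, 2.8], [-2.8, 1.3]])
blob_std = np.array([0.4, 0.3, 0.1, 0.1, 0.1])
X_blobs, _ = make_blobs(n_samples=2000, centers=blob_centers,
                        cluster_std=blob_std, random_state=7)

# n_init=10 is an assumption of this sketch; the notebook relies on the default.
km = KMeans(n_clusters=5, n_init=10, random_state=42).fit(X_blobs)

# Look up the centroid assigned to each sample, then sum the squared distances.
own_centroid = km.cluster_centers_[km.labels_]
manual_inertia = np.sum((X_blobs - own_centroid) ** 2)
print(manual_inertia, km.inertia_)
```

Because inertia never increases when more clusters are added, it is useful for comparing runs with the same K but is not sufficient by itself for choosing K.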
