0% found this document useful (0 votes)

6 views8 pages

Clustering Techniques in Data Mining Lab

The document outlines a lab experiment for TE CSE students at Finolex Academy, focusing on clustering using open-source tools like Weka. It details lab objectives, outcomes, practical applications, and provides examples of K-means clustering in both one-dimensional and two-dimensional cases. The conclusion emphasizes the relevance of clustering algorithms in industry and engineering, along with the skills developed through the experiment.

Uploaded by

sakwarefaisal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

6 views8 pages

Clustering Techniques in Data Mining Lab

Uploaded by

sakwarefaisal

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Hope Foundation’s

Finolex Academy of Management and Technology, Ratnagiri

Department of Computer Science and Engineering (AIML)

Subject name: Data Warehousing and Mining Lab Subject Code: CSL503

Class TE CSE Semester –VI (CBCGS) Academic year: 2024-25

Name of Student Zain Munawar Solkar QUIZ Score :

Roll No 75 Experiment No. 04
Title: Using open source tools perform Clustering.

1. Lab objectives applicable:

LOB4: To make students well versed in all data mining algorithms, methods, and tools.
2. Lab outcomes applicable:
LO3: Demonstrate an understanding of the importance of data mining.
LO6: Implement the appropriate data mining methods like classification, clustering or Frequent Pattern mining on large
data sets.
3. Learning Objectives:
1. To determine similarity and dissimilarity among elements and create clusters accordingly.
4. Practical applications of the assignment/experiment:
Clustering algorithms group similar data points together to uncover patterns and relationships, enhancing data
analysis and decision-making.
5. Prerequisites:
NA
6. Minimum Hardware Requirements:
1. I series processor, RAM 4GB,
7. Software Requirements:
1. Weka 3.8
8. Quiz Questions
[Link]
wform?usp=sf_link
9. Experiment/Assignment Evaluation:
Sr. No. Parameters Marks obtained Out of

1 Technical Understanding (Assessment may be done based on Q & A or any 6

other relevant method.) Teacher should mention the other method used -
2 Lab Performance 2
3 Punctuality 2
Date of performance (DOP) Total marks obtained 10

Signature of Faculty

Department of Computer Science and Engineering

10. Theory:

Solve example which is fed as input to Weka software. K-means one dimensional problem and 2-dimensional problem

Q.1) Implement k means clustering to form 2 clusters.

{13, 16, 29 ,78, 21, 43, 56, 90, 21, 8, 88, 60, 34}

Solution: -
Step 1 –
K=2
Let the two clusters be K1 and K2 with means M1 and M2 respectively
M1=29, M2=13
Step 2 –
Cluster K1: {29, 78, 21, 43, 56, 90, 21, 88, 60, 34}
Cluster K2: {13, 16, 8}
New M1 = (29 + 78 + 21 + 43 + 56 + 90 + 21 + 88 + 60 + 34) / 10 = 520 / 10 = 52.0
New M2 = (13 + 16 + 8) / 3 = 37 / 3 ≈ 12.33

Cluster K1: {29, 78, 43, 56, 90, 88, 60, 34}
Cluster K2: {13, 16, 21, 21, 8}
New M1 = (29 + 78 + 43 + 56 + 90 + 88 + 60 + 34) / 8 = 478 / 8 = 59.75
New M2 = (13 + 16 + 21 + 21 + 8) / 5 = 79 / 5 = 15.8

Cluster K1: {78, 43, 56, 90, 88, 60}

Cluster K2: {13, 16, 29, 21, 21, 8, 34}
New M1 = (78 + 43 + 56 + 90 + 88 + 60) / 6 = 415 / 6 ≈ 69.17
New M2 = (13 + 16 + 29 + 21 + 21 + 8 + 34) / 7 = 142 / 7 ≈ 20.29

Cluster K1: {78, 56, 90, 88, 60}

Cluster K2: {13, 16, 29, 21, 21, 8, 34, 43}
New M1 = (78 + 56 + 90 + 88 + 60) / 5 = 372 / 5 = 74.4
New M2 = (13 + 16 + 29 + 21 + 21 + 8 + 34 + 43) / 8 = 185 / 8 = 23.13

Cluster K1: {78, 56, 90, 88, 60}

Cluster K2: {13, 16, 29, 21, 21, 8, 34, 43}

No changes in the Clusters.

Step 3 –
Final Clusters are; -
K1 (Mean ≈ 74.4): {78, 56, 90, 88, 60}
K2 (Mean ≈ 23.13): {13, 16, 29, 21, 21, 8, 34, 43}

Q.2) Apply k means clustering to form 2 clusters.

Object Attribute1 (X) Attribute 2 (Y)

Weight index PH
MedicineA 1 1
MedicineB 2 1
MedicineC 4 3
MedicineD 5 4

Solution: -
Step 1 –
K=2
Let the two clusters be K1 and K2 with means M1 and M2 respectively
M1=MedicineC (4,3), M2=MedicineA (1,1)

Department of Computer Science and Engineering

Step 2 –

Object Coordinates Distance to M1 (4,3) Distance to M2 (1,1) Assigned

Cluster
MedicineA (1,1) √((4 − 1)² + (3 − 1)²) = √13 ≈ √((1 − 1)² + (1 − 1)²) = 0.00 K2
3.61
MedicineB (2,1) √((4 − 2)² + (3 − 1)²) = √8 ≈ √((1 − 2)² + (1 − 1)²) = √1 = K2
2.83 1.00
MedicineC (4,3) √((4 − 4)² + (3 − 3)²) = 0.00 √((1 − 4)² + (1 − 3)²) = √13 ≈ K1
3.61
MedicineD (5,4) √((4 − 5)² + (3 − 4)²) = √2 ≈ √((1 − 5)² + (1 − 4)²) = √25 = K1
1.41 5.00

K1: {MedicineC (4, 3), MedicineD (5, 4)}

K2: {MedicineA (1, 1), MedicineB (2, 1)}
Updated Means:
M1 = (4.5, 3.5)
M2 = (1.5, 1)

Object Coordinates Distance to M1 (4.5,3.5) Distance to M2 (1.5,1) Assigned

Cluster
MedicineA (1,1) √((4.5 − 1)² + (3.5 − 1)²) = √((1.5 − 1)² + (1 − 1)²) = K2
√18.5 ≈ 4.30 √0.25 = 0.50
MedicineB (2,1) √((4.5 − 2)² + (3.5 − 1)²) = √((1.5 − 2)² + (1 − 1)²) = K2
√12.5 ≈ 3.54 √0.25 = 0.50
MedicineC (4,3) √((4.5 − 4)² + (3.5 − 3)²) = √((1.5 − 4)² + (1 − 3)²) = K1
√0.5 ≈ 0.71 √10.25 ≈ 3.20
MedicineD (5,4) √((4.5 − 5)² + (3.5 − 4)²) = √((1.5 − 5)² + (1 − 4)²) = K1
√0.5 ≈ 0.71 √21.25 ≈ 4.61

K1: {MedicineC (4, 3), MedicineD (5, 4)}

K2: {MedicineA (1, 1), MedicineB (2, 1)}
Updated Means:
M1 = (4.5, 3.5)
M2 = (1.5, 1)

No changes in the Clusters

Step 3 –
Final Clusters are; -
K1: {MedicineC (4, 3), MedicineD (5, 4)}
K2: {MedicineA (1, 1), MedicineB (2, 1)}

Department of Computer Science and Engineering

11. Outcome –

K-means 1D -
Source code:

Department of Computer Science and Engineering

Output:

Department of Computer Science and Engineering

K-means 2D -
Source code:

Department of Computer Science and Engineering

Output:

Department of Computer Science and Engineering

12. Learning Outcomes Achieved

1. Students are able to cluster the given data in k- some known number of clusters.

13. Conclusion:

1. Applications of the Studied Technique in Industry

Clustering algorithms, such as K-means or hierarchical clustering, are widely used in industry for customer
segmentation, market analysis, and anomaly detection. These techniques help businesses tailor marketing strategies,
optimize resource allocation, and identify unusual patterns or trends in large datasets

2. Engineering Relevance

Clustering algorithms are crucial in engineering for solving complex problems related to pattern recognition, image
processing, and system optimization. They enable engineers to group similar data points, improve model accuracy, and
make informed decisions based on data-driven insights.

3. Skills Developed

The experiment with clustering algorithms enhances skills in data preprocessing, algorithm implementation, and result
interpretation. It also develops expertise in applying statistical techniques to solve real-world problems, as well as
proficiency in using data mining tools and software for effective data analysis.

14. References:

[1] https:// Paulraj Ponniah, “Data Warehousing: Fundamentals for IT Professional” , Wiley Publications
[2] Han, Kamber, "Data Mining Concepts and Techniques", Morgan Kaufmann 3nd Edition.
[3] Margaret H. Dunham, “Data Mining: Introductory and Advanced Topics”, Person Education.
[4] Raghu Ramakrishnan and Johannes Gehrke, “Database Management Systems”, 3rd Edition McGraw Hill.
[5] Elmasari and Navathe, “Fundamentals of Database Systems”, Pearson Education.

Department of Computer Science and Engineering

K Mean Clustering
No ratings yet
K Mean Clustering
31 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
31 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
48 pages
K-Means Clustering Example Explained
No ratings yet
K-Means Clustering Example Explained
3 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
12 pages
Clustering Methods and Applications
No ratings yet
Clustering Methods and Applications
153 pages
K-Means Clustering Overview and Applications
No ratings yet
K-Means Clustering Overview and Applications
25 pages
K-means Clustering Explained
No ratings yet
K-means Clustering Explained
3 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
32 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
20 pages
Lecture 9 Unsupervised Learning K Means, Association Analysis and
No ratings yet
Lecture 9 Unsupervised Learning K Means, Association Analysis and
76 pages
Understanding Cluster Analysis Techniques
No ratings yet
Understanding Cluster Analysis Techniques
91 pages
K-Means Clustering Explained with Example
No ratings yet
K-Means Clustering Explained with Example
14 pages
K Means Clustering
No ratings yet
K Means Clustering
7 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
32 pages
K-means Clustering Lab Manual
No ratings yet
K-means Clustering Lab Manual
11 pages
Unit 5 DWDM
No ratings yet
Unit 5 DWDM
44 pages
K-Means Clustering in Machine Learning
No ratings yet
K-Means Clustering in Machine Learning
11 pages
Unsupervised Learning: Clustering Methods
No ratings yet
Unsupervised Learning: Clustering Methods
51 pages
K-means Clustering Implementation Guide
No ratings yet
K-means Clustering Implementation Guide
27 pages
K-Means Clustering Algorithm - L4 (A)
No ratings yet
K-Means Clustering Algorithm - L4 (A)
9 pages
Understanding K-means Clustering
No ratings yet
Understanding K-means Clustering
29 pages
K-Means Clustering and Elbow Method
No ratings yet
K-Means Clustering and Elbow Method
22 pages
Clustering Techniques in Machine Learning
No ratings yet
Clustering Techniques in Machine Learning
33 pages
K-Means Clustering Experiment Guide
No ratings yet
K-Means Clustering Experiment Guide
7 pages
K Means Clustering
No ratings yet
K Means Clustering
12 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
8 pages
K-Means Clustering Assignment Guide
No ratings yet
K-Means Clustering Assignment Guide
8 pages
Understanding Clustering Techniques
No ratings yet
Understanding Clustering Techniques
23 pages
Unsupervised Learning: K-means & Apriori
No ratings yet
Unsupervised Learning: K-means & Apriori
73 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
9 pages
K-Means Clustering in Unsupervised Learning
No ratings yet
K-Means Clustering in Unsupervised Learning
17 pages
K-Means Clustering Overview and Applications
No ratings yet
K-Means Clustering Overview and Applications
8 pages
K-Means and K-Medoids Clustering Guide
No ratings yet
K-Means and K-Medoids Clustering Guide
24 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
45 pages
K-Means Clustering Overview
No ratings yet
K-Means Clustering Overview
19 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
36 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
76 pages
Understanding Clustering Techniques
No ratings yet
Understanding Clustering Techniques
35 pages
Types and Applications of Clustering
No ratings yet
Types and Applications of Clustering
84 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
84 pages
K-means Clustering and MapReduce Guide
No ratings yet
K-means Clustering and MapReduce Guide
26 pages
Unit 4
No ratings yet
Unit 4
124 pages
MSDA 3050 Lecture8 S24
No ratings yet
MSDA 3050 Lecture8 S24
27 pages
K-Means Clustering Explained
No ratings yet
K-Means Clustering Explained
21 pages
Unsupervised Learning with K-Means
No ratings yet
Unsupervised Learning with K-Means
27 pages
Understanding Clustering Techniques in ML
No ratings yet
Understanding Clustering Techniques in ML
35 pages
Cluster Analysis Methods Explained
No ratings yet
Cluster Analysis Methods Explained
45 pages
K-means Clustering Numerical Example
No ratings yet
K-means Clustering Numerical Example
7 pages
Clustering Techniques in Data Mining
No ratings yet
Clustering Techniques in Data Mining
33 pages
K-means Clustering Experiment Guide
No ratings yet
K-means Clustering Experiment Guide
6 pages
Module 3 - Machine Learning Notes For 3rd Year
No ratings yet
Module 3 - Machine Learning Notes For 3rd Year
187 pages
ML Notes Imp
No ratings yet
ML Notes Imp
74 pages
Clustering and Ensemble Methods Overview
No ratings yet
Clustering and Ensemble Methods Overview
28 pages
Cluster Analysis and K-means Overview
No ratings yet
Cluster Analysis and K-means Overview
48 pages
Fundamentals of Machine Learning Overview
No ratings yet
Fundamentals of Machine Learning Overview
64 pages
Understanding Clustering Techniques
No ratings yet
Understanding Clustering Techniques
28 pages
Procedure Ex03cc
No ratings yet
Procedure Ex03cc
8 pages
Exp 1 Cover
No ratings yet
Exp 1 Cover
1 page
Dyna86 Kit: 16-Bit Addition Lab Guide
No ratings yet
Dyna86 Kit: 16-Bit Addition Lab Guide
1 page
8086 String Instructions Lab Report
No ratings yet
8086 String Instructions Lab Report
1 page
To Study and Implement Platform As A Service Using AWS Elastic Beanstalk
No ratings yet
To Study and Implement Platform As A Service Using AWS Elastic Beanstalk
14 pages
8086 Assembly Program for Even Count
No ratings yet
8086 Assembly Program for Even Count
1 page
8086 Assembly Language Lab Programs
No ratings yet
8086 Assembly Language Lab Programs
1 page
Mumbai University B.E. Sem I Results 2025
No ratings yet
Mumbai University B.E. Sem I Results 2025
135 pages
Principal's Cabin Relocation Notice
No ratings yet
Principal's Cabin Relocation Notice
1 page
Implementing Classifiers in Data Mining
No ratings yet
Implementing Classifiers in Data Mining
7 pages
K-means Clustering Overview and Techniques
No ratings yet
K-means Clustering Overview and Techniques
3 pages
Musculoskeletal Imaging Handbook A Guide To Primary Practitioners 1st Edition Lynn N. Mckinnis Available Full Chapters
100% (1)
Musculoskeletal Imaging Handbook A Guide To Primary Practitioners 1st Edition Lynn N. Mckinnis Available Full Chapters
79 pages
Algorithm Problem Solving Guide
No ratings yet
Algorithm Problem Solving Guide
11 pages
Excel Data Analysis Lab Manual
No ratings yet
Excel Data Analysis Lab Manual
23 pages
Using Consent API with Data Cloud
No ratings yet
Using Consent API with Data Cloud
8 pages
Eil Telecom-Cctv
No ratings yet
Eil Telecom-Cctv
22 pages
Types of Network Communication Explained
No ratings yet
Types of Network Communication Explained
20 pages
High School Health Electives Guide
No ratings yet
High School Health Electives Guide
55 pages
Online Cake Order System Overview
No ratings yet
Online Cake Order System Overview
41 pages
Method Overriding and Data Hiding in Python
No ratings yet
Method Overriding and Data Hiding in Python
4 pages
IBM Cognos for Predictive Insights
No ratings yet
IBM Cognos for Predictive Insights
14 pages
SUMAIT University Academic Job Openings
No ratings yet
SUMAIT University Academic Job Openings
7 pages
Planning of Electric Vehicle Charging Infrastructure: Dharmakeerthi C.H., Mithulananthan N. and Saha T.K
No ratings yet
Planning of Electric Vehicle Charging Infrastructure: Dharmakeerthi C.H., Mithulananthan N. and Saha T.K
5 pages
Hadoop Single Node Setup Guide
No ratings yet
Hadoop Single Node Setup Guide
61 pages
Updating Firmware for MAG-200/250
No ratings yet
Updating Firmware for MAG-200/250
8 pages
Computer Organization Lab Manual KCS-352
No ratings yet
Computer Organization Lab Manual KCS-352
55 pages
Understanding Unit Testing Concepts
No ratings yet
Understanding Unit Testing Concepts
57 pages
Understanding Redux and useContext
No ratings yet
Understanding Redux and useContext
9 pages
Build Android Apps Without Coding PDF
100% (1)
Build Android Apps Without Coding PDF
54 pages
Software Maintenance Overview and Models
No ratings yet
Software Maintenance Overview and Models
23 pages
Module 3
No ratings yet
Module 3
4 pages
Complete HTML Cheat Sheet 2024
100% (1)
Complete HTML Cheat Sheet 2024
16 pages
Old Glory Flag Fundraiser Order Form
No ratings yet
Old Glory Flag Fundraiser Order Form
1 page
BytePlus MediaLive Oct2023 Formatted
No ratings yet
BytePlus MediaLive Oct2023 Formatted
34 pages
Graphing Linear Inequalities & Programming
No ratings yet
Graphing Linear Inequalities & Programming
59 pages
Year 9 3D Animation Learner Booklet
No ratings yet
Year 9 3D Animation Learner Booklet
15 pages
Facade Lighting Tender for Mettupalayam
No ratings yet
Facade Lighting Tender for Mettupalayam
10 pages
A Finite Element Approach For Locating The Center of Resistance of Maxillary Teeth
No ratings yet
A Finite Element Approach For Locating The Center of Resistance of Maxillary Teeth
12 pages
Application Layer Protocols Overview
No ratings yet
Application Layer Protocols Overview
23 pages
Vidyanjali Volunteer Registration Guide
No ratings yet
Vidyanjali Volunteer Registration Guide
27 pages
Health Tracker System Project Report
No ratings yet
Health Tracker System Project Report
28 pages

Clustering Techniques in Data Mining Lab

Uploaded by

Clustering Techniques in Data Mining Lab

Uploaded by

Hope Foundation’s

Finolex Academy of Management and Technology, Ratnagiri

Department of Computer Science and Engineering (AIML)

Class TE CSE Semester –VI (CBCGS) Academic year: 2024-25

Name of Student Zain Munawar Solkar QUIZ Score :

1. Lab objectives applicable:

1 Technical Understanding (Assessment may be done based on Q & A or any 6

Department of Computer Science and Engineering

Q.1) Implement k means clustering to form 2 clusters.

Cluster K1: {78, 43, 56, 90, 88, 60}

Cluster K1: {78, 56, 90, 88, 60}

Cluster K1: {78, 56, 90, 88, 60}

No changes in the Clusters.

Q.2) Apply k means clustering to form 2 clusters.

Object Attribute1 (X) Attribute 2 (Y)

Department of Computer Science and Engineering

Object Coordinates Distance to M1 (4,3) Distance to M2 (1,1) Assigned

K1: {MedicineC (4, 3), MedicineD (5, 4)}

Object Coordinates Distance to M1 (4.5,3.5) Distance to M2 (1.5,1) Assigned

K1: {MedicineC (4, 3), MedicineD (5, 4)}

No changes in the Clusters

Department of Computer Science and Engineering

Department of Computer Science and Engineering

Department of Computer Science and Engineering

Department of Computer Science and Engineering

Department of Computer Science and Engineering

1. Applications of the Studied Technique in Industry

Department of Computer Science and Engineering

You might also like