Principal Component Analysis Overview

Dimensionality reduction techniques like principal component analysis (PCA) aim to reduce the number of variables in a dataset while preserving as much information as possible. PCA transforms the data to a new coordinate system such that the greatest variance by any projection of the data comes to lie on the first coordinate (called the first principal component), the second greatest variance on the second coordinate, and so on. It is commonly used to speed up analysis and visualize high-dimensional data.

Uploaded by

Atul Patil

Dimensionality Reduction

Principal Component Analysis

Dimensionality Reduction
• The complexity of any classifier or regressor depends on the number of
inputs.
• This determines both the time and space complexity and the necessary
number of training examples to train such a classifier or regressor.
• In many learning problems, the datasets have a large number of variables.
Sometimes, the number of variables exceeds the number of
observations.
• Such situations have arisen in many scientific fields, for example
image processing.
• Statistical and machine learning methods have difficulty
dealing with such high-dimensional data.
• Normally the number of input variables is reduced before machine
learning algorithms can be successfully applied.
Dimensionality Reduction

• In statistical and machine learning, dimensionality reduction
or dimension reduction is the process of reducing the number
of variables under consideration by obtaining a smaller set of
principal variables.
Dimensionality Reduction
• Dimensionality reduction may be implemented in two ways.
• Feature selection:
– In feature selection, we are interested in finding k of the total of n
features that give us the most information and we discard the other
(n−k) dimensions.
• Feature extraction:
– In feature extraction, we are interested in finding a new set of k features
that are combinations of the original n features.
– These methods may be supervised or unsupervised depending on
whether or not they use the output information.
– The best known and most widely used feature extraction methods are
Principal Component Analysis (PCA) and Linear Discriminant Analysis
(LDA). Both are linear projection methods; PCA is unsupervised and
LDA is supervised.
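To make the distinction concrete, here is a minimal Python sketch. The sample rows, the kept column indices and the weight matrix W are all made up for illustration; a real method would choose them from the data.

```python
# Toy data: 3 samples, n = 4 original features (made-up values).
X = [
    [1.0, 2.0, 3.0, 4.0],
    [2.0, 0.0, 1.0, 3.0],
    [0.0, 1.0, 2.0, 5.0],
]

def select_features(rows, keep):
    """Feature selection: keep k of the n original columns unchanged."""
    return [[row[j] for j in keep] for row in rows]

def extract_features(rows, weights):
    """Feature extraction: each new feature is a linear combination
    of all n original features (one weight vector per new feature)."""
    return [[sum(w * x for w, x in zip(w_vec, row)) for w_vec in weights]
            for row in rows]

# Hypothetical choices, for illustration only.
keep = [0, 3]                   # k = 2 selected columns; the other n - k are discarded
W = [[0.5, 0.5, 0.0, 0.0],      # new feature 1 mixes columns 0 and 1
     [0.0, 0.0, 0.5, 0.5]]      # new feature 2 mixes columns 2 and 3

selected = select_features(X, keep)
extracted = extract_features(X, W)
print(selected)
print(extracted)
```

In both cases the result has k = 2 columns, but only the selected columns are original measurements; the extracted ones are new variables.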
Measures of error
• In regression problems, we may use the Mean Squared Error (MSE) or the
Root Mean Squared Error (RMSE) as the measure of error.
• MSE is the sum, over all the data points, of the square of the difference
between the predicted and actual target variables, divided by the number
of data points.
• In classification problems, we may use the misclassification rate as a
measure of the error. This is defined as follows:
misclassification rate = (no. of misclassified examples) / (total no. of examples)
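The three error measures above can be sketched in a few lines of Python. The function names and the sample values are illustrative assumptions, not part of the original slides.

```python
import math

def mean_squared_error(actual, predicted):
    """Sum of squared differences divided by the number of data points."""
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)

def root_mean_squared_error(actual, predicted):
    """Square root of the MSE, in the same units as the target variable."""
    return math.sqrt(mean_squared_error(actual, predicted))

def misclassification_rate(actual, predicted):
    """Fraction of examples whose predicted label differs from the true label."""
    wrong = sum(1 for a, p in zip(actual, predicted) if a != p)
    return wrong / len(actual)

# Made-up regression targets and predictions.
y_true = [3.0, 5.0, 2.0, 7.0]
y_pred = [2.0, 5.0, 4.0, 7.0]
mse = mean_squared_error(y_true, y_pred)        # (1 + 0 + 4 + 0) / 4
rmse = root_mean_squared_error(y_true, y_pred)

# Made-up class labels: 2 of the 5 predictions are wrong.
labels_true = ["a", "b", "a", "b", "a"]
labels_pred = ["a", "a", "a", "b", "b"]
rate = misclassification_rate(labels_true, labels_pred)

print(mse, rmse, rate)
```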
Why is dimensionality reduction useful?
• In most learning algorithms, the complexity depends on the number of
input dimensions, d, as well as on the size of the data sample, N. To
reduce memory and computation, we are interested in reducing the
dimensionality of the problem.
• Decreasing d also decreases the complexity of the inference algorithm
during testing.
• When an input is found to be unnecessary, we save the cost of
extracting it.
• Simpler models are more robust on small datasets. Simpler models have
less variance, that is, they vary less depending on the particulars of a
sample, including noise and outliers.
• When data can be explained with fewer features, we get a better idea
about the process that underlies the data, which allows knowledge
extraction.
• When data can be represented in a few dimensions without loss of
information, it can be plotted and analyzed visually for structure and
outliers.
• The curse of dimensionality refers to the exponential growth of the
volume of the input space as the number of dimensions increases, so that
a fixed amount of data becomes increasingly sparse. As the number of
dimensions of a dataset increases, it becomes more and more difficult to
process it. Dimensionality reduction is one remedy for the curse of
dimensionality.
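As a rough illustration of the exponential growth mentioned above: if each input variable is divided into a fixed number of bins, the number of cells needed to cover the input space is that number raised to the power d. The choice of 10 bins per axis is an arbitrary assumption.

```python
def grid_cells(bins_per_axis, d):
    """Number of cells when each of d axes is split into bins_per_axis bins."""
    return bins_per_axis ** d

# With 10 bins per axis, one sample per cell already needs 10**d samples.
for d in (1, 2, 3, 10):
    print(d, grid_cells(10, d))
```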
Subset selection
• In machine learning, subset selection (sometimes also called feature
selection, variable selection, or attribute selection) is the process of
selecting a subset of relevant features (variables, predictors) for use in
model construction.
• Feature selection techniques are used for four reasons:
– simplification of models to make them easier to interpret by
researchers/users
– shorter training times
– avoidance of the curse of dimensionality
– enhanced generalization by reducing overfitting
Principal component analysis
• Principal Component Analysis, or PCA, is a dimensionality-reduction
method that is often used to reduce the dimensionality of large data sets,
by transforming a large set of variables into a smaller one that still
contains most of the information in the large set.
• Smaller data sets are easier to explore and visualize, and machine
learning algorithms can analyze them more easily and quickly
without extraneous variables to process.
• So to sum up, the idea of PCA is simple: reduce the number of variables
of a data set, while preserving as much information as possible.
STEP BY STEP EXPLANATION OF PCA
• STEP 1: STANDARDIZATION
• The aim of this step is to standardize the range of the continuous initial
variables so that each one of them contributes equally to the analysis.
• If there are large differences between the ranges of the initial variables,
those variables with larger ranges will dominate over those with small
ranges (for example, a variable that ranges between 0 and 100 will dominate
over a variable that ranges between 0 and 1), which will lead to biased
results. Transforming the data to comparable scales prevents this
problem.
• Mathematically, each value is standardized by subtracting the variable's
mean and dividing by its standard deviation:
z = (value − mean) / standard deviation
• Once the standardization is done, all the variables will be transformed to
the same scale.
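Step 1 can be sketched with NumPy as follows. The data values are made up, and the sketch assumes no column is constant (a constant column has zero standard deviation and cannot be scaled this way).

```python
import numpy as np

# Made-up data: 5 samples, 2 variables on very different scales.
X = np.array([
    [10.0, 0.1],
    [40.0, 0.4],
    [60.0, 0.5],
    [80.0, 0.9],
    [100.0, 0.6],
])

def standardize(X):
    """Z-score each column: subtract its mean, divide by its standard deviation.

    Assumes every column has non-zero standard deviation.
    """
    mean = X.mean(axis=0)
    std = X.std(axis=0)   # population standard deviation (ddof=0)
    return (X - mean) / std

Z = standardize(X)
print(Z.mean(axis=0))  # approximately 0 for every column
print(Z.std(axis=0))   # approximately 1 for every column
```

After this step, the variable that originally ranged up to 100 no longer dominates the one that ranged up to 1.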
• STEP 2: COVARIANCE MATRIX COMPUTATION
• The aim of this step is to understand how the variables of the input data
set are varying from the mean with respect to each other, or in other
words, to see if there is any relationship between them.
• Sometimes variables are highly correlated in such a way that they
contain redundant information.
• So, in order to identify these correlations, we compute the covariance
matrix.
• What do the covariances that we have as entries of the matrix tell us
about the correlations between the variables?
• It is the sign of the covariance that matters:
• if positive: the two variables increase or decrease together
(correlated)
• if negative: one increases when the other decreases (inversely
correlated)
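A minimal NumPy sketch of Step 2 follows. The data are made up so that the second column rises with the first and the third falls as the first rises; raw (unstandardized) values are used here only to keep the example short.

```python
import numpy as np

# Made-up data: rows are samples, columns are three variables.
X = np.array([
    [1.0, 2.0, 9.0],
    [2.0, 4.1, 7.0],
    [3.0, 6.2, 6.5],
    [4.0, 7.9, 3.0],
    [5.0, 10.1, 1.0],
])

def covariance_matrix(X):
    """Sample covariance matrix of the columns of X."""
    centered = X - X.mean(axis=0)                 # deviations from the column means
    return centered.T @ centered / (X.shape[0] - 1)

C = covariance_matrix(X)
print(C)
print("cov(var0, var1) > 0:", C[0, 1] > 0)  # the two variables move together
print("cov(var0, var2) < 0:", C[0, 2] < 0)  # one rises while the other falls
```

The matrix is symmetric, and its diagonal entries are the variances of the individual variables.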
• STEP 3: COMPUTE THE EIGENVECTORS AND EIGENVALUES OF THE
COVARIANCE MATRIX TO IDENTIFY THE PRINCIPAL COMPONENTS
• Eigenvectors and eigenvalues are the linear algebra concepts that we need
to compute from the covariance matrix in order to determine
the principal components of the data.
• Principal components are new variables that are constructed as linear
combinations or mixtures of the initial variables.
• These combinations are done in such a way that the new variables (i.e.,
principal components) are uncorrelated and most of the information
within the initial variables is squeezed or compressed into the first
components.
• So, the idea is that 10-dimensional data gives you 10 principal components,
but PCA tries to put the maximum possible information in the first
component, then the maximum remaining information in the second, and so
on. A scree plot, which shows the share of variance explained by each
component, makes this visible.
• Organizing information in principal components this way allows you to
reduce dimensionality without losing much information: you discard the
components with low information and treat the remaining components as
your new variables.
• Geometrically speaking, principal components represent the directions of
the data that explain a maximal amount of variance, that is to say, the
lines that capture most information of the data.
• The relationship between variance and information here is that the larger
the variance carried by a line, the larger the dispersion of the data points
along it, and the larger the dispersion along a line, the more
information it carries.
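Steps 1 to 3 can be put together in a short NumPy sketch. The data are randomly generated for illustration (the third variable is built to be almost a combination of the first two), and keeping k = 2 components is an arbitrary choice.

```python
import numpy as np

rng = np.random.default_rng(0)

# Made-up data: 200 samples, 3 variables; the third is nearly redundant.
a = rng.normal(size=200)
b = rng.normal(size=200)
X = np.column_stack([a, b, a + b + 0.05 * rng.normal(size=200)])

# Steps 1 and 2: standardize, then compute the covariance matrix.
Z = (X - X.mean(axis=0)) / X.std(axis=0)
C = np.cov(Z, rowvar=False)

# Step 3: eigendecomposition. eigh is meant for symmetric matrices and
# returns eigenvalues in ascending order, so we re-sort them descending.
eigenvalues, eigenvectors = np.linalg.eigh(C)
order = np.argsort(eigenvalues)[::-1]
eigenvalues = eigenvalues[order]
eigenvectors = eigenvectors[:, order]   # column i is the i-th principal axis

# Share of total variance carried by each component (what a scree plot shows).
explained_ratio = eigenvalues / eigenvalues.sum()
print(explained_ratio)

# Keep the first k components and discard the rest (k chosen for illustration).
k = 2
components = Z @ eigenvectors[:, :k]
print(components.shape)
```

Because the third variable is almost redundant, the first two components carry nearly all of the variance, and the resulting columns are uncorrelated with each other.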
HOW PCA CONSTRUCTS THE PRINCIPAL COMPONENTS
• As there are as many principal components as there are variables in the
data, principal components are constructed in such a manner that the first
principal component accounts for the largest possible variance in the data
set.
• Let's see how PCA achieves the above-mentioned purpose, using an
animation of a two-dimensional scatter plot.
• Each blue dot on the plot represents a data point given by its x and y
coordinates.
• A line P (red line) is drawn through the center of the dataset, i.e. through
the mean of x and y.
• Every point on the graph is projected onto this line; the projections are
shown as two sets of points, red and green.
• The spread or variance of the data along line P is given by the distance
between the two big red points.
• As line P rotates, the distance between the two red points
changes according to the angle that line P makes with the x-axis.
• The purple lines, which join each point to its projection,
represent the error that arises when we approximate a
point by its projection.
• PCA creates new variables from old ones.
• If the new variables closely approximate the old variables,
then the approximation error should be small.
• The sum of the squared lengths of all purple lines gives the
total error in approximation.
• The angle which minimizes the sum of squared errors also
maximizes the distance between the red points.
• The direction of maximum spread is called the principal axis.
Once we know a principal axis, we subtract the variance along
this principal axis to obtain the remaining variance.
• We apply the same procedure to find the next principal axis
from the residual variance. Apart from being the direction of
maximum remaining variance, the next principal axis must be
orthogonal to the other principal axes.
• Once we have all the principal axes, the dataset is projected
onto these axes. The columns in the projected or transformed
dataset are called principal components.
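The rotating-line picture can be reproduced numerically. The sketch below, on made-up two-dimensional data, sweeps a line through the mean over a grid of angles and records both the spread of the projections (sum of squared projected coordinates, which is proportional to the variance) and the sum of squared projection errors. The grid resolution and the data are assumptions chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)

# Made-up 2-D data, elongated roughly along a diagonal direction.
x = rng.normal(size=300)
y = 0.8 * x + 0.3 * rng.normal(size=300)
points = np.column_stack([x, y])
centered = points - points.mean(axis=0)   # the line passes through the mean

# A line's direction repeats after 180 degrees, so sweep 0..pi.
angles = np.linspace(0.0, np.pi, 1801)
spreads = []
errors = []
for theta in angles:
    direction = np.array([np.cos(theta), np.sin(theta)])   # unit vector along the line
    coords = centered @ direction                          # position of each projection on the line
    residual = centered - np.outer(coords, direction)      # point minus its projection
    spreads.append(np.sum(coords ** 2))
    errors.append(np.sum(residual ** 2))

best_by_spread = angles[np.argmax(spreads)]
best_by_error = angles[np.argmin(errors)]
print(best_by_spread, best_by_error)

# Cross-check against the top eigenvector of the covariance matrix.
_, vecs = np.linalg.eigh(np.cov(centered, rowvar=False))
top = vecs[:, -1]                                   # eigenvector of the largest eigenvalue
eig_angle = np.arctan2(top[1], top[0]) % np.pi      # fold into 0..pi (sign is arbitrary)
print(eig_angle)
```

For each angle, the squared spread and the squared error add up to the same total (Pythagoras), which is why the angle that maximizes one minimizes the other. In two dimensions the second principal axis is simply the direction perpendicular to the first.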
