LOESS Curve Fitting for Scatter Plots

The document discusses scatter plots and the need for smoothing to accurately assess relationships between variables, particularly in the presence of noisy data. It outlines parametric and nonparametric curve fitting methods, focusing on LOESS, which performs local regression to fit smooth curves. Key concepts include the impact of bandwidth on smoothing and the degree of polynomial used in fitting, as well as considerations for high-dimensional data and feature selection/extraction methods.

Uploaded by

deamonking1234king

Visualization

Siddharth R
Scatter Plot

Need for smoothing
● A scatter plot enables visual assessment of relationships or
functional dependencies between variables
● However, this is often difficult in practice because of
noisy data values, sparse data points, and weak
interrelationships
● General pattern vs. precise nature
● Solution: fit a smooth curve to the points in the
scatter plot
● Two approaches:
○ Parametric fitting
○ Nonparametric fitting
Parametric Curve Fitting
● Linear regression (straight-line fitting)
○ House price vs. house size
● Polynomial regression (curved-line fitting)
○ Growth of a tree over time
● Exponential fitting
○ Modeling the spread of a virus over time
● Sinusoidal curve fitting
○ Modeling temperature over time
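As a rough illustration of parametric fitting, the sketch below fits a straight line and a quadratic to the same points with NumPy's `polyfit` and compares their residual sums of squares. The data are synthetic and the variable names are made up for this example; they do not come from the slides.

```python
import numpy as np

rng = np.random.default_rng(0)
x = np.linspace(0.0, 10.0, 50)
# Synthetic curved data (an assumption for illustration only).
y = 2.0 + 0.5 * x**2 + rng.normal(scale=1.0, size=x.size)

# Parametric fitting: pick one global functional form, then estimate its coefficients.
line_coefs = np.polyfit(x, y, deg=1)   # straight line
quad_coefs = np.polyfit(x, y, deg=2)   # quadratic polynomial

# Residual sum of squares for each global model.
line_sse = float(np.sum((y - np.polyval(line_coefs, x)) ** 2))
quad_sse = float(np.sum((y - np.polyval(quad_coefs, x)) ** 2))
print(f"line SSE = {line_sse:.1f}, quadratic SSE = {quad_sse:.1f}")
```

The fit is only as good as the chosen form: if the true relationship is neither linear nor quadratic, both models can mislead, which is the motivation for nonparametric fitting.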
Locally estimated scatterplot smoothing (LOESS)
● LOESS is a non-parametric method, i.e. it doesn’t assume a specific global
functional form for the data
● LOESS is based on local regression, meaning that the regression is
performed locally for each point in the dataset using a subset of the data.
● For each data point, a small neighborhood of surrounding points is selected,
and a weighted linear or polynomial regression is performed on those points.
● LOESS is also called LOWESS, which stands for locally weighted scatterplot
smoothing.
What relationship exists?
● The figure shows 1992 state voter turnout rates (vertical axis)
plotted against the percentage of high school graduates in each
state's population (horizontal axis).
What is the conclusion now?
● What does the relationship look like?
● A linear model would provide a misleading depiction of the
relationship
● LOESS helps to avoid an inaccurate representation of the data.
● Two key parameters
○ Smoothing parameter
○ Degree of polynomial
Steps for LOESS
● Assume that the data consist of n observations on two variables, X and Y.
● The data are displayed in a scatter plot with X on the horizontal axis and Y on the vertical axis.
The points are (xi, yi), where i ranges from 1 to n.
● Select a series of m locations that are equally spaced across the range of X.
● Perform a series of m weighted regression analyses, one at each location.
● These regressions are “local” in the sense that each one uses only the subset of observations
closest to the location being fitted.
● The observations included in each local regression are weighted inversely according to their
distance from that location.
● After all of the local regressions are completed, the resulting fitted points are plotted in the
scatter plot, superimposed over the data points.
● How is the neighborhood subset of points selected?
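The steps above can be sketched in a few lines of NumPy. This is a minimal illustration, not a reference implementation. The tricube weight function, the rule `round(span * n)` for the window size, and the names `tricube` and `loess_curve` are assumptions made for this example; the slides only say that weights decrease with distance.

```python
import numpy as np

def tricube(u):
    """Tricube weight: 1 at distance 0, falling smoothly to 0 at distance 1."""
    u = np.clip(np.abs(u), 0.0, 1.0)
    return (1.0 - u**3) ** 3

def loess_curve(x, y, span=0.5, degree=1, m=50):
    """Evaluate a local polynomial fit at m equally spaced locations."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = x.size
    # Points per local window (assumed rounding rule), kept large enough to fit.
    k = min(n, max(degree + 2, int(round(span * n))))
    grid = np.linspace(x.min(), x.max(), m)
    fitted = np.empty(m)
    for j, x0 in enumerate(grid):
        dist = np.abs(x - x0)
        idx = np.argsort(dist)[:k]          # the k nearest observations
        h = dist[idx].max()                 # half-width of this window
        w = tricube(dist[idx] / h)          # closer points get larger weights
        # Weighted least squares on powers of (x - x0);
        # the intercept is then the fitted value at x0.
        A = np.vander(x[idx] - x0, degree + 1, increasing=True)
        sw = np.sqrt(w)
        beta, *_ = np.linalg.lstsq(A * sw[:, None], y[idx] * sw, rcond=None)
        fitted[j] = beta[0]
    return grid, fitted

# Sanity check on exactly linear data: a local linear fit should reproduce the line.
x = np.linspace(0.0, 10.0, 40)
y = 3.0 * x + 1.0
grid, fitted = loess_curve(x, y, span=0.3, degree=1, m=50)
```

Plotting `fitted` against `grid` on top of the scatter plot gives the smooth curve. The loop runs m local regressions, each of which sorts all n distances, which is one way to start thinking about the cost of the method.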
Bandwidth (or Span)
● Bandwidth (often called the smoothing parameter or span)
controls the size of the neighborhood used for local regression.
● A smaller span means a smaller neighborhood and less
smoothing, while a larger span means more smoothing.
● The value ranges from 0 to 1
● Example:
○ A bandwidth of 0.2 uses the nearest 20% of the points for each local regression.
○ A bandwidth of 0.8 uses the nearest 80% of the points.
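To make the span concrete, the sketch below picks the neighborhood for a single fitting location. The helper name `nearest_neighborhood` and the rounding rule `round(span * n)` are assumptions for illustration; software packages differ in how they round.

```python
import numpy as np

def nearest_neighborhood(x, x0, span):
    """Return sorted indices of the round(span * n) points closest to x0."""
    k = max(2, int(round(span * x.size)))   # rounding rule is an assumption
    nearest = np.argsort(np.abs(x - x0))[:k]
    return np.sort(nearest)

x = np.arange(10.0)                          # toy data: 0, 1, ..., 9
window = nearest_neighborhood(x, x0=4.2, span=0.3)
print(window)                                # indices 3, 4 and 5
```

With 10 points and a span of 0.3, three points enter the local regression, and they are chosen by distance from the fitting location rather than by a fixed interval width.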
Impact of Bandwidth/Span
● With a larger span, idiosyncratic observations tend to
cancel each other out
● A larger value means that relatively fewer
observations change when
moving from one fitting window to
the next.
● Smaller values are highly sensitive
to noise
● Think of the LOESS curve as a string:
larger values pull it tighter,
producing a straighter curve.
● Trade-off between overfitting and
underfitting (lack of fit)
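The point about neighboring windows can be checked numerically. The sketch below measures, for two spans, the average share of observations that two adjacent fitting windows have in common. The data, the number of fitting locations, and the function names are assumptions chosen for illustration.

```python
import numpy as np

def window_indices(x, x0, span):
    """Set of indices of the round(span * n) observations nearest to x0."""
    k = int(round(span * x.size))
    return set(np.argsort(np.abs(x - x0))[:k].tolist())

def mean_overlap(x, span, m=20):
    """Average fraction of points shared by windows at adjacent fitting locations."""
    grid = np.linspace(x.min(), x.max(), m)
    windows = [window_indices(x, x0, span) for x0 in grid]
    shared = [len(a & b) / len(a) for a, b in zip(windows, windows[1:])]
    return float(np.mean(shared))

x = np.linspace(0.0, 1.0, 100)               # 100 equally spaced observations
overlap_small = mean_overlap(x, span=0.2)
overlap_large = mean_overlap(x, span=0.8)
print(f"span 0.2: {overlap_small:.2f}, span 0.8: {overlap_large:.2f}")
```

Because windows with a large span share most of their points, consecutive local fits are nearly identical and the curve changes slowly. With a small span each step swaps in a larger share of new points, so the curve can follow noise.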


Degree of Polynomial
● If degree of polynomial is set to 1, then linear equations are fit within each of the windows.
● If degree of polynomial is set to 2, quadratic equations are used.
● The figure shows data on public preferences between two candidates, Walter Mondale and Gary
Hart, over the course of the 1984 presidential primary campaign.
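The effect of the degree is easiest to see at a peak. The sketch below fits a local line and a local quadratic at the maximum of a noise-free curve. The tricube weights, the test curve, and the helper `local_fit_at` are assumptions for illustration, not taken from the slides.

```python
import numpy as np

def local_fit_at(x, y, x0, span, degree):
    """Fitted value at x0 from one weighted local polynomial regression."""
    dist = np.abs(x - x0)
    k = int(round(span * x.size))
    idx = np.argsort(dist)[:k]
    h = dist[idx].max()
    w = (1.0 - np.clip(dist[idx] / h, 0.0, 1.0) ** 3) ** 3   # tricube weights
    # np.polyfit applies its weights to the unsquared residuals,
    # so the square root of the regression weights is passed.
    coefs = np.polyfit(x[idx] - x0, y[idx], deg=degree, w=np.sqrt(w))
    return float(coefs[-1])                  # constant term = fit at x0

x = np.linspace(0.0, 10.0, 101)
y = -(x - 5.0) ** 2                          # maximum value 0 at x = 5
peak_linear = local_fit_at(x, y, 5.0, span=0.4, degree=1)
peak_quadratic = local_fit_at(x, y, 5.0, span=0.4, degree=2)
print(f"local linear: {peak_linear:.3f}, local quadratic: {peak_quadratic:.3f}")
```

The local line averages over points on both sides of the peak and therefore sits below it, while the local quadratic can bend and recovers the maximum. This is the reason for preferring degree 2 when the data have local minima or maxima.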
Key points to note
● If the point cloud conforms to a generally monotonic pattern (either increasing
or decreasing), then the degree should be set to 1 for locally linear fitting.
● If the data exhibit a nonmonotonic pattern, with local minima and/or
maxima, then the degree should be set to 2 for locally quadratic fitting.
● The residuals are the differences between the actual data points and the
fitted values. They measure how well the LOESS model fits the data.
● A plain LOESS fit is sensitive to outliers, so after an initial fit the residuals are
computed, and points with large residuals are down-weighted in subsequent
iterations to make the fit more resistant to outliers.
● What do you think of the computational complexity of LOESS?
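The outlier down-weighting described above can be sketched as one extra pass. The bisquare weight with a cutoff of six times the median absolute residual is the scheme commonly associated with robust LOWESS, but it is an assumption here, since the slides do not name a weight function. The data and the helper names `smooth` and `bisquare` are also made up for this example.

```python
import numpy as np

def smooth(x, y, span, robust_w):
    """Local linear fit at every x[i]; robust_w multiplies the distance weights."""
    n = x.size
    k = int(round(span * n))
    fitted = np.empty(n)
    for i in range(n):
        dist = np.abs(x - x[i])
        idx = np.argsort(dist)[:k]
        h = dist[idx].max()
        w = (1.0 - np.clip(dist[idx] / h, 0.0, 1.0) ** 3) ** 3 * robust_w[idx]
        A = np.column_stack([np.ones(k), x[idx] - x[i]])
        sw = np.sqrt(w)
        beta, *_ = np.linalg.lstsq(A * sw[:, None], y[idx] * sw, rcond=None)
        fitted[i] = beta[0]
    return fitted

def bisquare(residuals):
    """Weights near 1 for small residuals, exactly 0 beyond 6 x median |residual|."""
    s = np.median(np.abs(residuals))
    u = np.clip(np.abs(residuals) / (6.0 * s), 0.0, 1.0)
    return (1.0 - u**2) ** 2

rng = np.random.default_rng(1)
x = np.linspace(0.0, 10.0, 50)
truth = 2.0 * x
y = truth + rng.normal(scale=1.0, size=x.size)
y[25] += 30.0                                # one gross outlier

plain = smooth(x, y, span=0.3, robust_w=np.ones(x.size))    # initial fit
robust_w = bisquare(y - plain)                               # down-weight large residuals
robust = smooth(x, y, span=0.3, robust_w=robust_w)           # refit with new weights
```

Only one robustness iteration is shown; in practice the residual and refit steps are usually repeated a small number of times.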
High Dimensional Data
● Curse of dimensionality
● Dimensionality reduction approaches
○ Feature selection
○ Feature extraction
● Feature Selection
○ Filter method
○ Wrapper method
○ Embedded method
● Feature Extraction
○ PCA
○ Random projection
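As a small taste of the two feature extraction methods listed, the sketch below reduces synthetic 5-dimensional data to 2 dimensions, once with PCA (via the singular value decomposition) and once with a Gaussian random projection. The data and all names are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
# Synthetic data: 200 samples, 5 features, driven mostly by 2 latent directions.
latent = rng.normal(size=(200, 2)) * np.array([5.0, 2.0])
mixing = rng.normal(size=(2, 5))
X = latent @ mixing + rng.normal(scale=0.1, size=(200, 5))

# PCA: center the data, then project onto the top right-singular vectors.
Xc = X - X.mean(axis=0)
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
explained = S**2 / np.sum(S**2)              # share of variance per component
Z_pca = Xc @ Vt[:2].T                        # 2-D representation

# Random projection: multiply by a random Gaussian matrix, with no fitting to the data.
R = rng.normal(size=(5, 2)) / np.sqrt(2)
Z_rp = X @ R

print("variance explained by first two components:", float(explained[:2].sum()))
```

PCA chooses its directions from the data, whereas the random projection does not look at the data at all, which makes it cheap but less tailored.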
Next Class
● PCA
● t-SNE
Thank You !!!
