Regression
2. Multiple Linear Regression
• Regression is a statistical method used to study the
relationship between a dependent variable (Y) and one or • Multiple Regression is an extension of Simple Regression . it
more independent variables (X). uses two or more independent variables (X₁, X₂, …, Xn) to
predict a single dependent variable (Y).
It helps us to predict the value of Y based on X.
• Equation:
Regression Line: A regression line is a best-fit line that • Y=c +m1X1+m2X2+…+mnXn
shows the relationship between an independent variable
(X) and a dependent variable (Y). It represents the • Where,
predicted values of Y for given values of X. • Y = Dependent variable
• c = Intercept
• m1,m2,…,mn = Coefficients (slopes for each independent
1. Simple Linear Regression variable)
• Simple Linear Regression shows how one independent • X1,X2,…,Xn= Independent variables
variable (X) affects the dependent variable (Y). • Applications
• Equation: Y=mX+c • 1. Predicting crop yield based on rainfall, fertilizer,
temperature
• Where, • 2. Predicting house prices using area, rooms, and location
• Y= Dependent variable (output) • 3. Predicting student performance using study time and
attendance
• X = Independent variable (input)
• m = Slope of the line (regression coefficient)
• c = Intercept
Polynomial regression
Polynomial Regression is an advanced form of Linear
Regression where the relationship between the General Equation:
independent variable (X) and the dependent
variable (Y) is curved (non-linear) rather than a
straight line. Where,
Y = Dependent variable
• Even though it models a curve, it is still a linear X = Independent variable
model in terms of coefficients. c,m1,m2,...,mn= Coefficients to be estimated
• Polynomial Regression is a technique that models n = Degree of the polynomial
the relationship between the dependent variable
Y and the independent variable X as an nth-
degree polynomial.
Ridge regression
• Ridge Regression is a type of linear regression
that adds a small penalty to the model to
prevent the coefficients from becoming too
large.
• It helps to reduce overfitting and handle
multicollinearity.
• Regression line:
Y=m1X1+m2X2+m3X3+⋯+c