Understanding Regression Analysis Techniques

Uploaded by

brsamrat65

Regression

• Regression is a supervised learning technique in data mining that
predicts numerical values in a data set, such as temperature or cost.
• Regression techniques are therefore widely used in business settings,
most commonly in marketing, trend analysis, and financial forecasting.
• Statistically, a regression relates a dependent variable to one or
more independent (explanatory) variables.
• A regression model can show whether changes observed in the
dependent variable are associated with changes in one or more of the
explanatory variables.

Ranjit Kaur Walia, Asst Prof., SCA, LPU

For regression analysis to be a successful method,
we need to understand the following terms:

• Dependent Variable: The variable that we are
trying to understand or forecast (the target variable).

• Independent Variables: The factors that influence
the target variable and give us information about
their relationship with it.


Regression Analysis

Examples:
- Height increases with age.
- Purchases of expensive products increase with income.
This statistical method is used across different
industries, such as:
• Financial Industry: Understand trends in stock
prices, forecast prices, and evaluate risks in the
insurance domain.
• Marketing: Understand the effectiveness of marketing
campaigns, and forecast the pricing and sales of a
product.
• Manufacturing: Evaluate the relationships among the
variables that determine engine design, in order to
achieve better performance.
• Medicine: Forecast the effect of different combinations
of medicines, in order to prepare generic medicines for
diseases.

Regression:

• Objective: Regression is used when the target variable
is continuous or numeric. The goal is to predict a
numeric value based on input features and patterns in
the data.
• Examples: Predicting house prices, estimating the age
of a person based on other attributes, forecasting stock
prices, and predicting the temperature based on
historical data.
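
The objective above can be illustrated with a minimal sketch in Python of a one-variable least-squares fit. The data values and the function name fit_simple_linear_regression are hypothetical and chosen only for illustration; they are not taken from the slides.

```python
from statistics import mean

def fit_simple_linear_regression(xs, ys):
    """Return (intercept, slope) of the least-squares line through the points."""
    x_bar, y_bar = mean(xs), mean(ys)
    # Slope = covariance of x and y divided by variance of x.
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    return intercept, slope

# Hypothetical data: house area (sq ft) and price (in $1000s).
areas = [1000, 1500, 2000, 2500]
prices = [150, 200, 250, 300]

intercept, slope = fit_simple_linear_regression(areas, prices)

# Predict a numeric value (the price) for an unseen input (1800 sq ft).
predicted = intercept + slope * 1800
print(intercept, slope, predicted)
```

The fitted line is then used to predict a continuous value for an input that was not in the training data, which is the defining feature of a regression task.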

Let’s break down the output of summary(model).

The first part of the output (Call) shows the formula used for the linear regression model, where Price is the dependent variable
and Area and Bedrooms are the independent variables.
Residuals:
      Min        1Q    Median        3Q       Max
 -15300.6   -9400.3    -720.2    6100.7   17500.2

Residuals are the differences between the observed values and the predicted values from the
model (observed minus predicted). They give an idea of how well the model fits the data:
•Min: The most negative residual (the largest overestimation by the model).
•1Q (First Quartile): The 25th percentile of the residuals.
•Median: The median of the residuals.
•3Q (Third Quartile): The 75th percentile of the residuals.
•Max: The largest positive residual (the largest underestimation by the model).
In this example, the model overestimates some prices by as much as $15,300 and
underestimates others by up to $17,500.
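
As a rough sketch of how such a five-number summary is obtained, the Python snippet below computes residuals as observed minus predicted and then summarizes them. The observed and predicted prices are hypothetical, and the "inclusive" quartile method is an assumption; different tools may use slightly different quartile conventions.

```python
from statistics import median, quantiles

# Hypothetical observed and predicted prices (in $1000s), for illustration only.
observed = [200, 250, 310, 180, 400]
predicted = [210, 245, 300, 200, 380]

# Residual = observed value minus predicted value.
residuals = [o - p for o, p in zip(observed, predicted)]

# Quartiles of the residuals (interpolating between sorted data points).
q1, _, q3 = quantiles(residuals, n=4, method="inclusive")

summary = {
    "Min": min(residuals),
    "1Q": q1,
    "Median": median(residuals),
    "3Q": q3,
    "Max": max(residuals),
}
print(summary)
```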
The coefficients table shows how each predictor is related to the dependent
variable:
•(Intercept): The estimated price when both Area and Bedrooms are zero, which is
$11,852.34. This has little real-life meaning but is required mathematically to build
the regression equation.
•Area: The coefficient for Area is 108.94, meaning that each additional square foot
is associated with an increase of about $108.94 in the predicted house price, keeping
the number of bedrooms constant.
•Bedrooms: The coefficient for Bedrooms is 15,930.76, meaning that one additional
bedroom is associated with an increase of around $15,930.76 in the predicted house
price, keeping the area constant.
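
The coefficients above define the regression equation Price = 11852.34 + 108.94 × Area + 15930.76 × Bedrooms. A minimal Python sketch of using it for prediction follows; the coefficient values are the ones reported above, while the example house (1,500 sq ft, 3 bedrooms) and the function name predict_price are hypothetical.

```python
# Coefficients as reported in the summary discussed above.
INTERCEPT = 11852.34
COEF_AREA = 108.94        # dollars per additional square foot
COEF_BEDROOMS = 15930.76  # dollars per additional bedroom

def predict_price(area_sqft, bedrooms):
    """Evaluate the fitted regression equation for one house."""
    return INTERCEPT + COEF_AREA * area_sqft + COEF_BEDROOMS * bedrooms

# Hypothetical house, not taken from the original data.
base = predict_price(1500, 3)

# Changing one predictor while holding the other constant
# changes the prediction by exactly that predictor's coefficient.
extra_sqft = predict_price(1501, 3) - base
extra_bedroom = predict_price(1500, 4) - base

print(round(base, 2))
print(round(extra_sqft, 2))
print(round(extra_bedroom, 2))
```

This shows what "keeping the other variable constant" means in practice: each coefficient is the change in the predicted price for a one-unit change in its own predictor.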
• Standard Error:
• Std. Error measures the uncertainty in each estimated coefficient. Smaller values
indicate more precise estimates.
• t value:
• t value is the test statistic used to determine whether each coefficient is
significantly different from zero.
• It is calculated as t = Estimate / Std. Error.
• Pr(>|t|):
• Pr(>|t|) is the p-value: the probability of seeing a t value at least this extreme if the
true coefficient were zero. A low p-value (typically < 0.05) suggests the variable is
statistically significant.
• For Area, p=0.00 (to two decimal places), meaning the area is highly significant in
predicting house prices.
• For Bedrooms, p=0.048, meaning bedrooms are also significant but less strongly
than the area, since the value is only just below 0.05.
The significance codes show the levels of significance:
***: Highly significant (p < 0.001)
**: Very significant (p < 0.01)
*: Significant (p < 0.05)
.: Marginally significant (p < 0.1)
Blank: Not significant
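
The t value and the significance codes can be sketched in a few lines of Python. The standard error of 12.5 used here is hypothetical (the slides do not list the standard errors), and the function names are illustrative. The "**" level (p < 0.01) is included because R's summary output uses it as well.

```python
def t_value(estimate, std_error):
    """t statistic for testing whether a coefficient differs from zero."""
    return estimate / std_error

def significance_code(p_value):
    """Map a p-value to the significance code printed next to it."""
    if p_value < 0.001:
        return "***"
    if p_value < 0.01:
        return "**"
    if p_value < 0.05:
        return "*"
    if p_value < 0.1:
        return "."
    return " "

# Area coefficient from the summary, with a HYPOTHETICAL standard error.
t_area = t_value(108.94, 12.5)
print(round(t_area, 4))

# Bedrooms p-value from the summary.
print(repr(significance_code(0.048)))
```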
• The Residual Standard Error is a measure of the typical size of the
residuals. In this case, predicted house prices typically differ from the
actual prices by approximately $11,450.

•R-squared (0.943): This indicates that 94.3% of the variability in house prices is explained by the
model (i.e., by the variables Area and Bedrooms).
•Adjusted R-squared (0.921): This is a modified version of R-squared that adjusts for the number
of predictors. It is never higher than R-squared and penalizes adding unnecessary variables. Here, it still
shows a very good fit (92.1%).
•The F-statistic tests the overall significance of the regression model, i.e., whether the predictors
together explain more variation than a model with only an intercept.
•DF (Degrees of Freedom): 2 for the predictors and 4 for the residuals.
•p-value (0.00231): This indicates that the overall model is statistically significant at predicting
house prices, as the p-value is well below 0.05.
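
The fit statistics above are all derived from two sums of squares. The Python sketch below uses the standard formulas for a linear model with an intercept; the sums of squares are hypothetical, so the results will not match the summary above. Only n = 7 observations and k = 2 predictors are chosen to give the same 2 and 4 degrees of freedom.

```python
from math import sqrt

def fit_statistics(sse, sst, n, k):
    """Goodness-of-fit measures for a linear model with an intercept.

    sse: sum of squared residuals
    sst: total sum of squares of the dependent variable around its mean
    n:   number of observations
    k:   number of predictors
    """
    residual_df = n - k - 1
    r_squared = 1 - sse / sst
    adj_r_squared = 1 - (1 - r_squared) * (n - 1) / residual_df
    f_statistic = (r_squared / k) / ((1 - r_squared) / residual_df)
    residual_std_error = sqrt(sse / residual_df)
    return {
        "residual_df": residual_df,
        "r_squared": r_squared,
        "adj_r_squared": adj_r_squared,
        "f_statistic": f_statistic,
        "residual_std_error": residual_std_error,
    }

# Hypothetical sums of squares, for illustration only.
stats = fit_statistics(sse=100.0, sst=1000.0, n=7, k=2)
for name, value in stats.items():
    print(name, round(value, 4))
```

Note how the adjusted R-squared comes out lower than R-squared, because the factor (n - 1) / (n - k - 1) is greater than 1 whenever there is at least one predictor.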

Conclusion:
From this summary, we can conclude:
•Area and Bedrooms significantly contribute to predicting house prices.
•The model fits the data well, as indicated by the high R-squared value (0.943).
•The model is statistically significant overall, with a p-value of 0.00231.
NOTE:

Interpretation of Residuals
Residuals are the differences between the observed values of the dependent variable (in this case,
Price) and the predicted values from the regression model.
•Positive residuals: Indicate that the model underestimated the actual value. For example:
•For data point 2, the residual is 38775.3, meaning the predicted price is $38,775.3 lower than
the actual price.
•For data point 1, the residual is 10521.9, indicating the model underestimated the price by
$10,521.9.
•Negative residuals: Indicate that the model overestimated the actual value. For
example:
•For data point 6, the residual is -44538.0, meaning the predicted price is $44,538
higher than the actual price.
•For data point 3, the residual is -13597.9, indicating the model overestimated the
price by $13,597.9.
•Close-to-zero residuals: Suggest the model's predictions are accurate for that data
point. For example:
•For data point 4, the residual is -278.8, meaning the predicted price is quite close to
the actual price.

•Large residuals (like 38,775.3 or -44,538.0) suggest that the model is not fitting
those particular data points well, leading to significant over- or under-predictions.
•Smaller residuals (like -278.8) suggest that the model is making fairly accurate
predictions for those data points.
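
The sign rule in this note can be written as a short Python sketch. The residual values are the ones quoted above; the $1,000 tolerance used to call a residual "close to zero" and the function name describe_residual are assumptions made for illustration.

```python
# Residuals quoted in the note above, keyed by data point number.
residuals = {1: 10521.9, 2: 38775.3, 3: -13597.9, 4: -278.8, 6: -44538.0}

def describe_residual(residual, tolerance=1000.0):
    """Classify a residual (observed minus predicted) by sign and size.

    tolerance is a HYPOTHETICAL cutoff for "close to zero".
    """
    if abs(residual) <= tolerance:
        return "close to actual"
    if residual > 0:
        return "underestimated"  # prediction was lower than the actual price
    return "overestimated"       # prediction was higher than the actual price

labels = {point: describe_residual(r) for point, r in residuals.items()}
for point, label in labels.items():
    print(point, residuals[point], label)
```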
