0% found this document useful (0 votes)

15 views3 pages

Alteryx Inspire 2018: Linear Regression Insights

The document discusses techniques for exploring, preparing, and modeling time series and other types of data using Alteryx tools. Key points include using field summaries, scatter plots, and other exploratory techniques to understand data; imputing missing values; performing regression analysis and assessing significance of predictors; creating classification models like decision trees; and evaluating different model performance metrics.

Uploaded by

Ishan Sane

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

15 views3 pages

Alteryx Inspire 2018: Linear Regression Insights

Uploaded by

Ishan Sane

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

Alteryx Inspire Conference

 Field summary used to investigate data type & statistical dist.

 Scatter plots & plot of means can be used for exploratory data analysis
 Impute tool (handles missing or zero values) with mean as an option
 Are 0 values included in the mean calculation?
 P-value analysis on target variable (lower the value more significant the result)
 Association measure (analysis only relevant for linear/logistic regression)
 Create samples tool: creates a training/testing set
 Linear regression (interactive tool provides breakdown of results). Especially look for lowest
p value indicating most relevance (statistical significance)
 Intercept value (value if every other variable is zero)
 OLS analysis (spread of errors will reveal model bias)
 Stepwise regression (re-selects predictor variables depending on their significance)
 Oversample tool (selects samples biased to a certain value)
 Log normalisation (dealing with skewed data)
Log([value]+1), regression deals easier with linearised data
 Confusion matrix will give values of false positives/negatives
 Using false positives, we can oversample that to 50% split to train the model

 Decision Tree (green: path to failure, orange represents success, Tree Classification browse
tool, if it is a yes (go to the left otherwise right)
Accuracy at each node can be shown
 Union tool can also combine model objects together

Understanding Time Series

 Always start with a field summary (describe())

 Find any missing periods
 MUST have consecutive periods between beginning and ending periods

 TS Filler fills missing gaps

 Green bar represents population of numeric vs. null values
 TS Plots allows you to analyse time series data in terms of decomposition, auto-correlation,
partial auto-correlation

 Log frequency/sample to look at relative basis over time

 Clustering is an un-supervised learning technique
 Udacity (predictive analytics course). Can do

Cache & run workflow (caching up till a certain point in a workflow)

Insights tool – has a built in viz platform

Putler’s Predictive Analytics Pyramid

 Determine information needed to address problem/issue

 Find & engineer appropriate and meaningful predictors
 Relationship between predictors & target
 Determine type of models needed

Meaningful metrics for prediction

Decision makers can tend to jump to a solution too soon rather than determining what information
is really needed to inform the problem/solution.

Comparing metrics from different types of models

Is it providing signal or creating noise in the model

Which predictor matters the most when making a prediction

Different modelling methods use different measures of effect size

How does predicted value change as level of numeric predictor increases or as the category changes
for a categorical predictor

For classification models – predicted probability for each possible target classes

Regression models (predicted numeric value of target)

Metrics - Regression

1. MAPE (%)
2. RMSE
3. Correlation between actual & predicted values

Metrics - Binary or Multi-Class Models

- Area under receiver operator curve (AUC) only for binary, can have multi-class extension to
it
- Confusion matrix
- Log-loss (penalise based on count)

Partial dependency plot (fitted values across range of a focal predictor)

Multi-collinearity only starts affecting the model when number of records are a lot

Reverse-causality

Efficiency

 Performance
 Memory
 Hard drive space
 Load on servers during production

Develop Efficiency

Caching

 Right-click & cache to avoid re-running workflow

Reduce by sampling

Ctrl+f (in all caps, can search for values within tools)

Can load games (in ‘about’ section)

HIPPO (Highest Paid Person’s Opinion)

Understanding Predictive Analytics Models
No ratings yet
Understanding Predictive Analytics Models
28 pages
Notes
No ratings yet
Notes
39 pages
Econometrics: Data Types and Analysis
No ratings yet
Econometrics: Data Types and Analysis
28 pages
Data-Driven Insights with Python App
No ratings yet
Data-Driven Insights with Python App
12 pages
Descriptive, Diagnostic, Predictive Analytics
No ratings yet
Descriptive, Diagnostic, Predictive Analytics
24 pages
Predictive Analytics and Regression Models
No ratings yet
Predictive Analytics and Regression Models
29 pages
Predictive Analytics with Qlik Sense
No ratings yet
Predictive Analytics with Qlik Sense
24 pages
Spark Neural Network Overview
No ratings yet
Spark Neural Network Overview
43 pages
Predictive Analytics Overview and Techniques
No ratings yet
Predictive Analytics Overview and Techniques
25 pages
Understanding Predictive Analytics Basics
No ratings yet
Understanding Predictive Analytics Basics
5 pages
Data Mining Project Study Guide
No ratings yet
Data Mining Project Study Guide
6 pages
Alteryx Predictive Analytics Tools Guide
No ratings yet
Alteryx Predictive Analytics Tools Guide
1 page
Big Data Analytics: Techniques & Applications
No ratings yet
Big Data Analytics: Techniques & Applications
27 pages
Tableau Time Series Forecasting Guide
No ratings yet
Tableau Time Series Forecasting Guide
10 pages
Introduction to Predictive Analytics
No ratings yet
Introduction to Predictive Analytics
77 pages
Overview of Predictive Analytics
No ratings yet
Overview of Predictive Analytics
26 pages
Oracle Crystal Ball Predictor Overview
No ratings yet
Oracle Crystal Ball Predictor Overview
26 pages
Big Data Analytics Complete Notes.
No ratings yet
Big Data Analytics Complete Notes.
5 pages
Predictive Analytics in Project Risk Management
No ratings yet
Predictive Analytics in Project Risk Management
25 pages
PGP in Data Science Curriculum Overview
No ratings yet
PGP in Data Science Curriculum Overview
17 pages
Predictive Analytics Overview and Applications
No ratings yet
Predictive Analytics Overview and Applications
24 pages
Exploratory Data Analysis Techniques
No ratings yet
Exploratory Data Analysis Techniques
102 pages
Data Analytics Techniques Overview
No ratings yet
Data Analytics Techniques Overview
25 pages
SAS Data Analytics for Predictive Insights
No ratings yet
SAS Data Analytics for Predictive Insights
49 pages
Business Intelligence Tools Overview
No ratings yet
Business Intelligence Tools Overview
4 pages
PTL - Savilles Ads
No ratings yet
PTL - Savilles Ads
13 pages
Predictive Analytics in Business Decisions
No ratings yet
Predictive Analytics in Business Decisions
21 pages
Time Series Forecasting with R Techniques
No ratings yet
Time Series Forecasting with R Techniques
18 pages
Forecasting Techniques in Predictive Analytics
No ratings yet
Forecasting Techniques in Predictive Analytics
11 pages
Predictive Analytics Masterclass Overview
No ratings yet
Predictive Analytics Masterclass Overview
183 pages
Predictive Analytics and Visualization Techniques
100% (1)
Predictive Analytics and Visualization Techniques
19 pages
Types of Data Analytics Explained
No ratings yet
Types of Data Analytics Explained
27 pages
Data Cleaning and Exploration in Analytics
No ratings yet
Data Cleaning and Exploration in Analytics
37 pages
HR Analytics Fundamentals and Techniques
No ratings yet
HR Analytics Fundamentals and Techniques
41 pages
Understanding Analytics Types: A Guide
No ratings yet
Understanding Analytics Types: A Guide
6 pages
Data Preprocessing Techniques in Modeling
No ratings yet
Data Preprocessing Techniques in Modeling
18 pages
Data Preparation for Predictive Analytics
No ratings yet
Data Preparation for Predictive Analytics
6 pages
Data Analytics Techniques and Ethics
No ratings yet
Data Analytics Techniques and Ethics
10 pages
What Is Predictive Analytics - 3 Things You Need To Know - MATLAB & Simulink
No ratings yet
What Is Predictive Analytics - 3 Things You Need To Know - MATLAB & Simulink
11 pages
Understanding Predictive Analytics
No ratings yet
Understanding Predictive Analytics
10 pages
Excel Analytics and Visualization Tools
No ratings yet
Excel Analytics and Visualization Tools
4 pages
EDA Techniques for Time Series Analysis
No ratings yet
EDA Techniques for Time Series Analysis
20 pages
Predictive Analytics Explained: A Guide
No ratings yet
Predictive Analytics Explained: A Guide
11 pages
Data Structures and Analysis Software Guide
No ratings yet
Data Structures and Analysis Software Guide
3 pages
Bigdata Analytics
No ratings yet
Bigdata Analytics
11 pages
328607overview of Analytical Techniques-1745296825666
No ratings yet
328607overview of Analytical Techniques-1745296825666
8 pages
Mda - Unit V
No ratings yet
Mda - Unit V
31 pages
Predictive Analytics Overview and Methods
No ratings yet
Predictive Analytics Overview and Methods
8 pages
Data Structures for Analysis Explained
No ratings yet
Data Structures for Analysis Explained
5 pages
01st Review
No ratings yet
01st Review
19 pages
Overview of Business Analytics Types
No ratings yet
Overview of Business Analytics Types
24 pages
Understanding Predictive Analytics Types
100% (1)
Understanding Predictive Analytics Types
32 pages
Data Analytics Overview and Tools Guide
No ratings yet
Data Analytics Overview and Tools Guide
26 pages
Comprehensive Guide to Data Analytics
No ratings yet
Comprehensive Guide to Data Analytics
9 pages
Business Analytics Overview and Examples
No ratings yet
Business Analytics Overview and Examples
16 pages
Winter 2025 PA
No ratings yet
Winter 2025 PA
51 pages
Machine Learning and Descriptive Analytics
No ratings yet
Machine Learning and Descriptive Analytics
8 pages
Future Predictions by a Blind Mystic
No ratings yet
Future Predictions by a Blind Mystic
13 pages
CBOT-Understanding Basis
No ratings yet
CBOT-Understanding Basis
26 pages
19-00351 DATA61 REPORT AgricultureWorkforce WEB 191031
No ratings yet
19-00351 DATA61 REPORT AgricultureWorkforce WEB 191031
80 pages
Cirq Bar Menu at Crown Sydney
No ratings yet
Cirq Bar Menu at Crown Sydney
3 pages
Australia’s Grain Trading Landscape
No ratings yet
Australia’s Grain Trading Landscape
38 pages
The Vessel Scheduling Problem in A Liner Shipping
No ratings yet
The Vessel Scheduling Problem in A Liner Shipping
17 pages
The Nomenclature of Jewelry Part 3 - Rings - International Gem Society IGS
No ratings yet
The Nomenclature of Jewelry Part 3 - Rings - International Gem Society IGS
4 pages
Crop Supply & Demand Update October 2023
No ratings yet
Crop Supply & Demand Update October 2023
6 pages
Chickpea Marketing India
No ratings yet
Chickpea Marketing India
19 pages
Python for Finance & Trading Guide
No ratings yet
Python for Finance & Trading Guide
11 pages
2017 Advisory Compliance Workshop Program ONLY V12
No ratings yet
2017 Advisory Compliance Workshop Program ONLY V12
1 page
Making Money Investing in Gems - International Gem Society IGS
No ratings yet
Making Money Investing in Gems - International Gem Society IGS
9 pages
The Nomenclature of Jewelry Part 1 - Settings - International Gem Society IGS
No ratings yet
The Nomenclature of Jewelry Part 1 - Settings - International Gem Society IGS
9 pages
Continuous Futures Data Series For Back Testing and Technical Analysis
No ratings yet
Continuous Futures Data Series For Back Testing and Technical Analysis
6 pages
Naked Money
100% (1)
Naked Money
341 pages
Mahesh Gowande: Equity Futurologist Insights
No ratings yet
Mahesh Gowande: Equity Futurologist Insights
2 pages
Kegunaan dan Analisis Biodiesel
No ratings yet
Kegunaan dan Analisis Biodiesel
3 pages
Sample: For Your Information
No ratings yet
Sample: For Your Information
28 pages
Commodity Trading Goes Back To The Future
No ratings yet
Commodity Trading Goes Back To The Future
10 pages
Backwardation's Impact on Commodity Futures
No ratings yet
Backwardation's Impact on Commodity Futures
30 pages
Education Lesson Inventory: Courses
No ratings yet
Education Lesson Inventory: Courses
21 pages
KPI Performance Management Procedure
No ratings yet
KPI Performance Management Procedure
6 pages
A Quantitative Analysis of Managed Futures Strategies: Lintner Revisited
No ratings yet
A Quantitative Analysis of Managed Futures Strategies: Lintner Revisited
40 pages
Chentsov's Theorem in Information Geometry
No ratings yet
Chentsov's Theorem in Information Geometry
20 pages
Chapra Numerical Methods 6th Solutions
0% (2)
Chapra Numerical Methods 6th Solutions
3 pages
Cryptanalysis of Vigen Re Cipher Method Implementation
No ratings yet
Cryptanalysis of Vigen Re Cipher Method Implementation
5 pages
Deep Learning for Plant Disease Detection
No ratings yet
Deep Learning for Plant Disease Detection
18 pages
F-Tests in Econometrics Explained
No ratings yet
F-Tests in Econometrics Explained
7 pages
Types of Probability Distributions
No ratings yet
Types of Probability Distributions
16 pages
ICPC Dhaka 2020 Contest Editorial
No ratings yet
ICPC Dhaka 2020 Contest Editorial
9 pages
Real-Time DDoS Detection with AI
No ratings yet
Real-Time DDoS Detection with AI
11 pages
Collaborative Filtering in Recommender Systems
No ratings yet
Collaborative Filtering in Recommender Systems
6 pages
Non-Parametric News Impact Curve Model
No ratings yet
Non-Parametric News Impact Curve Model
45 pages
Iroha-chan Problem Set Overview
No ratings yet
Iroha-chan Problem Set Overview
4 pages
Hit and Run Algorithm Assignment
No ratings yet
Hit and Run Algorithm Assignment
6 pages
Neeraj Chopra's Path to Paris 2024 Gold
No ratings yet
Neeraj Chopra's Path to Paris 2024 Gold
6 pages
Introduction to Numerical Methods
No ratings yet
Introduction to Numerical Methods
34 pages
Physics-Based Neural Networks in Engineering
No ratings yet
Physics-Based Neural Networks in Engineering
18 pages
Polynomial Factoring and Intercepts Guide
No ratings yet
Polynomial Factoring and Intercepts Guide
20 pages
LCG Randomness Tests Explained
No ratings yet
LCG Randomness Tests Explained
6 pages
Deep Learning Overview and Applications
No ratings yet
Deep Learning Overview and Applications
3 pages
Multiply in Parts: Strategies and Examples
No ratings yet
Multiply in Parts: Strategies and Examples
3 pages
Bi-Level Optimization for Face Detection
No ratings yet
Bi-Level Optimization for Face Detection
5 pages
Six Sigma Reference Tool Overview
No ratings yet
Six Sigma Reference Tool Overview
45 pages
S1 Second Series Algorithmic Thinking With Python Franklin's Lectures
No ratings yet
S1 Second Series Algorithmic Thinking With Python Franklin's Lectures
2 pages
Predicting Respiratory Diseases from PM2.5
No ratings yet
Predicting Respiratory Diseases from PM2.5
4 pages
Key Questions for Semester Exam Review
No ratings yet
Key Questions for Semester Exam Review
3 pages
CS221 Week 2 ML Problem Solutions
No ratings yet
CS221 Week 2 ML Problem Solutions
7 pages
Regula Falsi Method for Root Finding
No ratings yet
Regula Falsi Method for Root Finding
3 pages
EEG-Based Depression Detection Analysis
No ratings yet
EEG-Based Depression Detection Analysis
24 pages
Spark ML Pipeline for Classifying Reviews
No ratings yet
Spark ML Pipeline for Classifying Reviews
11 pages
Two-Port Network Parameter Analysis
No ratings yet
Two-Port Network Parameter Analysis
3 pages
Understanding Graph Theory Basics
No ratings yet
Understanding Graph Theory Basics
33 pages

Alteryx Inspire 2018: Linear Regression Insights

Uploaded by

Alteryx Inspire 2018: Linear Regression Insights

Uploaded by

Alteryx Inspire Conference

 Field summary used to investigate data type & statistical dist.

Understanding Time Series

 Always start with a field summary (describe())

 TS Filler fills missing gaps

 Log frequency/sample to look at relative basis over time

Cache & run workflow (caching up till a certain point in a workflow)

Putler’s Predictive Analytics Pyramid

 Determine information needed to address problem/issue

Meaningful metrics for prediction

Comparing metrics from different types of models

Is it providing signal or creating noise in the model

Which predictor matters the most when making a prediction

Different modelling methods use different measures of effect size

Regression models (predicted numeric value of target)

Metrics - Binary or Multi-Class Models

Partial dependency plot (fitted values across range of a focal predictor)

 Right-click & cache to avoid re-running workflow

Can load games (in ‘about’ section)

HIPPO (Highest Paid Person’s Opinion)

You might also like