0% found this document useful (0 votes)

3 views6 pages

Python CLT and Confidence Interval Simulations

data science coding tutorial series 6

Uploaded by

hpmfsl2sys3

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views6 pages

Python CLT and Confidence Interval Simulations

data science coding tutorial series 6

Uploaded by

hpmfsl2sys3

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Q11.

Write Python code to demonstrate the Central Limit Theorem (CLT) using
simulation. (Hint: Draw many samples, compute sample means, plot histogram.)

In [ ]: # Central Limit Theorem (CLT)

import numpy as np
import [Link] as plt

# Try changing these values-

n = 30 # Sample size in each draw (try 5, 30, 100)

B = 1000 # Number of repeated samples (try 500, 2000, 5000)
dist = "exponential" # Try for: "exponential", "uniform", "poisson"

# Generate samples and compute means

[Link](0) # Reproducibility
means = [Link](B) # Store sample means

for i in range(B):
if dist == "exponential":
sample = [Link](scale=1.0, size=n)
elif dist == "uniform":
sample = [Link](0, 10, size=n)
elif dist == "poisson":
sample = [Link](lam=3, size=n)
means[i] = [Link](sample)

# Plot results

[Link](means, bins=30, density=True, edgecolor='black', color='lightgreen')

[Link](f"CLT: Distribution of sample means (n={n}, B={B}, dist={dist})")
[Link]("Sample Mean")
[Link]("Density")
[Link]()

# Check mean and std

print("Average of sample means:", [Link](means))

print("Std. deviation of sample means:", [Link](means))

# What you should observe / interpret:

#The original population (exponential) is skewed.
#The histogram of sample means should look approximately normal (bell-shaped).

# Small exercises you can try:

# Try n as (5,30,100) and see how the shape changes (larger n → more normal).
# Try [Link](...) with [Link](...) or [Link](...)
# to test CLT for different population shapes.
Average of sample means: 0.9954889043544146
Std. deviation of sample means: 0.18313997891553105

Q12. Write Python code to construct a 95% confidence interval for the mean of a
dataset. (Hint: Use the following predefined
dataset:([65,70,68,72,74,69,71,73,67,66,70,75])

In [ ]: # Confidence Interval

import numpy as np
import [Link] as plt
from [Link] import norm

# Try changing these values!

scores = [65,70,68,72,74,69,71,73,67,66,70,75] # Your dataset
conf = 95 # Confidence level (try 90, 95, 99)

# Step 1: Basic statistics

n = len(scores)
mean = [Link](scores)
s = [Link](scores, ddof=1) # sample standard deviation

# Step 2: Standard error

SE = s / [Link](n)

# Step 3: Critical z-value

alpha = 1 - (conf / 100)
z = [Link](1 - alpha/2) # two-tailed

# Step 4: Confidence interval

lower = mean - z * SE
upper = mean + z * SE

print(f"{conf}% CI for the mean: ({lower:.3f}, {upper:.3f})")

print("Sample mean is:", mean)

# Step 5: Plot
[Link](scores, bins=5, edgecolor='black', color='skyblue')
[Link](lower, color='red', linestyle='--', label=f'Lower {conf}% CI')
[Link](upper, color='blue', linestyle='--', label=f'Upper {conf}% CI')
[Link](mean, color='green', linestyle='-', label='Sample Mean')
[Link](f"{conf}% Confidence Interval for the Mean")
[Link]("Scores")
[Link]("Frequency")
[Link]()
[Link]()
#Try for different CI 90%,99%. And also try for different scores.

95% CI for the mean: (68.211, 71.789)

Sample mean is: 70.0

[Link] random numbers from the normal distribution and approximate the
shape of: a) Chi square distribution with 3 degrees of freedom b) t-distribution with
5 degrees of freedom (Hint: Use only [Link], generate 100,000
samples, construct them using definitions, then plot histograms.

In [ ]: import numpy as np
import [Link] as plt

# Step 1: Generate 100,000 Chi-square samples with df=3

# Each sample = sum of squares of df independent standard normal variables
df = 3
chi_sq_samples = [sum([Link](0,1,df)**2) for _ in range(100000)]

# Step 2: Basic statistics

mean = [Link](chi_sq_samples)
std = [Link](chi_sq_samples)
print(f"Chi-square Mean: {mean:.2f}, Std Dev: {std:.2f}")

# Step 3: Plot histogram

[Link](chi_sq_samples, bins=50, edgecolor='black', color='lightcoral')
[Link](mean, color='green', linestyle='-', label='Mean')
[Link]('Chi-square Distribution (df=3)')
[Link]('Value')
[Link]('Frequency')
[Link]()
[Link]()

# Exploration / Try & Observe

# Change df to (5,10,15) and observe how distribution becomes more symmetric.
# Observe that distribution is always positive and skewed to the right.
# Try plotting samples (e.g., 10,000 or 200,000) and see histogram smoothing.

Chi-square Mean: 2.99, Std Dev: 2.44

[Link] random numbers from the normal distribution and approximate the
shape of: b) t-distribution with 5 degrees of freedom (Hint: Use only
[Link], generate 100,000 samples, construct them using
definitions,then plot histograms.)

In [ ]: import numpy as np
import [Link] as plt

# Step 1: Generate 100,000 t-distribution samples

# t = Z / sqrt(V/df), where Z~N(0,1) and V=sum of squares of df standard normal variab
df = 5
t_samples = []
for _ in range(100000):
Z = [Link](0,1) # numerator (standard normal)
V = sum([Link](0,1,df)**2) # denominator chi-square component
t_samples.append(Z / [Link](V/df))

# Step 2: Basic statistics

mean = [Link](t_samples)
std = [Link](t_samples)
print(f"t-Distribution Mean: {mean:.2f}, Std Dev: {std:.2f}")
# Step 3: Plot histogram
[Link](t_samples, bins=50, edgecolor='black', color='skyblue')
[Link](mean, color='green', linestyle='-', label='Mean')
[Link]('t-Distribution (df=5)')
[Link]('Value')
[Link]('Frequency')
[Link]()
[Link]()

# Step 4: Exploration / Try & Observe

# Change df to (2,10,20) to see effect on tail thickness.
# Observe symmetry around 0 and heavier tails compared to normal distribution.
# Try different samples (50,000 or 200,000) to see smoother histograms.

t-Distribution Mean: 0.00, Std Dev: 1.29

Central Limit Theorem Simulation Analysis
No ratings yet
Central Limit Theorem Simulation Analysis
6 pages
Expt 5
No ratings yet
Expt 5
34 pages
Probability Distributions in Python
No ratings yet
Probability Distributions in Python
11 pages
Histogram of Normal Distribution
No ratings yet
Histogram of Normal Distribution
12 pages
PDF Sampling and Statistics Workshop
No ratings yet
PDF Sampling and Statistics Workshop
10 pages
Central Limit Theorem for Uniform RVs
No ratings yet
Central Limit Theorem for Uniform RVs
9 pages
Normal Distribution
No ratings yet
Normal Distribution
18 pages
Normal Distribution
No ratings yet
Normal Distribution
18 pages
Python for Probability and Statistics
No ratings yet
Python for Probability and Statistics
37 pages
BCA V SEM Probability & Statistics Lab Manual
No ratings yet
BCA V SEM Probability & Statistics Lab Manual
7 pages
Modeling and Simulation Lab Tasks
No ratings yet
Modeling and Simulation Lab Tasks
8 pages
Normal Distribution in Python
No ratings yet
Normal Distribution in Python
43 pages
QCR-II LAB 11 - ReportTemplate
No ratings yet
QCR-II LAB 11 - ReportTemplate
10 pages
Empirical vs Theoretical PDF Analysis
No ratings yet
Empirical vs Theoretical PDF Analysis
56 pages
Lab 02
No ratings yet
Lab 02
28 pages
Simulation Modelling Lab Report Mukesh Pant
No ratings yet
Simulation Modelling Lab Report Mukesh Pant
26 pages
House Price Analysis and Visualization
No ratings yet
House Price Analysis and Visualization
16 pages
Probability & Statistics Confidence Intervals
No ratings yet
Probability & Statistics Confidence Intervals
31 pages
Random Processes Proj
No ratings yet
Random Processes Proj
7 pages
Statistical Simulations and Analysis
No ratings yet
Statistical Simulations and Analysis
3 pages
Evaluate Mean, Median & Mode in Python
No ratings yet
Evaluate Mean, Median & Mode in Python
10 pages
Chapter 0 Introduction
No ratings yet
Chapter 0 Introduction
14 pages
Central Tendency & Dispersion in Python
No ratings yet
Central Tendency & Dispersion in Python
7 pages
Lab 1
No ratings yet
Lab 1
8 pages
Central Limit Theorem R Simulations Guide
No ratings yet
Central Limit Theorem R Simulations Guide
6 pages
Central Limit Theorem Overview
No ratings yet
Central Limit Theorem Overview
3 pages
Random Number Generation and Analysis
No ratings yet
Random Number Generation and Analysis
25 pages
PRACTICALS
No ratings yet
PRACTICALS
26 pages
Z-Test for Proportions Explained
No ratings yet
Z-Test for Proportions Explained
9 pages
Markov Chain Analysis in Statistics
No ratings yet
Markov Chain Analysis in Statistics
13 pages
Probability Distributions in MATLAB
No ratings yet
Probability Distributions in MATLAB
16 pages
Standard Deviation in ML Analysis
No ratings yet
Standard Deviation in ML Analysis
31 pages
Computer Simulation Experiments
No ratings yet
Computer Simulation Experiments
10 pages
Probability and Regression Analysis Examples
No ratings yet
Probability and Regression Analysis Examples
2 pages
Sampling Distributions & R Simulations
No ratings yet
Sampling Distributions & R Simulations
7 pages
Untitled 14
No ratings yet
Untitled 14
9 pages
Fds Project Lab Program Ex4 - 1
No ratings yet
Fds Project Lab Program Ex4 - 1
13 pages
Normal and Log-Normal Distributions
No ratings yet
Normal and Log-Normal Distributions
1 page
Sampling and Bootstrap Distributions Guide
No ratings yet
Sampling and Bootstrap Distributions Guide
14 pages
Statistical Inference and Hypothesis Testing
No ratings yet
Statistical Inference and Hypothesis Testing
2 pages
R Functions for Probability Distributions
No ratings yet
R Functions for Probability Distributions
29 pages
Sampling
No ratings yet
Sampling
42 pages
Matlab Sem 2 Unit 2
No ratings yet
Matlab Sem 2 Unit 2
58 pages
Central Limit Theorem Experiment Analysis
No ratings yet
Central Limit Theorem Experiment Analysis
9 pages
Q-Q Plots in Central Limit Theorem
No ratings yet
Q-Q Plots in Central Limit Theorem
10 pages
Random Number Generation in Python
No ratings yet
Random Number Generation in Python
8 pages
Implementing Exponential Distribution in Python
No ratings yet
Implementing Exponential Distribution in Python
44 pages
Understanding Probability Distributions
No ratings yet
Understanding Probability Distributions
9 pages
Standard Normal Distribution in R
No ratings yet
Standard Normal Distribution in R
6 pages
Understanding Sampling Distributions
No ratings yet
Understanding Sampling Distributions
12 pages
Statistical Tests and Distributions Guide
No ratings yet
Statistical Tests and Distributions Guide
5 pages
Probability and Statistics Course Guide
No ratings yet
Probability and Statistics Course Guide
5 pages
House Prices and Statistical Analysis
No ratings yet
House Prices and Statistical Analysis
16 pages
Understanding Inferential Statistics
No ratings yet
Understanding Inferential Statistics
35 pages
CLT Simulation with COVID-19 Data Analysis
No ratings yet
CLT Simulation with COVID-19 Data Analysis
12 pages
Understanding Sampling Distributions
No ratings yet
Understanding Sampling Distributions
15 pages
Year 2 Discrete Random Variables Guide
No ratings yet
Year 2 Discrete Random Variables Guide
1 page
Sta301 by S Khan Academy
No ratings yet
Sta301 by S Khan Academy
102 pages
Truncated and Censored Regression Models
No ratings yet
Truncated and Censored Regression Models
18 pages
Probabilistic Topic Models Overview
No ratings yet
Probabilistic Topic Models Overview
64 pages
Math 523: Advanced Probability Course
No ratings yet
Math 523: Advanced Probability Course
2 pages
Edexcel Statistical Formulae Booklet
No ratings yet
Edexcel Statistical Formulae Booklet
140 pages
Conditional Probability with Dice Example
No ratings yet
Conditional Probability with Dice Example
2 pages
Evaluating Language Models: Lecture 10
No ratings yet
Evaluating Language Models: Lecture 10
18 pages
Business Statistics Model Paper 4th Sem
80% (5)
Business Statistics Model Paper 4th Sem
2 pages
Skewness and Kurtosis Explained
No ratings yet
Skewness and Kurtosis Explained
3 pages
Normality Testing and Data Transformation
No ratings yet
Normality Testing and Data Transformation
56 pages
Bayesian Analysis for Low-Background Isotope Detection
No ratings yet
Bayesian Analysis for Low-Background Isotope Detection
9 pages
Fundamentals of Probability Concepts
No ratings yet
Fundamentals of Probability Concepts
78 pages
Ellen's Stock Gamble and Wealth Probability
No ratings yet
Ellen's Stock Gamble and Wealth Probability
141 pages
Basic Probability Concepts Explained
No ratings yet
Basic Probability Concepts Explained
7 pages
Probability MCQs for Career Ride
No ratings yet
Probability MCQs for Career Ride
4 pages
Probability Concepts in Statistics
No ratings yet
Probability Concepts in Statistics
93 pages
Research Methodology: Data Collection & Analysis
No ratings yet
Research Methodology: Data Collection & Analysis
23 pages
Market Risk Measurement Course Overview
No ratings yet
Market Risk Measurement Course Overview
12 pages
Bayesian State Space Models in R
No ratings yet
Bayesian State Space Models in R
14 pages
Variance Estimation in Random Forests
No ratings yet
Variance Estimation in Random Forests
73 pages
MTH312 Tutorial Solutions - MPT
No ratings yet
MTH312 Tutorial Solutions - MPT
5 pages
Central Limit Theorem Explained
No ratings yet
Central Limit Theorem Explained
29 pages
Mathematical Aspects: Reliability-Centered Maintenance
No ratings yet
Mathematical Aspects: Reliability-Centered Maintenance
99 pages
Bayes Classifier for Power System Security
No ratings yet
Bayes Classifier for Power System Security
9 pages
MA034: Probability & Random Variables Exam
No ratings yet
MA034: Probability & Random Variables Exam
9 pages
Astronomical Statistics Overview
No ratings yet
Astronomical Statistics Overview
52 pages
FN3142 Commentary May 2024
No ratings yet
FN3142 Commentary May 2024
25 pages
Probability of Random Variables in Drinks
No ratings yet
Probability of Random Variables in Drinks
2 pages

Python CLT and Confidence Interval Simulations

Uploaded by

Python CLT and Confidence Interval Simulations

Uploaded by

Q11.

In [ ]: # Central Limit Theorem (CLT)

# Try changing these values-

n = 30 # Sample size in each draw (try 5, 30, 100)

# Generate samples and compute means

[Link](means, bins=30, density=True, edgecolor='black', color='lightgreen')

# Check mean and std

print("Average of sample means:", [Link](means))

# What you should observe / interpret:

# Small exercises you can try:

# Try changing these values!

# Step 1: Basic statistics

# Step 2: Standard error

# Step 3: Critical z-value

# Step 4: Confidence interval

print(f"{conf}% CI for the mean: ({lower:.3f}, {upper:.3f})")

95% CI for the mean: (68.211, 71.789)

# Step 1: Generate 100,000 Chi-square samples with df=3

# Step 2: Basic statistics

# Step 3: Plot histogram

# Exploration / Try & Observe

Chi-square Mean: 2.99, Std Dev: 2.44

# Step 1: Generate 100,000 t-distribution samples

# Step 2: Basic statistics

# Step 4: Exploration / Try & Observe

t-Distribution Mean: 0.00, Std Dev: 1.29

You might also like