0% found this document useful (0 votes)
11 views37 pages

Probability and Statistics Course Plan

Uploaded by

epboltipublic02
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
11 views37 pages

Probability and Statistics Course Plan

Uploaded by

epboltipublic02
Copyright
© All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

Department of Mathematics & Computing

Lecture Plan, Session: 2024-25 (Monsoon Semester)


Course Course
Name of Course L T P Credit
Type Code
DC MCC505 Probability and Statistics 3 0 0 9
Course Objective
To offer a foundation in probability theory and statistical inference in order to solve applied problems and
to prepare for more advanced courses in probability and statistics.
Learning Outcomes
This course provides a solid undergraduate foundation in both probability theory and mathematical statistics
and at the same time provides an indication of the relevance and importance of the theory in solving
practical problems in the real world.
Unit Lecture
Topics to be Covered Learning Outcome
No. Hours
1 Definition of statistics, types of data, concept of frequency 7 To understand the nature
distributions. Measures of Central Tendency: Mean, and deviation of data.
median, mode, quantiles. Measures of Dispersion: Range,
mean absolute deviation, standard deviation, variance,
covariance, coefficient of variation and moments, relation
between first four central and raw moments. Skewness and
kurtosis: measures of skewness and kurtosis.
2 Events, sample space, definitions of Probabilities, 10 To understand the logic of
Theorems of Probabilities (without proof): Addition, probability. To find the
Conditional Probability and Multiplication, Bayes theorem, descriptive statistics of
and its proof with application based on numerical distribution through
problems. Random variables; discrete and continuous, moment generating
probability functions: pmf and pdf, Joint, marginal and function.
conditional probability distributions. Mathematical
expectation and its properties. Moment generating and
characteristic functions. Exercises based on above topics
3 Statements (without proof) of Markov and Chebyshev 4 To obtain the different
inequalities and its applications based on numerical probability bounds of
problems. Statements of Law of large numbers: WLLN, data.
SLLN, Central limit theorem. Numerical problems based
on above topics.
4 Definitions, MGF, mean and variance of the following 14 To understand the
Probability distributions: Discrete: Uniform, Bernoulli, concepts of a random
binomial, negative binomial, geometric, hyper geometric, variable and analyze the
Poisson probability distributions. Continuous: Uniform, ideal patterns of data.
normal, lognormal, Cauchy, exponential, gamma, beta,
Weibull probability distributions. Definitions & uses of
sampling distributions: Chi-square, t and F, distributions of
smallest, largest order statistics and range
5 Definition of Karl Pearson correlation coefficient and its 7 To know the relationship
properties for bivariate data, Spearman’s rank correlation between variables and
coefficient with examples. Concept and derivations of predict (estimate) the
regression lines, properties of regression coefficients. Plane value of dependent
of regression (three variables case) and derivation of variable.
regression coefficients. Definitions (without proof) of
multiple and partial correlation coefficients and numerical
problems.

Text Books:

1. Sheldon M. Ross. First Course in Probability, A, 9th Edition, Pearson, Boston, 2014.
2. V.K. Rohatgi and A.K. Md. Ehsanes Saleh, An Introduction to Probability and Statistics, John
Wiley & Sons, 3rd Edition, 2015.

Reference Books:

1. Hogg, R.V., McKean, J.W. and Craig, A.T., Introduction to Mathematical Statistics. 7 th Edition,
Pearson, Boston, 2013.
2. S.C. Gupta and V. K. Kapoor, Fundamentals of Mathematical Statistics (A Modern Approach)
10th Edition, Sultan Chand & Sons, 2002.

(A K Verma) (N Jana) (Subhashis Chatterjee) (G.N. Singh)


Assistant Professor Associate Professor Professor Professor

Common questions

Powered by AI

Moment generating functions (MGFs) offer a summary of a distribution by encapsulating all its moments, making it easier to derive distribution properties such as means, variances, and skewness. Importantly, MGFs uniquely determine the probability distribution, allowing analysts to transform complex integrations into simpler algebraic manipulations. They are instrumental in proving limit theorems, enabling derivation of asymptotic properties, and simplifying calculations in combinatorial and queuing theory. Using MGFs is important for understanding complex stochastic processes and enhancing theoretical developments in probability .

The correlation coefficient measures the strength and direction of a linear relationship between two variables, ranging from -1 to 1. A high absolute value indicates a strong relationship, while a value near zero suggests no linear relationship. Importantly, correlation does not imply causation; it merely identifies associations without proving one variable causes changes in another. Understanding this distinction is crucial in statistical analysis to avoid misleading interpretations that could affect scientific, economic, and social research conclusions .

Mathematical expectation provides a weighted average of all possible values a random variable can take, serving as a measure of its central tendency (expected value). Understanding the expectation helps in predicting and making informed decisions based on probabilistic models, as it allows analysts to summarize data distributions succinctly and compute expected outcomes, variances, and covariances. This understanding aids in evaluating long-term patterns and is fundamental in decision theory, risk assessment, and economic modeling .

Sampling distributions like chi-square and t-distributions are foundational for hypothesis testing, providing the basis for statistical inference. The chi-square distribution is used mainly for variance analysis and goodness-of-fit tests, while the t-distribution is used when the population standard deviation is unknown and sample sizes are small, to estimate population means. Applying these distributions involves assessing hypotheses about population parameters, allowing researchers to make decisions based on sample data with quantified uncertainty. Conditions include assumed data normality and independent samples, critical for the validity of test results .

The Law of Large Numbers (LLN) and the Central Limit Theorem (CLT) together underpin the reliability of statistical methods. LLN states that as the sample size grows, the sample mean converges to the expected value, reinforcing data stability. CLT complements this by describing how the distribution of the sample mean becomes normal with larger samples. Together, they justify using sample statistics to infer population parameters, enabling accurate predictions and estimates in varied contexts such as quality control and econometrics. This synergy is essential for the validity of inferential statistics .

The Central Limit Theorem (CLT) states that the distribution of the sample mean approaches a normal distribution as the sample size increases, regardless of the original distribution. This is foundational as it justifies using normal distribution in inferential statistics and analytical applications, making it possible to apply z-tests and t-tests. It's crucial in estimating population parameters and computing confidence intervals underpins the robustness of many statistical methods applicable to various fields such as quality control, opinion polls, and research studies .

Bayes' Theorem is used in medical diagnosis to update the probability of a disease given new evidence, such as a test result. It combines prior knowledge (initial estimates) with new data to provide a posterior probability. This makes it a powerful tool as it allows medical practitioners to revise probabilities with the addition of new, relevant evidence, leading to more accurate diagnosis and improved decision-making under uncertainty .

Skewness measures the asymmetry of a distribution, indicating whether data tails are balanced or more extensive on one side. Positive skewness indicates a longer tail on the right, while negative skewness suggests a longer tail on the left. Kurtosis measures the 'tailedness' of the distribution, revealing how flat or peaked it is compared to a normal distribution. High kurtosis indicates heavy tails and a sharper peak, while low kurtosis suggests light tails and a flatter peak. Together, these measures refine our understanding of a dataset's shape and provide insights into its deviation from a normal distribution .

Markov's inequality provides an upper bound for the probability that a non-negative random variable is greater than some threshold, given its expected value. Chebyshev's inequality, applicable to any distribution with a defined mean and variance, bounds the probability that a random variable deviates from its mean by more than a given amount of standard deviations. These inequalities are particularly useful when dealing with distributions that are not well-known or are unwieldy, providing insights into variability and extreme values without needing details of the actual distribution shape .

Discrete probability distributions are used for variables that can take on distinct, separate values, like the number of heads in a series of coin tosses, described by distributions such as binomial or Poisson. Continuous probability distributions model variables that can take any value in a given range, like heights or weights, described by distributions such as normal or exponential. These distributions are crucial in modeling real-world phenomena, as they help to predict outcomes and assess risks, with the proper distribution chosen based on the nature of the data and the event being modeled .

You might also like