CFA Level I: Rates and Returns Guide
CFA Level I: Rates and Returns Guide
Rates of Return
Three Perspectives
Minimum rate of return Minimum return demanded to accept receipt of funds at a later date.
Discount rate A rate applied to the future cash flow to determine its present value.
Opportunity cost Cost of spending the money today instead of saving it for a certain
period and earning a return on it.
Components of Rates
Real risk-free rate Purely reflects individuals’ preferences for current v/s future consumption.
Inflation premium Compensation against expected loss in purchasing power over the term of a loan.
Default premium Compensates investors for the risk that the borrower might fail to make promised
payments in full in a timely manner.
Liquidity premium Compensates investors for any difficulty that they might face in converting their
holdings readily into cash at their fair value.
Maturity premium Compensates investors for the higher sensitivity of the market values of longer-
term debt instruments to changes in interest rates.
Rates of Return
Types of Rates of Return
Holding Period Return 𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃1 − 𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃0 + 𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼1
𝐻𝐻𝐻𝐻𝐻𝐻 𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆 𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 =
(Single period) 𝑃𝑃0
• Assumes the amount invested at the start of each period is the same.
• Heavily influenced by outliers.
Geometric Mean 𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺𝐺 𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅
1�
Return = [(1 + 𝐻𝐻𝐻𝐻𝐻𝐻1 )(1 + 𝐻𝐻𝐻𝐻𝑅𝑅2 ) … . (1 + 𝐻𝐻𝐻𝐻𝑅𝑅3 )] 𝑛𝑛 −1
Example: A manager invests $5,000 annually in an ETF for four years. Below were the prices of the ETF
at the time of purchase.
Y1: $62, Y2: $76, Y3 = $84, Y4: $90.
The average cost price of the ETF for the manager across the four years equals – ?
Solution:
$76.48
Other Rates of Return used in Portfolio Management
Money-Weighted Rate of Return (Internal Rate of Return or IRR)
• Basics: The value of an asset can be considered as the present value of its future cash flows when
applying a relevant discount rate.
𝐶𝐶1 𝐶𝐶2 𝐶𝐶𝑛𝑛
𝑉𝑉0 = 1
+ 2
+ ⋯+
(1 + 𝑟𝑟) (1 + 𝑟𝑟) (1 + 𝑟𝑟)𝑛𝑛
• The rate that is being used here, to equate the future cash flows to the current value is called as
the Internal Rate of Return.
Calculation of Internal Rate of Return
Example: At t=0, an investor buys a share at the price of $100. After a year, she purchases another
share of the same stock at $120. After another year (t=2), she sells both the shares for a total price of
$270. She also received a dividend of $5 a share at the end of each year. What is the investor’s IRR?
Calculator method:
CF, 2nd CE|C
CF0 = -100, Enter, Down
C01 = -115, Enter, Down
F01 = 1, Enter, Down
C02 = 270, Enter, Down
F02 = 1, Enter
IRR, CPT
Ans: 16.5%
Other Rates of Return used in Portfolio Management
Time-Weighted Rate of Return
• TWR measures the compounded rate of growth of $1 invested over a stated period.
• It neutralized the impact of cash withdrawals and deposits into the portfolio.
Calculation of Time-weighted Return
Example: At t=0, an investor buys a share at the price of $100. After a year, she purchases another
share of the same stock at $120. After another year (t=2), she sells both the shares for a total price of
$250. She also received a dividend of $5 a share at the end of each year. What is the investor’s IRR?
Step 1: Splitting the holding period into two distinct years. Subperiods can of any duration and need
not be uniform.
Year 1 return:
(25%)
Year 2 return:
(12.5%)
Step 3: Calculate the holding period returns by linking the subperiod returns.
Annualized Returns
• Enables comparison across investment assets / portfolios with different maturities.
Stated vs Effective Annual Return (Discrete Compounding)
Example: An investor deposits money with a financial institution who provides her with three options.
Option 1: 5% per annum, compounded annually.
Option 2: 4.91% per annum, compounded quarterly.
Option 3: 4.88% per annum, compounded weekly.
Which option should the investor pick from?
Annualized Returns
Stated vs Effective Annual Return (Continuous Compounding)
Example: An investor is set to receive a return of 5% p.a., continuously compounded. What is the
effective annual return?
Ans: 5.13%
Calculator steps:
Example: An investor earns an effective 10% in a year in the stock market. What is the continuously
compounded rate?
Ans: 9.53%
Another Nuanced Example: An investor invests $1000 in the stock market, and it grows to $1,250 in 2
years. What is the continuously compounded rate?
Ans: 11.16%
Leveraged Return
• Investors can also used borrowed funds to invest in the market.
• Borrowed funds create leverage – an effect which magnifies profits and losses.
• More of this will be covered in equity and derivatives.
𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑒𝑒𝑑𝑑𝑑𝑑 𝑏𝑏𝑏𝑏
𝑟𝑟𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙𝑙 = 𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑟𝑛𝑛𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝 + (𝑟𝑟 − 𝑟𝑟𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑 )
𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑒𝑒𝑒𝑒𝑒𝑒𝑢𝑢𝑢𝑢𝑢𝑢𝑢𝑢 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝𝑝
Level I Quantitative Methods
of the TIME VALUE OF MONEY IN FINANCE
CFA® Program
Learning Outcome Statements
LOS : Calculate and interpret the present value (PV) of fixed-income and
equity instruments based on expected future cash flows.
LOS : Explain the cash flow additivity principle, its importance for the no-
arbitrage condition, and its use in calculating implied forward interest rates,
forward exchange rates, and option values.
Fixed Income and Time Value of Money
Fixed Income Cash Flow Patterns
• Zero Coupon Bonds: The investor receives one payment at the bond maturity date. The value of
the bond is the present value of the final single payment.
• Periodic Interest / Coupon Bonds: Investors receive periodic coupon payments and the par value
at maturity.
• Level Payment Bonds / Amortizing Bonds: Investors receive uniform payments every period which
consists of both principal and interest amount.
Example 2: A 10% coupon, semi-annual bond is currently priced at $105 and has 3 years to maturity.
What is the yield to maturity (required rate of return) on this bond?
Example 3: A borrower receives $100,000 from the bank as a part of a 10-year mortgage. The interest
rate charged is 8% p.a. What the is monthly installment required by the bank if the borrower needs to
pay the complete principal along with interest at the end of 10 years?
Example 4: A perpetual bond pays 4% quarterly coupon on a $100 par. What is the value of the bond
today if the required rate of return is 6%?
Equity and Time Value of Money
Dividend Patterns
• Constant $ Dividends: The investor expects to receive the same $ amount every period.
• Constant Dividend growth rate: Dividends are expected to grow at a constant rate forever.
• Changing Dividend growth rate: Dividends grow at different rates based on lifecycle phase.
Key points • The mean is also unique; that is, each data set has only one mean.
• The arithmetic mean of a sample is the best estimate of the next observed value.
• The sum of the deviations from the arithmetic mean always equals 0.
Geometric Mean
Formula
Key points • Smaller than Arithmetic mean, unless variability = 0, where it will be equal to AM.
Weighted Mean
Formula
Median
Formula • The middle value of the data once arranged in ascending or descending order.
• The median for an odd number of variables is the observation in the position
identified by (n + 1)/2.
• The median for an even number of variables is the average of observations in the
positions identified by n/2 and (n + 2)/2.
Key points • Not sensitive to outliers and is a useful measure for asymmetric distributions.
Mode
Concept • The most frequently occurring value in a data set.
• For grouped data, the mode is the interval with the greatest number of observations.
• A data set may have no mode, one mode, or more than one mode.
Measures of Location
Quantiles
Divides data and groups it into parts
Quartile • Data divided into four parts.
Quintile • Data divided into five parts.
Decile • Data divided into ten parts.
Percentile • Data divided into one hundred parts.
Interquartile Range
Interquartile range is the difference between the top of the third quartile and the bottom of the
second quartile (i.e., top of the first quartile) or IQR = Q3 − Q1.
• Quantiles may be visualized by a box-and-
whisker plot.
• To better identify outliers, the boundaries
of the IQR +/- some percentage will be
applied to establish the whiskers.
• Outliers will appear outside the whiskers.
Measures of Location
Quantiles
Example: (2024_L1V1, Pg: 94, Example 2)
If we include one more return observation of 10% in our data set, what is the new value of the first
quartile?
Key points • MAD, unlike range, uses all the observations in the data set.
• Difficult to manipulate mathematically.
Measures of Dispersion
Variance and Standard Deviation
Variance • Average of the squared deviations around the mean.
• However, when squaring, the unit of the original data set is not preserved.
2
∑𝑛𝑛𝑖𝑖=1(𝑥𝑥𝑖𝑖 − 𝜇𝜇)2 2
∑𝑛𝑛𝑖𝑖=1(𝑥𝑥𝑖𝑖 − X̄)2
𝜎𝜎 = 𝑠𝑠 =
𝑛𝑛 𝑛𝑛 − 1
• Mean Absolute Deviation will always be lower than standard deviation due to the
squaring of deviations.
Example: Calculate the variance and standard deviation for five golfers with the scores of 67, 71, 72,
75, 68. How do the numbers differ if they represent (i) a sample or (ii) the entire population.
Calculator Method:
Measures of Dispersion
Target Downside Deviation / Semi Deviation
Concept & • Measures the average deviation of the values below a target.
Formula • Focuses only on downside risk.
Key Points • When observations are returns, CV measures the amount of standard deviation per
unit of return.
• If the mean return is negative, then the statistic is economically useless.
Measures of Shape of a Distribution
Skewness
Concept • Measures the level of asymmetry in a data distribution.
Skewness = 0
Positively
Skewed,
Skewness > 0
• A positive covariance indicates that both the variables can be expected to be above
(or below) their respective means simultaneously.
• A negative covariance indicates that when one variable is higher than its mean, the
other is expected to be lower than its respective mean and vice-versa.
• Variables could be of different scales and magnitudes. Covariance does little to
explain about the strength of the linear relationship.
Correlation • Standardized measure of linear relationship that describes not only the direction of
the linear relationship but also the strength.
Dependent event The occurrence of one is related to the occurrence of the other.
Conditional Probability ≠ Unconditional Probability
P (Pass/Easy) ≠ P (Pass)
Independent event The occurrence of one does has no bearing on the occurrence of the other.
Conditional Probability = Unconditional Probability
P(Pass/Rain) = P(Pass)
Rules of Probability
Rule What it computes? Formula
Multiplication • Probability of 2 key events Dependent events
occurring together (joint 𝐴𝐴
𝑃𝑃(𝐴𝐴𝐴𝐴) = 𝑃𝑃(𝐵𝐵) × 𝑃𝑃 � �
probability). 𝐵𝐵
• Key words: and, both, together and Independent events
combined. 𝑃𝑃(𝐴𝐴𝐴𝐴) = 𝑃𝑃(𝐵𝐵) × 𝑃𝑃(𝐴𝐴)
Addition • Probability of at least one of the P(A or B) = P(A) + P(B) − P(AB)
two events occur.
• Key words: or, either and at least. 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒 𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒
P(AB) = 0
Total • Expresses the unconditional 𝑃𝑃(𝐴𝐴) = 𝑃𝑃(𝐴𝐴𝐴𝐴) + 𝑃𝑃(𝐴𝐴𝑆𝑆 𝑐𝑐 )
probability rule probability of an event in terms of
conditional probabilities for 𝐴𝐴 𝐴𝐴
𝑃𝑃(𝐴𝐴) = 𝑃𝑃(𝑆𝑆) × 𝑃𝑃 � � + 𝑃𝑃(𝑆𝑆 𝑐𝑐 ) × 𝑃𝑃 � 𝑐𝑐 �
mutually exclusive and exhaustive 𝑆𝑆 𝑆𝑆
events.
Where S and 𝑆𝑆 𝑐𝑐 represent 2 mutually
• Trick: the question give conditional
exclusive and exhaustive scenarios.
probabilities but asks to find
unconditional probabilities.
Deriving conditional probability from joint probability:
A P(AB) B P(BA)
P� � = or P � � = Note: P(AB) = P(BA)
B P(B) A P(A)
Probability Trees
Example: There is 30% probability that the exam shall be easy. There is 80% chance of passing if the
exam is easy and 40% chance of passing if the exam is hard. Given the probabilities find the following:
A. Probability that the exam is easy, and student shall pass the exam.
B. Probability that the student shall pass the exam.
C. Probability that student shall pass the exam, or the exam is easy.
Step for tree:
1. Start with independent probabilities.
2. Layout conditional probabilities in each scenario.
3. Calculate join probability.
A. Probability that the exam is easy, and student shall pass the exam.
P(PE) = P(E) x P(P/E) = 24%
C. Probability that student shall pass the exam, or the exam is easy.
P(P or E) = P(P) + P(E) – P(PE) = 52% + 30% - 24% = 58%
Probability Trees & Bayes’ Formula
Bayes’ formula uses the occurrence of the event to infer the probability of the scenario generating it.
• For this reason, Bayer’s formula is sometimes called an inverse (posterior) probability.
Hint: the question shall ask for inverse conditional probability compared to the one given in the
question.
Example: In the above question, what is the probability that the exam is easy given the student
passed the exam.
E P(EP) 24%
P� � = = = 46.15%
P P(P) 52%
Trick to identify Bayes question: watch the question asked for P(E/P) when in the question we were
given P(P/E).
Expected Value, Variance, Standard Deviation of a Random Variable
Expected The probability-weighted average of all possible outcomes for the random variable.
𝑛𝑛
value
𝐸𝐸(𝑋𝑋) = � 𝑃𝑃𝑖𝑖 𝑋𝑋𝑖𝑖
𝑖𝑖=1
• Expected values are forecasts of what might happen based on probabilities.
• The expected value represents our best guess – the average of all outcomes over
the long run.
Variance The probability weighted sum of the squared differences between each outcome and
the expected value.
𝑛𝑛
2
𝜎𝜎 = � 𝑃𝑃𝑖𝑖 [𝑋𝑋𝑖𝑖 − 𝐸𝐸(𝑋𝑋)]2
𝑖𝑖=1
• If variance equals zero, there is no dispersion in the distribution.
• In this case, the outcome is certain, and the variable, X, is not a random variable.
Standard Standard deviation is the positive square root of the variance.
deviation 𝜎𝜎 = �𝜎𝜎 2
The variance has no units while the standard deviation is expressed in the same unit as
the expected value and the random variable itself.
Example: Calculate the expected value, variance and standard deviation of coal prices given the
following possible outcomes and their associated probabilities:
Probability 8% 15% 40% 25% 12%
Coal Price $40 $50 $60 $70 $80
E (Coal price) = (0.08 × 40) + (0.15 × 50) + (0.40 × 60) + (0.25 × 70) + (0.12 × 80) = $61.80
σ2 = 0.08(40 − 61.8)2 + 0.15(50 − 61.8)2 + 0.40(60 − 61.8)2 + 0.25(70 − 61.8)2 + 0.12(80 − 61.8)2 = 116.76
σ = √116.76 = $10.81
Calculator Steps
Expected Value, Variance, Standard Deviation of a Random Variable
The total probability rule for expected value state (unconditional) expected values in terms of
conditional expected values.
𝑋𝑋 𝑋𝑋
𝐸𝐸(𝑋𝑋) = 𝑃𝑃(𝑆𝑆) × 𝐸𝐸 � � + 𝑃𝑃(𝑆𝑆 𝑐𝑐 ) × 𝐸𝐸 � 𝑐𝑐 �
𝑆𝑆 𝑆𝑆
EPS = $10
Prob. = 70%
Good -> P(G) = 40%
E(EPS/Good) = 10 x 70% + 6 x 30% = $8.80
EPS = $6
Prob. - 30%
Economy
E(EPS)= 8.80 x 40% + 3.60 x 60% = $5.68
EPS = $6
Prob. = 20%
Bad -> P(B) = 60%
E(EPS/Bad) = 6 x 20% + 3 x 80% = $3.60
EPS = $3
Prob = 80%
Level I Quantitative Methods
of the PORTFOLIO MATHEMATICS
CFA® Program
Learning Outcome Statements
LOS : Define shortfall risk, calculate the safety-first ratio, and identify an
optimal portfolio using Roy’s safety-first criterion.
Portfolio Risk and Return
Expected Return on a Portfolio
The expected return on a portfolio is a function of the returns on the individual assets and of their
respective weights.
𝑡𝑡
Similar to above we can replace covariance in the formula by correlation and product of individual
asset standard deviation.
Standard deviation
𝜎𝜎𝑃𝑃 = �𝜎𝜎 2 𝑃𝑃
Standard deviation can be directly calculated as the weighted average of individual asset standard
deviation when the correlation between assets is 1.
Covariance
Covariance between two random variables is the probability-weighted average of the cross-products
of each random variable’s deviation from its own expected value.
COVX,Y = P × [X i − E(X)] × [Yi − E(Y)]
Properties
• Covariance is symmetric, i.e., Cov(X,Y) = Cov(Y, X).
• Covariance can range from positive infinity to negative infinity.
• The covariance of X with itself, Cov(X,X), is equal to variance of X, Var(X).
• Covariance can be positive, negative and zero (no relationship).
Limitation
• Difficult to compare covariance across data sets that have different scales.
• Difficult to interpret covariance as it can take on extreme large values.
• Does not tell us anything about the strength of the relationship between the two variables.
Portfolio Risk and Return
Correlation
Correlation coefficient measures the strength and direction of the linear relationship between two
random variables.
COVX,Y
CORR X,Y =
𝜎𝜎𝑋𝑋 𝜎𝜎𝑌𝑌
Properties
• The correlation coefficient can have a maximum value of +1 and a minimum value of −1.
• A correlation coefficient greater than 0 means that when one variable increases (decreases) the
other tends to increase (decrease) as well.
• A correlation coefficient less than 0 means that when one variable increases (decreases) the other
tends to decrease (increase).
• A correlation coefficient of 0 indicates that no linear relation exists between the two variables.
Limitation
• It does not specify which factor or variable causes the linear relationship between the two
variables.
• Further, like covariance, the correlation coefficient is a measure of linear association.
Example: Calculate the covariance and correlation using the below joint probability distribution.
RA = 10% RA = 15% RA = 20%
RB = 15% 0.30 - -
RB = 20% - 0.60 -
RB = 25% - - 0.10
Properties
• It is completely described by its mean (μ) and variance (σ2).
• Skewness is 0 (50% observation lie above the mean and 50% observations lie below the mean).
• Kurtosis equals 3 and excess kurtosis equals 0.
• A linear combination of normally distributed random variables is also normally distributed.
• The probability of the random variable lying in ranges further away from the mean gets smaller
and smaller, but never goes to zero.
Univariate vs Multivariate Distributions
Univariate • Describes the distribution of a single random variable.
Multivariate • Specify probabilities associated with a group of random variables considering
the interrelationships that may exist between them.
• A multivariate normal distribution for the return on a portfolio with n stocks is
completely defined by the following three parameters:
o n means
o n variances
o n(n-1)/2 pairwise correlation
• Example: Identify the parameters required for portfolio that includes 4 stocks?
o 4 means
o 4 variances
o 6 (4 x 3 / 2) pairwise correlation
The Normal Distribution – Confidence Interval
Confidence Interval
Represents the range of values within which a certain population parameter is expected to lie in a
specified percentage of the time.
𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶𝐶 𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼𝐼 = 𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀 ± 𝑍𝑍 𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣𝑣 × 𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆 𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆 𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷
Approximate Z values Precise Z values
Degree of Confidence Z value Degree of Confidence Z value
50% 2/3 90% 1.65
68% 1.00 95% 1.96
95% 2.00 99% 2.58
99% 3.00
Graphical Representation
Example: The average annual return on a stock is 15% and the standard deviation of returns equals
5%. Given that the stock's returns are distributed normally, calculate the 90% confidence interval for
the return in any given year.
Solution
The 90% confidence interval is calculated as:
Upper limit = Sample Mean + Z value × Sample Standard Deviation
Upper limit = 15% + 1.65 × 5% = 23.25%
Interpretation: The probability that the return on the stock for a given year lies between 6.75% and
23.25% equals 0.90.
The Normal Distribution – Probability Applications
Calculating Probabilities with Normal Distribution Tables
• Standardization allows converting variables in different units into a single unit of measurement
(standard deviation).
𝑂𝑂𝑂𝑂𝑂𝑂𝑂𝑂𝑂𝑂𝑂𝑂𝑂𝑂𝑂𝑂 𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉 − 𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃𝑃 𝑀𝑀𝑀𝑀𝑀𝑀𝑀𝑀
𝑍𝑍 =
𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆𝑆 𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑
• This allows us to infer probabilities by referring to a single normal curve (standard normal
distribution curve with a mean = 0 and standard deviation = 1).
Key point: the z-value represents the number of standard deviations away from the population mean,
a given observation lies.
Key point:
• As the normal is a symmetrical distribution, we can say F(-Z) = 1 – F(Z)
• Remember for continuous random variables X > x is equal to X ≥ x.
The Normal Distribution – Probability Applications
Example: The points, X, scored by students in a class on their final exam are normally distributed with
a mean of 60 and standard deviation of 15.
Question 1: What is the probability that the points scored by a given student will be less than 80?
Question 2: What is the probability that the points scored by a given student will be more than 70?
Question 3: What is the probability that the points scored by a given student will be less than 40?
Question 4: What is the probability that the points scored by a given student will be between 30 and
90?
The Normal Distribution – Safety First & Shortfall Risk
Shortfall risk Refers to the probability that a portfolio's value or return, E(RP), will fall below a
particular target value or return (RT) over a given period.
Roy’s Safety- States that an optimal portfolio minimizes the probability that the actual portfolio
First Criteria return, RP, will fall below the target return, RT.
𝐸𝐸(𝑅𝑅𝑃𝑃 ) − 𝑅𝑅𝑇𝑇
𝑆𝑆𝑆𝑆 𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅𝑅 =
𝜎𝜎𝑃𝑃
• A random variable, Y, follows the lognormal distribution if its natural logarithm, ln Y, is normally
distributed.
• The reverse is also true: If the natural logarithm of random variable Y, ln Y, is normally distributed,
then Y follows the lognormal distribution.
Key features
• It is bounded by zero on the lower end.
• The upper end of its range is unbounded.
• It is skewed to the right (positively skewed).
Application
• The lognormal distribution is frequently used
to model the probability distribution of asset
prices because it is bounded by zero on the
lower side.
• The normal distribution, on the other hand, Uses the ratio of ending value to beginning value
can be (and is frequently) used as an (V1/V0) as the random variable.
approximation for returns.
Key point: If a stock's continuously compounded return is normally distributed, then future stock
price must be lognormally distributed.
𝐴𝐴𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛𝑛 𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉 = 𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷𝐷 𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉𝑉 ∗ √𝑁𝑁𝑁𝑁. 𝑜𝑜𝑜𝑜 𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡𝑡 𝑑𝑑𝑑𝑑𝑑𝑑𝑑𝑑 𝑖𝑖𝑖𝑖 𝑡𝑡ℎ𝑒𝑒 𝑦𝑦𝑦𝑦𝑦𝑦𝑦𝑦
Monte Carlo Simulation
Concept • Generates random numbers and operator inputs to synthetically create
probability distributions for variables.
• It is used to calculate expected values and dispersion measures of random
variables, which are then used for statistical inference.
Steps Specifying the simulation
• Step 1: Specify the quantity of interest (revenue) along with the underlying
variables (e.g., price and quantity).
• Step 2: Specify a time period and break it into a number of subperiods.
• Step 3: Specify the distributional assumptions for the risk factors affecting the
underlying variables.
Running the simulation
• Step 4: Use a computer program or a spreadsheet function to draw random
values for each risk factor.
• Step 5: Based on the random values generated in Step 4, calculate the values for
the underlying variables (price and quantity).
• Step 6: Based on the average values of the underlying variables calculated in Step
5 calculate the revenue.
• Step 7: Iteratively go back to Step 4 until a sufficient number of trials has been
performed.
Investment • To experiment with a proposed policy before actually implementing it.
Applications • To provide a probability distribution that is used to estimate investment risk (Ex:
VAR).
• To provide expected values of investments that can be difficult to price.
• To test models and investment tools and strategies.
Limitations • Answers are only as good as the assumptions and model used.
• Does not provide cause-and-effect relationships.
Bootstrapping
Concept • Uses resampling – repeatedly draw samples from an original observed data.
• A special method of simulation where multiple samples are drawn from a sample
drawn from the original population.
Steps: • Follows similar steps to Monte Carlo method, but the difference is that the
bootstrapping draws its samples from a sample from the original population,
whereas the Monte Carlo method generates random samples from the
underlying distribution.
Limitations • A risk factor that was not represented in historical data will not be considered in
the simulation.
• It does not facilitate “what if” analysis when the “if” factor has not occurred in
the past. Monte Carlo simulation can be used for “what if” analysis.
• It assumes that the future will be similar to the past.
• It does not provide any cause-and-effect relationship information.
Level I Quantitative Methods
of the ESTIMATION AND INFERENCE
CFA® Program
Learning Outcome Statements
Sampling Error
Concept Is the error caused by observing a sample instead of the entire population to
draw conclusions relating to population parameters.
Formula Sampling error of the mean = Sample mean − Population mean = 𝑋𝑋� – 𝜇𝜇
The Sampling Distribution
A sampling distribution is the probability distribution of a given sample statistic under repeated
sampling of the population.
• Sampling distribution of the mean show the distribution of means obtained drawing same size
sample from the population and calculating the mean.
Example A population is made of 3 values: 2, 4 and 6. Given we can draw sample size of two
numbers with replacement, construct the sampling distribution of mean.
Key point: mean of sample mean is 4 which is equal to the population mean of 4 [(2 + 4 +
6) / 3 = 4
Central Limit Theorem
Make accurate statements about the population mean and variance using the sample mean and
variance regardless of the distribution of the population, as long as the sample size is adequate.
• An adequate sample size is defined as one that has more than 30 observations (n ≥ 30).
Key Properties
1 No matter what the distribution of the population, for a sample whose size is greater than or
equal to 30, the sample mean will be normally distributed.
2 The mean of the population (μ) and the mean of the distribution of sample means (𝑋𝑋�) are equal.
3 The variance of the distribution of sample means equals σ2/n, or population variance divided by
sample size.
Example: Given the population mean of 10% and population standard deviation of 20%, what shall be
the mean, variance, and standard deviation of sampling distribution with a sample size of 49.
Solution
• Mean of sampling distribution = population mean = 10%
• Variance of sampling distribution = σ2/n = 202/49 = 8.1632
• Standard deviation of sampling distribution = √8.1632 = 2.8571% or 20/√49 = 2.8571
Standard Error
The standard deviation of the distribution of sample means is known as the standard error of the
statistic. Standard error decreases as sample size increases.
Population variance is known Population variance is not known
𝜎𝜎 𝑠𝑠
𝜎𝜎𝑋𝑋� = 𝑠𝑠𝑋𝑋� =
√𝑛𝑛 √𝑛𝑛
Example: A manufacturer claims that the life of the batteries that it produces is normally distributed
with an average life of 30 hours, and a population standard deviation of 5 hours. For a random sample
of 40 batteries calculate the standard error.
𝜎𝜎 5
𝜎𝜎𝑋𝑋� = = = 0.79
√𝑛𝑛 √40
Interpretation: If we were to take all the possible random samples of 40 batteries each from the
entire population of batteries produced by this manufacturer, and prepare a distribution of sample
means, the distribution would have an average life of 30 hours, and a standard error (standard
deviation of the sampling distribution) of 0.79 hours.
Example: A sample containing the monthly returns for 50 U.S. stocks has a mean of 5% and a
standard deviation of 10%. Calculate the standard error of the sample mean.
𝑠𝑠 10
𝑠𝑠𝑋𝑋� = = = 1.41%
√𝑛𝑛 √50
Interpretation: If we were to take all possible samples of 50 stocks’ monthly returns from the
population of all stocks listed in the U.S., the sampling distribution would have a mean of 5% and a
standard error of 1.41%.
Standard deviation vs standard error
• If we want to draw conclusions about how spread out the data are, we use the standard
deviation.
• If we want to find out how precise the population parameter estimate (based on sampled data) is
relative to its true value, we use standard error.
Resampling
Resampling is the process of repeatedly drawing samples from the original observed data sample in
order to infer population parameters.
2 popular methods of resampling
Bootstrap • Process of using random sampling (with replacement) to construct the sampling
distribution of the sample mean.
• However, we have no knowledge of the population and thus the sample is treated as
the population.
• Powerful method for complicated estimators and does not rely on an analytical
formula to estimate the distribution of those estimators.
• Very accurate, and thus is used extensively in historical simulations in asset allocation
and in gauging an investment strategy's performance against a benchmark.
• The estimate of the standard error of the sample mean is calculated as follows:
𝑛𝑛
1 2
𝑠𝑠𝑋𝑋� = � ��θ�b − θ��
𝐵𝐵 − 1
𝑏𝑏=1
Decision rule: Reject if the test statistics is more than 1.96 or less than -
1.96.
Conclusion: as we reject the null, we can say that the career scoring
average is different from 30.
Testing Two Means
Population Relationship Assumptions regarding Type of Test
Distribution between Samples Population Variance
Normal Independent Equal t-test pooled variance
Normal Dependent Not applicable t-test paired comparison
Decade Number of months Mean monthly return (%) Standard deviation (%)
1950s 110 0.680 5.598
1960s 110 1.570 5.738
Step 1 State the hypothesis H0 : 𝜇𝜇1 − 𝜇𝜇2 = 0 This is a two-sided
Ha : 𝜇𝜇1 − 𝜇𝜇2 ≠ 0 (tailed) test.
Step 2 Identify the appropriate test This is test of two means and thus we shall use t test
statistics statistics.
Step 3 Specify the level of 10% (given in the question).
significance
Step 4 State the decision rule To find critical value, look at the at 10% column under
two tailed heading or 5% column under one tailed
heading at a degree of freedom of 218 (110 + 110 - 2)
Critical value = ±1.653.
(0.68 − 1.57 ) − 0
𝑡𝑡 = = −1.165
�32.1311 + 32.1311
110 110
Where
(𝑛𝑛1 − 1)𝑠𝑠 2 1 + (𝑛𝑛2 − 1)𝑠𝑠 2 2
𝑠𝑠 2 𝑃𝑃 =
𝑛𝑛1 + 𝑛𝑛2 − 2
𝑠𝑠 2 𝑃𝑃 = 32.311
Step 6 Make a decision Fail to reject the null as the test statistic (-1.165) falls in-
between upper (1.653) and lower critical value (-1.653).
• The p-value is the smallest level of significance at which the null hypothesis can be rejected.
• It represents the probability of obtaining a critical value that would lead to rejection of the null
hypothesis.
Decision rule
Reject p value < level of significance
Fail to reject p value > level of significance
Example: using a software, an analyst calculates the p value as 2%. At what level of significance from
those given below will the null be rejected?
• 5% level of significance: reject as p value is lower
• 1% level of significance: fail to reject as p value is higher
Type-I, Type-II Errors
Decision H0 is true H0 is false
Do not reject H0 Correct decision Incorrect decision
Type 2 error
Reject H0 Incorrect decision Correct decision
Test the hypothesis that the correlation amongst the returns on the funds is different from zero at 5%
significance level.
Non-Parametric Test of Correlation
Step 1: state the hypothesis H0 : rS = 0
Ha : rS ≠ 0
Step 2: calculate the spearman 6 𝑥𝑥 1,982
𝑟𝑟𝑠𝑠 = 1 − = 0.6397
correlation 32(322 − 1)
Step 3: calculate t statistics. 𝑟𝑟√𝑛𝑛 − 2 0.6397√32 − 2
𝑡𝑡 = = = 4.522
(as n > 30) √1 − 𝑟𝑟 2 √1 − 0.63972
Step 4: find the critical value For a two-tailed test with a 5% significance level, the critical
values for n − 2 = 32 − 2 = 30 degrees of freedom are ±2.042.
Step 5: make the decision Since the test statistic (4.522) is greater than the upper critical
value (2.042) for this two-tailed test, we reject the null.
Scaled Squared
Deviation
LOS: Describe a simple linear regression model, how least squares criterion is
used to estimate regression coefficients and interpretation of these coefficients.
LOS: Explain the assumptions underlying the simple linear regression model, and
describe how residuals and residual plots indicate if these assumptions may have
been violated.
LOS: Calculate and interpret the measures of fit and evaluate tests of fit and of
regression coefficients in a simple linear regression
LOS: Describe the use of analysis of variance (ANOVA) in regression analysis,
interpret ANOVA results, and calculate and interpret the standard error of
estimate in a simple linear regression.
LOS: Calculate and interpret the predicted value for the dependent variable, and a
prediction interval for it, given an estimated linear regression model and a value
for the independent variable.
• The second method describes computing the variation from the mean and explaining the reasons
for deviation from the mean.
𝑛𝑛
�(𝑌𝑌𝑖𝑖 − 𝑌𝑌�)2
𝑖𝑖=1
• The variation of Y is often referred to as sum of squared totals (SST).
Example: Compute the variation of Y for the above data?
Solution
Mean
Variation
Simple Linear Regression
Introducing Independent Variable
• Let’s evaluate if it’s possible to explain the variation in scores using another variable.
• We believe that the number of hours of effort in studies impacts the score and thus we have now
extended the table to include the hours put in by thhe students in 6 months prior to the test:
Student Scores (Y) Hours (X)
1 30 140
2 20 110
3 75 220
4 95 350
5 50 170
6 90 300
• It’s possible to observe a positive relationship between the hours and scores.
• The scores are dependent variable and hours is independent variable.
Example: Compute the variation of X for the above data?
Solution
Mean
Variation
Scatter Plot
• To visually observe the relationship, we can plot the same on the scatter plot.
Scatter Plot
100
300, 90 350, 95
80
220, 75
60
Scores
170, 50
40
140, 30
20 110, 20
0
0 100 200 300 400
Hours
• Vertical axis is the dependent varibale and horizontal axis is the independent variable.
• Each on the scatter plot repesents a paired observation that consists of scores and hours.
Simple Linear Regression
Establishing a Linear Relationship
• A common method for relating the dependent and independent variables is through the
estimation of a linear relationship, which implies describing the relation between the two
variables as represented by a straight line.
• The following linear regression model describes the relation between the dependent and the
independent variables.
𝑌𝑌𝑖𝑖 = 𝑏𝑏0 + 𝑏𝑏1 𝑋𝑋𝑖𝑖 + 𝜀𝜀𝑖𝑖
𝑏𝑏0 & 𝑏𝑏1 are the regression coefficients.
𝑏𝑏1 is the slope coefficient.
𝑏𝑏0 is the intercept term.
𝜀𝜀𝑖𝑖 is the error term that represents the variation in the dependent variable that is not explained
by the independent variable.
Estimating a Regression Line
• We cannot observe the population parameter values 𝑏𝑏0 and 𝑏𝑏1 in a regression model and instead
we observe, 𝑏𝑏�𝑜𝑜 and 𝑏𝑏�1 , which are estimates of population parameter using sample.
• The regression process estimates the line of best fit from the data in the sample.
• The regression line equation with one independent variable takes the following form:
𝑌𝑌�𝑖𝑖 = 𝑏𝑏�𝑜𝑜 + 𝑏𝑏�1 𝑋𝑋𝑖𝑖
𝐶𝐶𝐶𝐶𝐶𝐶(𝑋𝑋, 𝑌𝑌) ∑𝑖𝑖=1(𝑌𝑌𝑖𝑖 − 𝑌𝑌�)(𝑋𝑋𝑖𝑖 − 𝑋𝑋�)
𝑛𝑛
𝑏𝑏�1 = =
𝑉𝑉𝑉𝑉𝑉𝑉(𝑋𝑋) ∑𝑛𝑛𝑖𝑖=1(𝑋𝑋𝑖𝑖 − 𝑋𝑋�)2
60
40
20
0
0 100 200 300 400
Hours
• In simple linear regression, the estimated intercept, 𝑏𝑏�𝑜𝑜 and 𝑏𝑏�1 , are such that the sum of the
squared vertical distances from the observations to the fitted line is minimized.
𝑛𝑛 𝑛𝑛 𝑛𝑛
2 2
𝑆𝑆𝑆𝑆𝑆𝑆 = ��𝑌𝑌𝑖𝑖 − �𝑌𝑌�𝑖𝑖 �� = ��𝑌𝑌𝑖𝑖 − �𝑏𝑏�𝑜𝑜 + 𝑏𝑏�1 𝑋𝑋𝑖𝑖 �� = �[𝜀𝜀𝑖𝑖 ]2
𝑖𝑖=1 𝑖𝑖=1 𝑖𝑖=1
Simple Linear Regression
Simple Linear Regression
Example
Develop a linear regression equation using the below information provided:
Student Scores (Y) Hours (X)
1 30 140
2 20 110
3 75 220
4 95 350
5 50 170
6 90 300
Solution
Student Scores (Y) Hours (X) � )𝟐𝟐 (𝑿𝑿𝒊𝒊 − 𝑿𝑿
(𝒀𝒀𝒊𝒊 − 𝒀𝒀 � )(𝑿𝑿𝒊𝒊 − 𝑿𝑿
� )𝟐𝟐 (𝒀𝒀𝒊𝒊 − 𝒀𝒀 �)
1 30 140
2 20 110
3 75 220
4 95 350
5 50 170
6 90 300
Sum
Average
Y X
Variance
Std. Dev.
Covariance (X,Y)
Correlation (X,Y)
Simple Linear Regression
Example
Slope
Intercept
Regression Equation
Slope Coefficient
Intercept
Final relationship
Interpretation
Assumptions in Linear Regression
Linearity • The relationship between the dependent variable, Y, and the independent
variable, X, is linear.
Homoskedasticity • The variance of the regression residuals is the same for all observations.
Independence • The observations, pairs of Ys and Xs, are independent of one another. This
implies the regression residuals are uncorrelated across observations.
Normality • The regression residuals are normally distributed.
• F-test, which tests whether all the slope coefficients in the regression are equal to zero.
• With one independent variable, the F-test basically tests a null hypothesis:
𝐻𝐻0 ∶ 𝑏𝑏1 = 0 / 𝐻𝐻𝑎𝑎 ∶ 𝑏𝑏1 ≠ 0
In order to the hypothesis, we need to compute the F stat and degree of freedom.
F statistics 𝑀𝑀𝑀𝑀𝑀𝑀
𝐹𝐹 − 𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠 =
𝑀𝑀𝑀𝑀𝑀𝑀
Degree of freedom Numerator = k = 1
Denominator = n – k – 1 = n – 1 - 1 = n - 2
Decision rule The F-test is a one-tailed test. The decision rule for the test is that we reject H0
if F-stat > Fcrit.
Example Application of F-test to determine whether the slope coefficient of the
regression equals 0 at the 5% significance level.
• Total variation in dependent variable (inflation) = 0.003094
• Sum of squared errors (unexplained variation) = 0.000259
• Number of observations = 6
• Number of independent variables = 1
• The F-critical value with 1 and 4 degrees of freedom and a 5% significance
level is 7.7086.
Calculation of F stat:
Source Sum of squares Degree of Mean Square F-stat
Freedom
Regression
Error
Total
Decision:
Key point • For a one-independent variable regression, the F-stat is basically the same
as the square of the t-stat for the slope coefficient.
• Since it duplicates the t-test for the significance of the slope coefficient for
a one-independent variable regression, analysts only use the F-stat for
multiple-independent variable regressions.
Measures of Goodness of Fit –F Test
• F-test, which tests whether all the slope coefficients in the regression are equal to zero.
• With one independent variable, the F-test basically tests a null hypothesis:
𝐻𝐻0 ∶ 𝑏𝑏1 = 0 / 𝐻𝐻𝑎𝑎 ∶ 𝑏𝑏1 ≠ 0
In order to the hypothesis, we need to compute the F stat and degree of freedom.
F statistics 𝑀𝑀𝑀𝑀𝑀𝑀
𝐹𝐹 − 𝑠𝑠𝑠𝑠𝑠𝑠𝑠𝑠 =
𝑀𝑀𝑀𝑀𝑀𝑀
Degree of freedom Numerator = k = 1
Denominator = n – k – 1 = n – 1 - 1 = n - 2
Decision rule The F-test is a one-tailed test. The decision rule for the test is that we reject H0
if F-stat > Fcrit.
Example Application of F-test to determine whether the slope coefficient of the
regression equals 0 at the 5% significance level.
• Total variation in dependent variable (inflation) = 0.003094
• Sum of squared errors (unexplained variation) = 0.000259
• Number of observations = 6
• Number of independent variables = 1
• The F-critical value with 1 and 4 degrees of freedom and a 5% significance
level is 7.7086.
Calculation of F stat:
Source Sum of squares Degree of Mean Square F-stat
Freedom
Regression
Error
Total
Decision:
Key point • For a one-independent variable regression, the F-stat is basically the same
as the square of the t-stat for the slope coefficient.
• Since it duplicates the t-test for the significance of the slope coefficient for
a one-independent variable regression, analysts only use the F-stat for
multiple-independent variable regressions.
Hypothesis Test of Regression Parameters | Slope Coefficient
t-distributed test statistic is used to test hypotheses that a regression coefficient is different from a
specific value or whether the slope is positive.
t test statistics 𝑏𝑏�1 − 𝐵𝐵1
𝑡𝑡 =
𝑠𝑠𝑏𝑏�1
Where
𝑆𝑆𝑒𝑒
𝑠𝑠𝑏𝑏�1 =
�∑𝑛𝑛𝑖𝑖=1(𝑋𝑋𝑖𝑖 − 𝑋𝑋�)2
𝑠𝑠𝑏𝑏�1 = standard error of the slope coefficient
𝑆𝑆𝑒𝑒 = standard error of the estimate
(X i − � X)2 = variation of the independent variable
Critical value n − k − 1 or n − 2 degrees of freedom
Example Use the t-test to determine whether the slope coefficient of the regression
equals 0 at the 5% significance level given the below information:
• Variation of the independent variable = 0.004921
• Standard error of estimate = 0.0085
• Estimated value of the slope coefficient = 0.7591
• n=6
• Critical value: With 4 degrees of freedom at the 5% significance level, the
critical t values are −2.776 and +2.776.
Calculation of t stat:
Calculation of standard error of
slope coefficient
Calculation of t stat
Decision:
Hypothesis Test of Regression Parameters | Intercept
t-distributed test statistic can also be used to test hypotheses as to whether the population intercept
is a specific value
t test statistics 𝑏𝑏�0 − 𝐵𝐵0
𝑡𝑡 =
𝑠𝑠𝑏𝑏�0
Where
1 𝑋𝑋� 2
𝑠𝑠𝑏𝑏�0 = � + 𝑛𝑛
𝑛𝑛 ∑𝑖𝑖=1(𝑋𝑋𝑖𝑖 − 𝑋𝑋�)2
𝑠𝑠𝑏𝑏�1 = standard error of the slope coefficient
𝑆𝑆𝑒𝑒 = standard error of the estimate
(X i − � X)2 = variation of the independent variable
Critical value n − k − 1 or n − 2 degrees of freedom
Example Use the t-test to determine whether the intercept is less than 0 at the 5%
significance level. given the below information:
• Variation of the independent variable = 0.004921
• Mean value of independent variable = 0.1012
• Estimated value of the intercept = -0.005
• n=6
• Critical value: The t-critical value for this one-tailed test at the 5% level of
significance with 4 degrees of freedom is −2.132.
State the hypothesis
Calculation of t stat:
Calculation of standard error of
slope coefficient
Calculation of t stat
Decision:
Linear Regression using Indicator (Dummy) Variable
Indicator variables in regression models help analysts determine whether a particular qualitative
variable explains the variation in the model's dependent variable to a significant extent.
Example: A study is conducted to see if the return on a stock over a particular month is influenced
significantly by the company's quarterly earnings announcement.
𝑅𝑅𝑅𝑅𝑅𝑅𝑖𝑖 = 𝑏𝑏0 + 𝑏𝑏1 𝐸𝐸𝐸𝐸𝐸𝐸𝐸𝐸1 + 𝜀𝜀𝑖𝑖
𝐸𝐸𝐸𝐸𝐸𝐸𝐸𝐸1 = value of 0 if there is no earnings announcement that month and 1 if there is an earnings
announcement.
The table shown here presents the results of a regression of 30 months of returns data with EARN,
the return during a month when there was an earnings announcement, as the indicator variable.
Regression Estimation Results
Source Estimated coefficients Standard error of Calculated t Statistics
coefficients
Intercept 0.5629 0.0560 10.0596
Earn 1.2098 0.1158 10.4435
Degree of freedom: 28 / Critical value = ±2.0484 (5% level of significance)
Analysis
• The intercept term indicates the average return for a non-earnings announcement month
(0.5629).
• The slope coefficient indicates the average difference in returns between an earnings
announcement month and a non-earnings announcement month (1.2098).
• The t-statistic (10.4435) enables us to reject the null hypothesis (at the 5% significance level) that
the slope coefficient on EARN is zero (since it is greater than 2.0484) and thus conclude that the
return during a month where earnings are announced is significantly different from the return
during a month with no earnings announcement.
• The t-statistic on the intercept (10.0596) allows us to reject the null hypothesis that the intercept
is zero at the 5% significance level.
Linear Regression using Indicator (Dummy) Variable
Regression results can also enable us to test the hypothesis that the return is the same across
earnings and non-earnings months.
𝐻𝐻0 : 𝜇𝜇𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒 = 𝜇𝜇𝑛𝑛𝑛𝑛𝑛𝑛 −𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒 / 𝐻𝐻0 : 𝜇𝜇𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒 ≠ 𝜇𝜇𝑛𝑛𝑛𝑛𝑛𝑛 −𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒𝑒
The table presented here shows the results of this hypothesis test.
Test of Differences in Means
Ret. – earning Ret. – non-earning Difference in mean
announcement month announcement month
Mean 1.7727 0.5629 1.2098
Variance 0.1052 0.0630
Observations 7 23
Pooled variance 0.07202
Calculated t statistics 10.4435
Degree of freedom: 28 / Critical value = ±2.0484 (5% level of significance)
Since the calculated t-statistic (10.4435) exceeds the critical value (2.0484), we reject the null
hypothesis that there is no difference in the mean RET for the earnings-announcement and non-
earnings-announcement months at the 5% level of significance.
Predicted value
Confidence
interval
Different Functional Forms of Regression Models
• Sometimes the relationship between the independent variable and the dependent variable may
not be linear.
• In such cases, we can often transform one or both of these variables to convert this relation to a
linear form, which then allows the use of simple linear regression
Three commonly used functional forms
Log-lin model • The dependent variable is logarithmic, but the independent variable is linear.
• The slope coefficient in this model is the relative change in the dependent
variable for an absolute change in the independent variable.
Lin-log model • The dependent variable is linear, but the independent variable is logarithmic.
• The slope coefficient in this regression model provides the absolute change in
the dependent variable for a relative change in the independent variable.
Log-log model • Both the dependent and independent variables are logarithmic form.
In order to determine the appropriate functional form of a simple linear regression one must examine
the goodness of fit measures:
• Coefficient of determination (R2)—higher is better,
• F-statistic—higher is better, and
• Standard error of the estimate (se)—lower is better.
Further, the residuals should be examined. Ideally, the residuals should be random.
Level I Quantitative Methods
of the INTRODUCTION TO BIG DATA TECHNIQUES
CFA® Program
Learning Outcome Statements
LOS : Describe aspects of “fintech” that are directly relevant for the
gathering and analyzing of financial data.
LOS : Describe Big Data, artificial intelligence, and machine learning.
LOS : Describe applications of Big Data and Data Science to investment
management.
What is Fintech?
• Technological innovation in the design and delivery of financial services and products.
• It encompasses advanced systems used to analyze information and make decisions based on
machine-learning logic.
• Using such systems has brought about high levels of efficiency that surpass human capabilities.
• Services, and applications of fintech relevant to the investment industry include:
o Analysis of large datasets: Includes traditional data like security prices, financial
statements etc. and alternative data obtained from non-traditional sources like social
media, satellite images, online mentions, sensor networks etc.
o Analytical tools: Artificial Intelligence (AI) identifies complex, non-linear relationships.
AI analyses humungous amounts of data.
• Volume: Data represents billions of data points. In the form of tables, files, records.
• Velocity: Accelerated speeds of data recording and transmission. Real-time data is the new norm.
• Variety: Data collected from different sources and formats – structured (ex: SQL tables), semi-
structured (ex: HTML code), unstructured (ex: video messages).
Big Data not only draws from traditional sources but also from alternative data sources –
• Data generated by people: This is primarily generated through website clicks and page visits.
• Data produced by commercial operations: This is data left behind by business transactions and
activities including point-of-sale data, banking records from credit cards, corporate exhaust.
It is structured.
• Data produced by sensors: Typically, unstructured and is gathered through satellites,
smartphones, and webcams.
• Investment Professionals must beware of potential legal and regulatory issues. Ex: Scraping of
web data could lead to accessing personal information forbidden by regulators.
• Quality: Dataset used may have selection bias, missing data or has data outliers.
• Volume: The volume of the data set used may be insufficient.
• Appropriateness: The data may not be appropriate for the intended use.
Alternative data, is usually difficult to source, clean and organize before performing any analysis,
since it is mostly unstructured data.
Artificial intelligence and machine learning can be used to solve this problem.
Artificial Intelligence
Machine Learning
• It’s the idea that when exposed to more data, machines can make changes on their own and
produce solutions to problems without reliance on human expertise, improving their
performance over time.
• Requires large amount of data for “training.”
• Computer algorithm is given a set of variables or datasets as inputs OR it may be given the target
data as output.
• The algorithm then learns from the data provided how best to model the inputs to outputs or
how to describe the data structures if no output is provided.
Machine Learning | Types
Supervised Learning
• Computer is given a set of inputs and labeled outputs.
• The computer seeks to describe the data or understand its structure.
• Ex: Categorizing given data into sectors / groups.
Unsupervised Learning
• Computer is given a set of inputs but no outputs.
• Using the provided data, the machine can learn relationships that link inputs to output.
• Based on the above learning, the computer predicts outcomes for new datasets.
• Ex: Identify technical signals, forecast stock returns, predict stock market performance etc.
Deep Learning
• Relates to neural networks.
• Neural networks have many hidden layers, use a lot of non-linear processing to identify patterns.
• Can use both supervised / unsupervised techniques.
Data Visualisation
• Refers to how data would be formatted, displayed, and summarized in graphical form.
• Traditional data is visualized in tables, charts, trend lines etc.
• Non-traditional data uses new techniques such as 3D graphics. Multidimensional data extending
beyond 3D would demand more data to accurately be visualized.
• Various visualization techniques are used based on underlying data. Ex: Heat maps, Tag (Word)
Clouds, Mind maps etc.
Tackling Big Data with Data Science
Text Analytics and Natural Language Processing (NLP)
• Text Analytics uses computer programs to derive meaning from large unstructured text and voice
data. Ex: From Company filings, earnings calls, emails, surveys etc.
o Lexical Analysis: Word frequency is used to detect patterns based on key words / phrases.
o Predictive Analysis: Identify indicators of future performance like consumer sentiment.
• NLP, within the larger scope of text analysis, is used for translation, speech recognition, text
mining, sentiment analysis, topic analysis etc.
o It uses computer science, AI, and linguistics to interpret human language.
o When aided by ML techniques, NLP is used to analyse annual reports, call transcripts,
news articles etc. with more scale and accuracy than what a human can achieve.
o NLP analyzes nuances to provide insights around trending topics of interest like interest
rate policy, aggregate output, inflation expectations.