a
University of Central Punjab
FOIT
Project-1
Probability and
Course Title Course Code MAT253
Statistics
Course
Aniqa Naeem Section SEC2
Instructor
Time Allowed Total Marks 20
Mian Muhammad
Abdullah, L1F24BSSE0084,
Muhammad Zain
L1F24BSSE0001,
Ali, Muhammad Registration
Student Name
Naeem Ali, No. L1F24BSSE0384,
Muhammad L1F24BSSE0090,
Yousuf, Sameer L1F24BSSE00,
Ahmed
TASK-01
Data Collection
Collect a data-set (or generate synthetic data) related to a real-world
CS topic.
The data is taken from a sample company that has 4 developers,
Dev1, Dev2, Dev3, Dev4. These 4 developers worked for 22 days and
the number of lines of code are given as data. The number of lines
written is taken as x-axis and y-axis shows the frequency of the data.
Result
Answer the following questions based on the histogram:
1. What is the shape of the distribution (e.g., Normal, Skewed,
Uniform)?
The histogram shows that it is Right skewed.
2. What does the histogram tell you about the data-set?
Its shows us that most of the set values are in interval of 164-204.
3. Are there any outliers or unusual patterns?
Q1 (25th percentile) = 185
Q3 (75th percentile) = 265
IQR = Q3 – Q1 = 265 – 185 = 80
Lower Bound
Q1 − 1.5 × IQR = 185 − 120 = 65
Upper Bound
Q3 + 1.5 × IQR = 265 + 120 = 385
Tasks-02
1- Calculate Mean, median, Mode and standard deviation using
column of Unit_Cost.
Mean = 1064.2
Median = 1252
Mode = 1252
Standard Deviation = 428.25
2- Calculate Mean, Median, Mode of column of Unit_Price.
Mean = 1945.7
Median = 2295
Mode = 2295
3- Discuss the shape of Distributions (i.e skewed, symmetrical
etc.)
Both Unit_Cost and Unit_Price have means lower than their medians and
modes, indicating left-Skewed (negatively skewed) distributions.
4- Calculate standard deviation, mean using column of Profit.
Mean = 1858
Standard Deviation = 1400
5) Coefficient of Variation (CV): Cost vs Profit
Unit_Cost
428.25/1064.2 = 0.40 or 40 Percent
Profit
1400 / 1858 = 0.75 or 75 Percent
6. Interpretation
Both Unit_Cost and Unit_Price distributions are left-skewed, with most values concentrated
at higher levels.
Costs are more stable than profits, as indicated by a lower coefficient of variation.
Profit is riskier and more inconsistent, suggesting uncertainty in revenue generation.
The business may benefit from strategies focused on stabilizing profit, such as demand
forecasting, pricing optimization, or cost control.
7. Formulas Used in Excel
Below are the Excel formulas, written heading-wise, for all three columns you used: Unit Price, Unit
Cost, and Profit.
This is assignment / exam ready and you can copy it directly.
1. Unit Price
Assume Unit Price values are in column P (from P2:P113037)
Mean (Average)
=AVERAGE(P2:P113037)
Median
=MEDIAN(P2:P113037)
Mode
=[Link](P2:P113037)
Standard Deviation
=STDEV.S(P2:P113037)
2. Unit Cost
Assume Unit Cost values are in column O (from O2:O113037)
Mean (Average)
=AVERAGE(O2:O113037)
Median
=MEDIAN(O2:O113037)
Mode
=[Link](O2:O113037)
Standard Deviation
=STDEV.S(O2:O113037)
3. Profit
Assume Profit values are in column Q (from Q2:Q113037)
Mean (Average)
=AVERAGE(Q2:Q113037)
Median
=MEDIAN(Q2:Q113037)
Mode
=[Link](Q2:Q113037)
Standard Deviation
=STDEV.S(Q2:Q113037)