0% found this document useful (0 votes)
20 views6 pages

Probability and Statistics Project Report

The document outlines a project for a Probability and Statistics course at the University of Central Punjab, focusing on data collection and analysis related to a sample company with four developers. It includes tasks such as calculating statistical measures (mean, median, mode, standard deviation) for various data columns and interpreting the distribution shapes. The findings indicate left-skewed distributions for Unit_Cost and Unit_Price, with recommendations for improving profit stability.

Uploaded by

Ali Baba
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
20 views6 pages

Probability and Statistics Project Report

The document outlines a project for a Probability and Statistics course at the University of Central Punjab, focusing on data collection and analysis related to a sample company with four developers. It includes tasks such as calculating statistical measures (mean, median, mode, standard deviation) for various data columns and interpreting the distribution shapes. The findings indicate left-skewed distributions for Unit_Cost and Unit_Price, with recommendations for improving profit stability.

Uploaded by

Ali Baba
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd

a

University of Central Punjab


FOIT
Project-1

Probability and
Course Title Course Code MAT253
Statistics

Course
Aniqa Naeem Section SEC2
Instructor

Time Allowed Total Marks 20

Mian Muhammad
Abdullah, L1F24BSSE0084,
Muhammad Zain
L1F24BSSE0001,
Ali, Muhammad Registration
Student Name
Naeem Ali, No. L1F24BSSE0384,
Muhammad L1F24BSSE0090,
Yousuf, Sameer L1F24BSSE00,
Ahmed

TASK-01
Data Collection
Collect a data-set (or generate synthetic data) related to a real-world
CS topic.
The data is taken from a sample company that has 4 developers,
Dev1, Dev2, Dev3, Dev4. These 4 developers worked for 22 days and
the number of lines of code are given as data. The number of lines
written is taken as x-axis and y-axis shows the frequency of the data.
Result

Answer the following questions based on the histogram:


1. What is the shape of the distribution (e.g., Normal, Skewed,
Uniform)?
The histogram shows that it is Right skewed.
2. What does the histogram tell you about the data-set?
Its shows us that most of the set values are in interval of 164-204.
3. Are there any outliers or unusual patterns?
Q1 (25th percentile) = 185
Q3 (75th percentile) = 265
IQR = Q3 – Q1 = 265 – 185 = 80
Lower Bound
Q1 − 1.5 × IQR = 185 − 120 = 65
Upper Bound
Q3 + 1.5 × IQR = 265 + 120 = 385

Tasks-02

1- Calculate Mean, median, Mode and standard deviation using


column of Unit_Cost.
Mean = 1064.2
Median = 1252
Mode = 1252
Standard Deviation = 428.25
2- Calculate Mean, Median, Mode of column of Unit_Price.
Mean = 1945.7
Median = 2295
Mode = 2295
3- Discuss the shape of Distributions (i.e skewed, symmetrical
etc.)
Both Unit_Cost and Unit_Price have means lower than their medians and
modes, indicating left-Skewed (negatively skewed) distributions.
4- Calculate standard deviation, mean using column of Profit.
Mean = 1858
Standard Deviation = 1400
5) Coefficient of Variation (CV): Cost vs Profit

Unit_Cost

428.25/1064.2 = 0.40 or 40 Percent

Profit

1400 / 1858 = 0.75 or 75 Percent

6. Interpretation

 Both Unit_Cost and Unit_Price distributions are left-skewed, with most values concentrated
at higher levels.

 Costs are more stable than profits, as indicated by a lower coefficient of variation.

 Profit is riskier and more inconsistent, suggesting uncertainty in revenue generation.

 The business may benefit from strategies focused on stabilizing profit, such as demand
forecasting, pricing optimization, or cost control.

7. Formulas Used in Excel


Below are the Excel formulas, written heading-wise, for all three columns you used: Unit Price, Unit
Cost, and Profit.

This is assignment / exam ready and you can copy it directly.

1. Unit Price

Assume Unit Price values are in column P (from P2:P113037)

Mean (Average)

=AVERAGE(P2:P113037)

Median

=MEDIAN(P2:P113037)
Mode

=[Link](P2:P113037)

Standard Deviation

=STDEV.S(P2:P113037)

2. Unit Cost

Assume Unit Cost values are in column O (from O2:O113037)

Mean (Average)

=AVERAGE(O2:O113037)

Median

=MEDIAN(O2:O113037)

Mode

=[Link](O2:O113037)

Standard Deviation

=STDEV.S(O2:O113037)

3. Profit

Assume Profit values are in column Q (from Q2:Q113037)

Mean (Average)

=AVERAGE(Q2:Q113037)

Median

=MEDIAN(Q2:Q113037)

Mode

=[Link](Q2:Q113037)

Standard Deviation
=STDEV.S(Q2:Q113037)

You might also like