Paras Final Year Project File
Paras Final Year Project File
(BTCS 703-18)
Submitted By:
Name: Paras
University Roll No. 2337722
Submitted To:
Assistant Professor
Date: 04-12-2025
DECLARATION
I hereby declare that the Project Report entitled ("Virtual Drawing Tool") is an authentic record of my own
work as requirements of 7th sem academic during the period from 25th June,2023 to 27th Nov, 2023 for the
award of degree of B. Tech. ( Computer Science & Engineering , Chandigarh Engineering College under
the guidance of Mr. Rajeev Sharma
Date: 28-12-2023
Paras
2337722
Certified that the above statement made by the student is correct to the best of our knowledge and belief.
Signatures
Examined by
1. 2. 3. 4.
Head of Department
(Signature and Seal)
ACKNOWLEDGMENT
I take this opportunity to express my sincere gratitude to the Director- Principal Dr. Rajdeep Singh
Chandigarh Engineering College, Landran for providing this opportunity to carry out the present work.
I am highly grateful to the Dr. Sukhpreet Kaur HOD CSE, Chandigarh Engineering College, Landran
(Mohali). I would like to expresses my gratitude to other faculty members of Computer Science &
Engineering department of CEC, Landran for providing academic inputs, guidance & Encouragement
throughout the training period. The help rendered by Mr. Rajeev Sharma, Supervisor for
Experimentation is greatly acknowledged. Finally, I express my indebtedness to all who have directly
or indirectly contributed to the successful completion of my software training
(i)
TABLE OF CONTENTS
CERTIFICATE
DECLARATION
ACKNOWLEDGEMENT (i)
Table of content (ii)
List Of Figures (iii)
List Of Tables (iv)
ABSTARCT 1
1 INTRODUCTION 2
1.1 pneumonia impacts
1.2 motivation
1.3 Aim and Objectives of the work Literature Survey
2- LITERATURE REVIEW 3
2.1vandecia Fernandes et al.
2.2 hongen lug et al.
2.3sammy v. militant et al
2.4 Nanette Dionisio et al
3 SOFTWARE REQUIREMENT SPECIFICATION 5
3.1 introduction
3.2 functional requirement
3.3 non-functional requirement
(ii)
LIST OF TABLES
(iii)
LIST OF FIGURES
(iv)
ABSTRACT
Gesture is a symbol of physical behavior or emotional expression. It includes body gesture and hand
gesture. It falls into two categories: static gesture and dynamic [Link] the former, the posture
of the body or the gesture of the hand denotes a sign. For the latter, the movement of the bodyor
the hand conveys some messages. Gesture can be used as a tool of communication between
computerand human It is greatly different from the traditional hardware based methods and can
accomplish human-computer interaction through gesture recognition. Gesture recognition
determines the user intent through the recognition of the gesture or movement of the body or body
parts. In the pastdecades, many researchers have strived to improve the hand gesture recognition
technology. Hand gesture recognition has great value in many applications such as sign language
recognition augmented reality (virtual reality) [sign language interpreters for the disabled and robot
control].
In the authors detect the hand region from input images and then track and analyze the movingpath
to recognize America sign language. In Shimada et al. propose a TV control interface using hand
gesture recognition. Keskin et al. divide the hand into 21 different regions and train a SVM
classifier to model the joint distribution of these regions for various hand gestures so as to classify
the gestures. Zeng et al. improve the medical service through the hand gesture recognition. The
HCI recognition system of the intelligent wheelchair includes five hand gestures and three
compound states. Their system performs reliably in the environment of indoor and outdoor and in
the condition of lighting change.
The overview of the hand gesture recognition is described in Figure 1. First, the hand is detected
using the background subtraction method and the result of hand detection is transformed to a binary
image. Then, the fingers and palm are segmented so as to facilitate the finger recognition. Moreover,
the fingers are detected and recognized. Last, hand gestures are recognized using a simple rule
classifier.
1
CHAPTER 1- INTRODUCTION
2
CHAPTER 2-LITERATURE REVIEW
3
2.3 SAMMY V. MILITANTE ET AL.[
Proposed,The observations employ flexible and efficient approaches of deep learning by making use
of six models of CNN in predicting and spotting whether the patient is unaffected or affected by the
disease employing a chest X-ray picture. Google Net, Le Net, VGG-16, Alex Net, Strident, and
ResNet-50 models with a dataset of 28,000 images and using a 224x224 decision with 32 and sixty-
four batch sizes are implemented to verify the performance of every version being educated. The study
likewise implements Adam as an optimizer that continues an adjusted 1e-four learning rate and an
epoch of 500 hired to all the models. Both Google Net and Le Net acquired a ninety-eight% price,
VGG Net -sixteen earned an accuracy charge of ninety-seven%, Alex Net and Stride Net model
acquired a ninety-six% whilst the ResNet-50 version acquired 80% throughout the training of models.
Google Net and Le Net fashions performed the best accuracy rate for overall performance training.
The six models identified had been capable of hitting upon and are expecting pneumonia sickness
including a wholesome chest X-ray.
1
1. Bayesian convolutional neural Vand
ecia
202
1
Bayesian
convolut
network estimation for Fern ional
paediatric pneumonia detection ande neural
and diagnosis s et network
al.
4
3 Pneumonia and COVID-19 Sam 202 Convolu
Detection using Convolutional my 1 tional
Neural Networks V. Neural
Milit Network
ante s
et al.
5
CHAPTER 3 - SOFTWARE REQUIREMENT SPECIFICATION
3.1 INTRODUCTION
This software requirement specification (SRS) report expresses a complete description about
the proposed System. This document includes all the functions and specifications with their
explanations to solve related problems.
1. The product must have an interface which is simple enough to under - stand.
3. User-Friendly.
• Dependencies:
1. All necessary software is available for implementing and use of the system.
2. The proposed system would be designed, developed and implemented based on the software
requirements specifications document.
3. End users should have basic knowledge of computers and we also assure that the users will be given
software training documentation and reference material.
6
3.2 FUNCTIONAL REQUIREMENT
• Accuracy: - System should correctly execute the process and display the result accurately. System
output should be in user required format.
2. Functionality: The ability of the system to do the work for which it was intended.
3. Performance: The response time, utilization, and throughput behavior of the system. Not to be
confused with human performance or system delivery time.
7
4. Security: A measure of system’s ability to resist unauthorized attempts at usage or behavior
modification, while still providing service to legitimate users.
5. Availability: (Reliability quality attributes fall under this category) the measure of time that the
system is up and running correctly; the length of time between failures and the length of time needed
to resume operation after a failure.
6. Usability: The ease of use and of training the end users of the system. Sub qualities: learn ability,
efficiency, affect, helpfulness, control.
● Google Collab
● TensorFlow v2.7.0
● CUDA v11.5
● CuDNN v8.3
● 8 GB RAM
8
with its features and weaknesses. Evolving from the first and oldest waterfall SDLC model, their
variety significantly expanded.
The SDLC models diversity is predetermined by the wide number of product types starting with a web
application development to a complex medical software. And if you take one of the SDLC models
mentioned below as the basis in any case, it should be adjusted to the features of the product, project,
and company. The most used, popular and important SDLC models are given below:
1. Waterfall Model
2. Iterative Model
3. Spiral Model
4. V-shaped Model
5. Agile Model
Waterfall is a cascade SDLC model, in which the development process looks like the flow, moving
step by step through the phases of analysis, projecting, realization, testing, implementation, and
support. This SDLC model includes gradual execution of every stage completely. This process is
strictly documented and predefined with features expected to every phase of this software development
life cycle model.
9
clearly understood the tasks and how every requirement is going to be implemented. Often, the
discussion involves the QA specialists who can interfere the process with additions even during the
development stage if it is necessary.
After the requirements are approved, the process goes to the next stage of actual development.
Programmers start here with the source code writing while keeping in mind previously defined
requirements. The system administrators adjust the software environment, frontend programmers
develop the user interface of the program and the logic for its interaction with the server. The
programming by itself assumes four stages:-
• Algorithm development
• Compilation
4. Testing
The testing phase includes the debugging process. All the code flaws missed during the development
are detected here, documented, and passed back to the developers to fix. The testing process repeats
until all the critical issues are removed and software work ow is stable.
5. Deployment
When the program is finalised and has no critical issues it is time to launch it for the end users. After
the new program version release, the tech support team joins. This department provides user feedback;
consult and support users during the time of exploitation. Moreover, the update of selected components
is included in this phase, to make sure, that the software is up-to-date and is invulnerable to a security
breach.
10
CHAPTER 4- DESIGN AND MODELLING OF SYSTEM
Unified Modelling Language (UML) is analogous to the blueprints used in other fields and consists of
different types of diagrams. In the aggregate, UML diagrams describe the boundary, structure, and
behaviour of the system and the objects within it.
Following UML diagrams have been designed for the project:
12
4.4 ACTIVITY DIAGRAM
The activity diagram is another important diagram in UML to describe the dynamic aspects of the
system. An activity diagram is basically a flowchart to represent the flow from one activity to another
activity. The activity can be described as an operation of the system. The control flow is drawn from
one operation to another.
13
Fig 4.5. State Diagram
In 2017 Dr. Paul Mooney started a competition on Kaggle on viral and bacterial pneumonia
classification. It contained 5,863 pediatric images; hence it is very different from the other datasets.
We are referring to the revised version of this dataset.[6]
In addition, the database is organized into three folders (Train, Test, Val) and contains subfolders for
each category of image (Pneumonia / General). All images have been resized to a static, A few
examples of common and pneumonia images are listed in Figure 1. Chest X-ray images always have
signs of limited brightness on account of the low dose of exposure in patients, due to chest X-ray
images always containing black, white, and grey pants. The lungs are located on both sides of the
thoracic cavity and the lung area can be easily detected by X-ray, which is almost black. The heart,
located between the lungs, appears almost as white as X-rays can completely pass through the heart.
Bones are made of protein and very dense, so X-rays cannot cross it and the bones are shown almost
white. Ku moreover, the bones have clear edges.
(a)
(b)
Fig 5.1 Examples from the dataset. (a) normal cases (b) pneumonia cases
5.2 Data Pre-Processing
The strategies used throughout this paper are listed in Table 2. In our study, rescale is a value by which
we will multiply the data before any other processing. Our original images consist of RGB coefficients
14
in the 0-255, but such values would be too high for our models to process (given a typical learning
rate), so we target values between 0 and 1 instead of by scaling with a 1/255. factor. shear range is for
randomly applying shearing transformations zoom range is for randomly zooming inside pictures,
horizontal flip is for randomly flipping half of the images horizontally --relevant when there are no
assumptions of horizontal asymmetry (e.g. real-world pictures)
Rescale 1./255
Table 2
Tanh, ReLU, etc. can be used for activation function, but ReLU is the most preferred activation
function. input shape takes in image width & height with the last dimension as a color channel. then
we Flatten the input after CNN layers and added ANN layers.
f(x)=max(0,x)
S(x) = Sigmoid
f(x) = ReLU
Used activation function as SoftMax for the last layers(ANN Layers) also defined units as the total
number of classes and used sigmoid for binary classification and set unit to 1.
15
Fig 5.3. Details of proposed DL mode
Conv Layers
The first block makes the particularity of this type of neural network since it functions as a
feature extractor. To do this, it performs template matching by applying convolution filtering
operations. The first layer filters the image with several convolution kernels and returns
“feature maps”, which are then normalized (with an activation function) and/or resized.
Image
Convolved feature Fig 5.4.1 Conv Layer
16
(feature Map)
Pool Layers
The second block is not characteristic of a CNN: it is in fact at the end of all the neural networks
used for classification. The input vector values are transformed (with several linear combinations and
activation functions) to return a new vector to the output. This last vector contains as many elements
as there are classes: element I represents the probability that the image belongs to class I. Each
element is therefore between 0 and 1, and the sum of all is worth 1. These probabilities are calculated
by the last layer of this block (and therefore of the network), which uses a Sigmoid function (binary
classification) or a RELU function (multi-class classification) as an activation function.
Activation Function
An activation function is a mathematical function applied to the output of a neuron in a neural network.
It introduces non-linearity to the network, allowing it to learn and model complex relationships
between inputs and outputs. Common activation functions include the sigmoid, ReLU, and tanh
functions.
ReLU
Rectified linear unit is most widely used and preferred activation function right now which ranges from
0 to infinity, All the negative values are converted into zero.
f(x)=max(0,x)
Sigmoid
17
The sigmoid function also called a logistic function. having a characteristic that can take any real
value and map it to between 0 to 1. It decides which value to pass as output and what not to pass.
VGG16 Model
VGG16 is a popular convolutional neural network architecture for image classification tasks. It
consists of 16 layers, including 13 convolutional layers and 3 fully connected layers. The
convolutional layers use small 3x3 filters with a stride of 1 and are followed by max pooling layers.
The fully connected layers at the end of the network perform the classification task. The VGG16
model has achieved state-of-the-art performance on several image recognition benchmarks, making it
a popular choice for computer vision tasks.
18
CHAPTER 6- IMPLEMENTATION AND TESTING
19
6.2.1 DEFINING PARAMETERS FOR DATA AUGMENTATION
Fig 6.2.2 Applying defined parameters to Train, Test & Valid sets
Using the “[Link]” library, for the Train Set, we created an Image
Data Generator that randomly applies defined parameters to the train set and for the Test &
Validation set, we’re just going to rescale them to avoid manipulating the test data beforehand.
20
We have analyzed the performance of these bellow CNN architectures namely; Alex Net, ResNet-50,
and VGGNet-16. [3]
Table 3. Analysis of model Performance for CNN Models with 32 Batch Size
22
CHAPTER 7- RESULTS
rescale - Each digital image is made up of pixels with values ranging from 0 to 255, with 0
representing black and 255 representing white. As a result, rescale the scales array of the original
picture pixel values to [0,1], ensuring that the photos contribute evenly to the overall loss. Otherwise,
a higher pixel range image causes more loss and should be utilised with a lower learning rate, whereas
a lower pixel range image requires a higher learning rate.
shear range - The shear's transformation is the image's shape. It fixes one axis and extends the image
at a specific angle called the shear angle.
The image has been magnified by a zoom of less than 1.0. The image has been zoomed out by more
than 1.0.
23
rotation range - The image is rotated randomly between 0 and 180 degrees.
brightness range - a brightness of 0.0 means there is no light, and 1.0 means there is a lot of light.
fill mode - Fills the image's missing value with the nearest value, wrapped value, or reflected value.
24
Fig. 7.2.2 Confusion Matrix
25
7.3 VISUALIZING SOME PREDICTED IMAGES WITH PERCENTAGE %
27
CHAPTER 8 - CONCLUSION
The project “Pneumonia Detection and Classification using Deep Learning” demonstrates the
significant potential of artificial intelligence in assisting the medical community with fast, reliable, and
cost-effective diagnosis of lung infections. Pneumonia remains a major global health concern, affecting
millions of individuals every year, especially children and elderly people. Early detection plays a
crucial role in reducing mortality, but manual interpretation of chest X-ray images can be time-
consuming, prone to human error, and limited by the availability of trained radiologists. This project
provides a technological solution to address these challenges through automated image-based
diagnosis.
In this work, a deep learning model was trained on chest X-ray images to differentiate between
pneumonia-infected lungs and normal lungs. Convolutional Neural Networks (CNN), known for their
strong capability in extracting spatial features from medical images, were utilized to achieve efficient
classification. The model successfully learned to identify crucial patterns, such as lung opacity and
infiltrations, which are commonly associated with pneumonia. The experimental results indicate that
the system can classify X-ray images with high accuracy, demonstrating the effectiveness of deep
learning in medical imaging applications.
The development of this system also highlights the benefits of automation in healthcare. An AI-based
diagnostic tool can support radiologists by providing a second opinion, reducing workload, and
improving diagnostic throughput. In rural or resource-limited areas where expert radiologists may not
be available, such a system can serve as a valuable preliminary screening tool. Moreover, the model’s
ability to process large volumes of images at high speed ensures timely diagnosis, which is critical for
early intervention and patient recovery.
Throughout the project, several challenges were addressed, including dataset imbalance, overfitting,
and variations in X-ray quality. Techniques such as data augmentation, normalization, and model
tuning helped improve the model’s performance and generalization ability. The study also emphasizes
the importance of using standardized and diverse datasets to ensure robustness in real-world clinical
environments.
Overall, this project successfully demonstrates how deep learning can enhance the reliability and
efficiency of pneumonia detection. While the current system provides promising results, there remains
room for improvement, such as multi-disease classification, explainable AI integration, model
optimization, and deployment on mobile or cloud platforms. With further research and refinement, this
approach can evolve into a fully functional diagnostic support system capable of assisting healthcare
professionals and improving global health outcomes
28
CHAPTER 9- BIBLIOGRAPHY
➢ Vandecia Fernandes et al., “Bayesian convolutional neural network estimation for pediatric
pneumonia detection and diagnosis”, Computer Methods and Programs in Biomedicine,
Elsevier, 2021
➢ Sammy V. Militante et al., “Pneumonia and COVID-19 Detection using Convolutional Neural
Networks”, 2020 the third International on Vocational Education and Electrical Engineering
(ICVEE), IEEE, 2021
➢ Nanette V. Dionisio et al., “Pneumonia Detection through Adaptive Deep Learning Models of
Convolutional Neural Networks”, 2020 11th IEEE Control and System Graduate Research
Colloquium (ICSGRC 2020), 8 August 2020
➢ Md. Jahid Hasan et al., “Deep Learning-based Detection and Segmentation of COVID-19 &
Pneumonia on Chest X-ray Image”, 2021 International Information and Communication
Technology for Sustainable Development (ICICT4SD), 27-28 February 2021
➢ [Link]
➢ LeCun, Y.; Boser, B.; Denker, J.S.; Henderson, D.; Howard, R.E.; Hubbard, W.; Jackel, L.D.
Backpropagation applied to handwritten zip code recognition. Neural Comput. 1989, 1, 541–
551.
➢ Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional
neural networks. Adv. Neural Inf. Process. Syst. 2012, 25, 1097–1105.
➢ Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image
recognition. arXiv 2014, arXiv:1409.1556
➢ L. Wang and A. Wong, "COVID-Net: A tailored deep convolutional neural network design for
detection of COVID-19 cases from chest radiography images," arXiv:2003.09871, 2020.
29
CHAPTER 10 - FUTURE SCOPE
30
10. Clinical Trials and Regulatory Approval
Further work may focus on clinical validation, regulatory approvals, and real-world
integration into healthcare workflows.
31