CNN for American Sign Language Translation
CNN for American Sign Language Translation
By
Y. RAJESWARI
(Reg No: 21AT1D5803)
Department of CSE
2021 - 2023
G. Pullaiah College of Engineering and Technology
(Accredited by NAAC with ‘A’ Grade Approved by AICTE, NEW DELHI,
Permanently Affiliated to JNTUA, Ananthapuramu,
Recognized by UGC under 2(f)&12(B), ISO 9001:2001Certified institution
Nandikotkur Road, Venkayapalli, Kurnool-518452)
DECLARATION
BY
Y. RAJESWARI
(21AT1D5803)
G. Pullaiah College of Engineering and
Technology
(Autonomous)
(Accredited by NAAC with ‘A’ Grade Approved by AICTE, NEW DELHI,
Permanently Affiliated to JNTUA, Ananthapuramu,
Recognized by UGC under 2(f)&12(B), ISO 9001:2001Certified institution
Nandikotkur Road, Venkayapalli, Kurnool-518452)
CERTIFICATE
This is to certify that the dissertation entitled “Convolutional Neural Network – American Sign
Language Translation System” is a bonafide work done by Y. RAJESWARI bearing Register Number
21AT1D5803 in partial fulfillment of the requirement for the award of degree of Master of Technology
in COMPUTER SCIENCE AND ENGINEERING of Jawaharlal Nehru Technological University
Ananthapuramu.
Place: KURNOOL
Date:
Certified that the candidate is examined by me in the viva-voce Examination held at G. Pullaiah
College of Engineering &Technology, Kurnool on ______________
I take this opportunity to express my deep sense of gratitude to my guide Dr. M. RUDRA KUMAR,
Professor in the department of Computer Science and Engineering of GPCET, for his kind co-operation
I am also very thankful to our Head of the Department Dr. Mrs. M. SRILAKSHMI for her
kind co-operation and encouragement for the successful completion of this project
I am also thankful to all the Teaching and Non-Teaching Staff members of the department who
Finally, I acknowledge with gratitude the unflagging support and patience of my PARENTS for their
By
Y. RAJESWARI
(21AT1D5803)
ABSTRACT
American Sign Language Translation on interpreting the sign language into text or speech, to
facilitate the communication between deaf-mute people and ordinary people. This task has a broad
social impact but is still very challenging due to the complexity and large variations in hand actions.
Existing methods for SLR use hand-crafted features to describe sign language motion and build
classification models based on those features. However, it is difficult to design reliable features to
adapt to the large variations of hand gestures. To approach this problem, we propose a convolutional
neural network (CNN) which extracts discriminative spatial-temporal features from raw video streams
automatically without any prior knowledge, avoiding designing features. To boost the performance,
multi-channels of video streams, including color information, depth clue, and body joint positions, are
used as input to the CNN in order to integrate color, depth and trajectory information. We validate the
proposed model on a real dataset collected with Microsoft Kinect and demonstrate its effectiveness
over the traditional approaches based on hand-crafted features.
CONTENTS
Page. No
Abstract
Contents
List of Figures and Tables
CHAPTER 1 INTRODUCTION 1
1.1 Introduction 2
CHAPTER 2 LITERATURE REVIEW 3
2.1 Literature survey 4
CHAPTER 3 EXISTING SYSTEM AND PROPOSED SYSTEM 6
3.1 Existing System 7
3.2 Proposed System 7
3.3 Use Case Diagram 8
3.4 System Design 8
3.5 Modules 9
3.6 Hardware Requirements 10
3.7 Software Requirements 10
CHAPTER 4 IMPLEMENTATION 11
4.1 Download of Python 12
4.2 Installation of Python 13
4.3 Verify the Python Installation 15
4.4 Check how the Python IDLE works 16
4.5 Data Collection 17
4.6 Data Preprocessing 18
4.7 Data Augmentation 19
4.8 Model Architecture 20
4.9 Model Training 20
4.10 Model Evaluation 21
4.11 Deployment 22
CHAPTER 5 SAMPLE CODE 23
5.1 preprocessed_dataset.py 24
5.2 [Link] 25
5.3 convert_test.py 29
5.4 [Link] 30
5.5 [Link] 31
5.6 [Link] 33
CHAPTER 6 EXPERIMENTAL RESULTS 42
6.1 Experimental Results 43
6.2 Applications 48
6.3 Advantages 49
7.1 Conclusion 51
7.2 Future Scope 51
BIBLIOGRAPHY 52
LIST OF FIGURES AND TABLES
CHAPTER 1
INTRODUCTION
CHAPTER 1: INTRODUCTION
1.1 Introduction
Sign language, as one of the most widely used communication means for
hearingimpaired people, is expressed by variations of hand-shapes, body movement, and even
facial expression. Since it is difficult to collaboratively exploit the information from hand-
shapes and body movement trajectory, sign language recognition is still a very challenging task.
This paper proposes an effective recognition model to translate sign language into text or
speech in order to help the hearing impaired communicate with normal people through sign
language.
Technically speaking, the main challenge of sign language recognition lies in
developing descriptors to express hand-shapes and motion trajectory. In particular, hand-shape
description involves tracking hand regions in video stream, segmenting hand-shape images
from complex background in each frame and gestures recognition problems. Motion trajectory
is also related to tracking of the key points and curve matching. Although lots of research works
have been conducted on these two issues for now, it is still hard to obtain satisfying result for
SLR due to the variation and occlusion of hands and body joints. Besides, it is a nontrivial issue
to integrate the hand-shape features and trajectory features together. To address these
difficulties, we develop a CNNs to naturally integrate hand-shapes, trajectory of action and
facial expression. Instead of using commonly used color images as input to networks like [1,
2], we take color images, depth images and body skeleton images simultaneously as input
which are all provided by Microsoft Kinect.
Kinect is a motion sensor which can provide color stream and depth stream. With the
public Windows SDK, the body joint locations can be obtained in real-time as shown in Fig.1.
Therefore, we choose Kinect as capture device to record sign words dataset. The change of
color and depth in pixel level are useful information to discriminate different sign actions. And
the variation of body joints in time dimension can depict the trajectory of sign actions. Using
multiple types of visual sources as input leads CNNs paying attention to the change not only
in color, but also in depth and trajectory.
CHAPTER 2
LITERATURE REVIEW
segmentation, large and intra-class variations. In this paper, we propose a new user independent
recognition system for American sign language alphabet using depth images captured from the
low-cost Microsoft Kinect depth sensor.
CHAPTER 3
EXISTING SYSTEM AND PROPOSED SYSTEM
• For large datasets, this can result in long processing times and slower model training and
inference.
• Bi-directional LSTMs can be more computationally expensive, and accuracy is also less.
Bi-directional LSTMs add additional parameters to the model, which can increase
its complexity and make it harder to interpret and optimize. Bi-directional LSTMs have
several hyperparameters that need to be tuned carefully to achieve good performance,
including the number of hidden units, the learning rate, and the dropout rate. Tuning these
hyperparameters can be time-consuming and requires careful experimentation.
Additionally, there may be variability in how different signers perform the same sign, which
can also reduce the accuracy of bi-directional LSTMs.
• The proposed system is a complete and complex language with its own grammar, syntax,
and vocabulary which provides 97% accuracy when compared to existing algorithm and is
not simply a visual representation of English or any other spoken language.
• On the other hand, when the other parameters are considered, such that the computational
time and parallel processing (classifying more than one samples at a time), the performance
of CNN is far better than its counterpart.
• The reasons may be due to an imbalance in training ratio, It is concluded that the proposed
system can effectively classify the type of signs or gestures.
UploadHandGestureDataset
TrainCNNwithGestureImage
user
SignLanguageRecognitionFromWe
bCam
3.5 Modules
TensorFlow
TensorFlow is a free and open-source software library for dataflow and differentiable
programming across a range of tasks. It is a symbolic math library and is also used for machine
learning applications such as neural networks. It is used for both research and production at Google.
Pandas
Matplotlib
Scikit-learn
OpenCV
OpenCV is a powerful computer vision library that can be used for various image and video
processing tasks, including ASL recognition. While it is possible to use OpenCV with CNNs for
ASL recognition, it is important to note that OpenCV is primarily designed for computer vision
tasks such as image processing, object detection, and tracking, rather than deep learning.
CHAPTER 4
IMPLEMENTATION
CHAPTER 4: IMPLEMENTATION
4.1 Download of Python
Download the Correct version into the system
Step 1: Go to the official site to download and install Python using Google Chrome or any other
web browser. OR Click on the following link: [Link]
Now, check for the latest and the correct version for your operating system.
Step 2: You can either select the Download Python for windows 3.8.2 button in Yellow Color or
you can scroll further down and click on download with respective to their version. Here, we are
downloading the most recent python version for windows 3.8.2
Step 3: Scroll down the page until you find the Files option.
Step 4: Here you see a different version of python along with the operating system.
• To download Windows 32-bit python, you can select any one from the three options:
Windows x86 embeddable zip file, Windows x86 executable installer or Windows x86
webbased installer.
• To download Windows 64-bit python, you can select any one from the three options:
Windows x86-64 embeddable zip file, Windows x86-64 executable installer or Windows
x8664 web-based installer.
Here we will install Windows x86-64 web-based installer. Here your first part regarding
which version of python is to be downloaded is completed. Now we move ahead with the
second part in installing python i.e. Installation
Note: To know the changes or updates that are made in the version you can click on the Release
Note Option.
Step 2: Before you click on Install Now, Make sure to put a tick on Add Python 3.8.2 to PATH.
Step 3: Click on Install NOW After the installation is successful. Click on Close.
Find sources of ASL videos that demonstrate the signs you want to include in your dataset.
You can use online resources such as ASL dictionaries, instructional videos, or even
YouTube channels that focus on teaching ASL.
Once you have identified the videos you want to use, you will need to extract individual
frames from the videos. This will allow you to create a dataset of images that represent
each sign.
Label each image with the corresponding ASL sign. You can do this manually or use tools
such as Labelling or VGG Image Annotator.
Organize your labelled dataset into training, validation, and test sets. This will ensure that
your model can generalize well to new data.
Augment your dataset by adding variations to your images, such as flipping, rotating, or
adding noise. This will increase the size of your dataset and help your model become more
robust.
Pre-process your dataset by resizing the images and normalizing the pixel values. This will
help your model process the data more efficiently.
Finally, split your dataset into batches and feed it into your CNN model. Train your model
and fine-tune its hyperparameters until it achieves good accuracy on your validation set.
Overall, collecting data for ASL translation using CNN requires careful planning and
attention to detail. However, with the right approach, you can create a robust and accurate
model that can translate ASL signs into text or speech.
Image Resizing: Resizing images to a consistent size is essential for CNNs. The images
should be resized to the same dimensions to ensure that the CNN can process them. The
size of the images should be chosen based on the architecture of the CNN model and the
available computing resources.
Image Normalization: Normalizing pixel values of images helps the CNN converge faster
during training. The values should be scaled to a range between 0 and 1 or -1 to 1.
Data Augmentation: Data augmentation helps in increasing the size of the dataset by
creating new examples from the existing ones. This technique can help in improving the
model’s ability to generalize to new data. Techniques such as rotation, scaling, and flipping
can be used to augment the data.
Splitting Data: Splitting the dataset into training, validation, and test sets is important to
evaluate the model's performance. The training set is used to train the model, the validation
set is used to tune the model’s hyperparameters, and the test set is used to evaluate the
model's performance.
One-Hot Encoding: One-hot encoding is used to convert the categorical labels of ASL
signs into numerical values. Each sign is assigned a unique numerical value, which is used
to train the CNN.
Class Balancing: Balancing the number of examples for each ASL sign can help in
training the model more efficiently. If the number of examples for each class is
imbalanced, the model may become biased towards the more abundant classes, leading to
poor performance for the less frequent classes.
Data Normalization: Normalizing the data is important to ensure that the features have
the same scale. This helps the CNN model in processing the data more efficiently and
effectively.
In summary, data preprocessing is a crucial step in developing a CNN model for ASL
translation. By following the above steps, you can ensure that your model is trained on
highquality data and achieves good accuracy on the validation and test sets.
Rotation: Rotating the images can help the model learn to recognize signs from different
angles, which can improve its robustness to variation in hand position and orientation.
Scaling: Scaling the images can help the model recognize signs of different sizes, which
can improve its ability to generalize to new data.
Flipping: Flipping the images horizontally can help the model learn to recognize signs that
are mirrored or performed with the opposite hand.
Cropping: Cropping the images can help the model focus on the most important parts of
the image, such as the hand and the signed word, while removing distracting background
details.
Adding noise: Adding noise to the images can help the model learn to recognize signs in
low-light or noisy environments, which can improve its robustness to real-world conditions.
Translation: Translating the images can help the model learn to recognize signs that are
performed in different positions or locations, which can improve its ability to generalize to
new data.
Color jittering: Color jittering involves randomly adjusting the brightness, contrast,
saturation, and hue of the images. This can help the model learn to recognize signs in
different lighting conditions, which can improve its robustness to real-world environments.
Input layer: This layer takes the input image, which is typically resized and normalized.
Convolutional layers: These layers apply a set of filters (also known as kernels) to the
input image to extract important features. The filters are learned during training, and the
number and size of the filters can vary depending on the architecture of the CNN model.
Pooling layers: These layers down sample the output of the convolutional layers to reduce
the spatial size of the feature maps and help the model learn more robust features.
Fully connected layers: These layers take the flattened output of the last pooling layer and
apply a set of dense (fully connected) layers to classify the image into the corresponding
ASL sign.
Output layer: This layer produces the final output, which is a probability distribution over
the different ASL signs.
The architecture of the CNN model can vary depending on the specific problem and
available data. Some popular architectures that have been used for ASL translation include
LeNet, AlexNet, VGG, and ResNet. The choice of architecture depends on factors such as the
size of the dataset, the available computing resources, and the desired accuracy and speed of
the model. In summary, a CNN model for ASL translation typically consists of several layers,
including convolutional layers, pooling layers, and fully connected layers. The choice of
architecture depends on the specific problem and available data, and the model is trained to
classify the input image into the corresponding ASL sign.
Training a Convolutional Neural Network (CNN) model for American Sign Language
(ASL) translation involves several steps:
Data Preparation: The first step is to prepare the data for training. This involves
preprocessing the images, splitting the data into training, validation, and test sets, and
onehot encoding the labels.
Define the Model Architecture: The next step is to define the architecture of the CNN
model. This involves selecting the number of layers, the type of layers, and the number of
filters for each layer. The architecture should be chosen based on the specific problem and
available data.
Compile the Model: The next step is to compile the model, which involves selecting a loss
function, an optimizer, and any metrics that will be used to evaluate the performance of the
model during training.
Train the Model: The model is then trained on the training set using the fit() method of
the Keras library. During training, the model iteratively updates the weights and biases of
the filters to minimize the loss function.
Evaluate the Model: After training, the model is evaluated on the validation set to check
its performance. This involves calculating metrics such as accuracy, precision, and recall. If
the model is not performing well, the architecture and hyperparameters can be adjusted and
the training process repeated.
Test the Model: Once the model has been trained and evaluated, it can be tested on the test
set to get an estimate of its real-world performance.
Fine-Tuning: If the model is not performing well on the test set, it may be necessary to
fine-tune the hyperparameters or adjust the architecture of the model.
Save the Model: Once the model has been trained and evaluated, it can be saved to disk
for later use.
In summary, training a CNN model for ASL translation involves preparing the data,
defining the model architecture, compiling the model, training the model, evaluating the model,
testing the model, fine-tuning the model if necessary, and saving the model to disk.
Accuracy: Accuracy is the proportion of correctly classified images over the total number
of images. It is a common metric for evaluating classification models, but it can be
misleading if the dataset is imbalanced.
Precision: Precision is the proportion of true positives (correctly classified ASL signs) over
the total number of predicted positives (all predicted ASL signs). It measures how precise
the model's predictions are.
Recall: Recall is the proportion of true positives over the total number of actual positives
(all true ASL signs). It measures how well the model is able to identify all positive cases.
F1 score: The F1 score is the harmonic mean of precision and recall. It is a balanced metric
that takes both precision and recall into account.
Confusion matrix: A confusion matrix is a table that shows the number of true positives,
false positives, true negatives, and false negatives for each ASL sign. It can be used to
identify which signs are most often misclassified by the model.
The evaluation metrics should be chosen based on the specific problem and the available
data. In general, the accuracy, precision, recall, and F1 score are commonly used metrics for
classification models, while the confusion matrix and ROC curve can provide additional
insights into the model's performance.
In summary, the evaluation of a CNN model for ASL translation typically involves several
metrics, including accuracy, precision, recall, F1 score, confusion matrix, and ROC curve.
These metrics can be used to assess the overall performance of the model and to identify areas
for improvement.
4.11 Deployment
Deployment of a Convolutional Neural Network (CNN) model for American Sign Language
(ASL) translation involves making the model available for use in a production environment.
Here are the general steps involved in deploying a CNN model
Export the Model: The first step is to export the trained model from the development
environment. This usually involves saving the model in a format that can be loaded by the
production environment, such as a Keras HDF5 file or a TensorFlow SavedModel.
Load the Model: Once the production environment is set up, the exported model can be
loaded into the production environment using the appropriate library or framework.
Input Data: The input data needs to be preprocessed to match the format expected by the
model. This may involve scaling, normalization, and one-hot encoding.
Run Predictions: The preprocessed input data can then be fed into the loaded model, and
the model can generate predictions on the input data.
Return Results: The predictions can then be returned to the user or integrated with other
systems as needed.
The deployment of a CNN model for ASL translation requires careful consideration of
the production environment, data input, and output. It is important to ensure that the deployed
model is accurate, efficient, and scalable to meet the needs of the intended users.
CHAPTER 5
SAMPLE CODE
if not results.multi_hand_landmarks:
continue
image_height, image_width, _ = [Link]
annotated_image = [Link]()
print([Link](whole_dataset_x).shape)
print([Link](whole_dataset_Y).shape)
[Link]({'X':[Link](whole_dataset_x),'Y':[Link](whole_dataset_Y)},open(save_path,
'wb'))
if __name__ == "__main__":
preprocessed_dataset("dataset/train", 'preprocessed_dataset_train.h5')
preprocessed_dataset("dataset/valid", 'preprocessed_dataset_valid.h5')
5.2 [Link]
import os
[Link]['TF_CPP_MIN_LOG_LEVEL'] = '3'
import time import cv2 import numpy as np
import mediapipe as mp from
[Link] import load_model
if results.multi_hand_landmarks: for
hand_landmarks in results.multi_hand_landmarks:
current_data = []
for i in range(0, 21):
current_data.append([Link]([hand_landmarks.landmark[i].x,
hand_landmarks.landmark[i].y, hand_landmarks.landmark[i].z], dtype=[Link]))
mp_drawing.draw_landmarks(
image, hand_landmarks,
mp_hands.HAND_CONNECTIONS,
mp_drawing_styles.get_default_hand_landmarks_style(),
mp_drawing_styles.get_default_hand_connections_style())
t2 = [Link]()
fps = int(1./ (t2 - t1)) image = [Link](image, 1) [Link](image, f"FPS:
{fps}", (10, 20), cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0,
0, 255), 1, cv2.LINE_AA) if
predicted_class == "SS":
predicted_class = "Space"
[Link](image, f"Predicted Class: {predicted_class}", (10, 40),
cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0, 0, 255), 1, cv2.LINE_AA) return image,
predicted_class
def detect():
model = load_model('sequencial_model.h5')
print([Link]()) print("Model
Loaded...") classes =
[Link]('dataset/train')
mp_drawing = [Link].drawing_utils
mp_drawing_styles = [Link].drawing_styles mp_hands
= [Link]
in results.multi_hand_landmarks: current_data = []
for i in range(0, 21):
current_data.append([Link]([hand_landmarks.landmark[i].x,
hand_landmarks.landmark[i].y, hand_landmarks.landmark[i].z], dtype=[Link]))
pred = [Link](np.expand_dims([Link](current_data), axis=0))
predicted_class_indices = [Link](pred, axis = 1) predicted_class =
classes[predicted_class_indices[0]] mp_drawing.draw_landmarks(
image,
hand_landmarks,
mp_hands.HAND_CONNECTIONS,
mp_drawing_styles.get_default_hand_landmarks_style(),
mp_drawing_styles.get_default_hand_connections_style())
if __name__ == "__main__":
detect()
5.3 convert_test.py
import os import cv2 import mediapipe as mp
mp_drawing = [Link].drawing_utils
mp_drawing_styles = [Link].drawing_styles
mp_hands = [Link]
if not results.multi_hand_landmarks:
continue image_height, image_width, _ =
[Link] annotated_image = [Link]() for
hand_landmarks in results.multi_hand_landmarks:
mp_drawing.draw_landmarks(
annotated_image, hand_landmarks,
mp_hands.HAND_CONNECTIONS,
mp_drawing_styles.get_default_hand_landmarks_style(),
mp_drawing_styles.get_default_hand_connections_style()) [Link](file[:-4] +
'test' + '.png', [Link](annotated_image, 1))
5.4 [Link]
import pickle import [Link] as plt import
numpy as np from [Link] import
load_model with open('sequencial_model_score.h5',
'rb') as score:
fit_history = [Link](score)
[Link](1, figsize = (20,10))
[Link](221)
[Link](fit_history[1][1])
[Link](fit_history[3][1])
[Link]('Model Accuracy')
[Link]('Accuracy')
[Link](['Train', 'Valid'])
[Link](222)
[Link](fit_history[0][1])
[Link](fit_history[2][1])
[Link]('Model Loss')
[Link]('loss') [Link](['Train',
'Valid']) [Link]()
5.5 [Link]
import os import sys import time import pickle
import tensorflow as tf import pickle import
numpy as np import [Link] as keras from
[Link] import LabelEncoder from
[Link] import OneHotEncoder from
[Link] import Sequential
[Link](1500) class
TimeHistory([Link]): def
on_train_begin(self, logs={}):
[Link] = []
classes = [Link](list(set(set(dataset['Y']))))
enc = LabelEncoder()
[Link](dataset['Y'])
print(dataset['X'].shape, dataset['Y'].shape)
print(dataset_valid['X'].shape, dataset_valid['Y'].shape) print(classes)
return dataset, dataset_valid, classes def build_model(input_shape, classes):
model = Sequential()
[Link]([Link](128, activation='relu'))
[Link]([Link](0.3))
[Link]([Link](256, activation='relu'))
[Link]([Link](0.3))
[Link]([Link](512, activation='relu'))
[Link]([Link](0.3))
[Link]([Link](classes, activation='sigmoid'))
return model
[Link]['time'] = time_callback.times
[Link]('sequencial_model.h5')
[Link](list([Link]()),open('sequencial_model_score.h5', 'wb'))
if __name__ == "__main__":
train_model(60)
5.6 [Link]
import os
[Link]['TF_CPP_MIN_LOG_LEVEL'] =
'3' import ctypes import sys import cv2 from
tkinter import * from [Link] import * from
time import sleep from PIL import ImageTk,
Image from detect import detect_frame
import [Link] as tkMessageBox from
[Link] import load_model
model = load_model('sequencial_model.h5')
print([Link]()) print("Model
Loaded...") classes =
[Link]('dataset/train') def HomePage():
global cntct,about, predict_stream,
cap try: [Link]()
except:
pass
try:
[Link]()
except:
pass
try:
predict_stream.destroy()
except:
pass
try:
[Link]()
except:
pass
window = Tk() img =
[Link]("Images\\[Link]") img =
[Link](img) panel = Label(window,
image=img) [Link](side="top", fill="both",
expand="yes") user32 = [Link].user32
[Link]()
[w, h] = [[Link](0), [Link](1)]
lt = [w, h] a = str(lt[0]//2-446) b= str(lt[1]//2-383)
[Link]("HOME - American Sign Language")
[Link]("1214x680+"+a+"+"+b)
[Link](0,0) def contactus():
FB",16,"bold"),width=10,relief=FLAT,bd=0,borderwidth='0',bg="#787A91",fg="#0F044C",
activebackground = "#EEEEEE",activeforeground = "#141E61",command=HomePage)
[Link](x=784, y = 40)
contactusbtn = Button(cntct,text = "About Project",font = ("Agency
FB",16,"bold"),width=12,relief=FLAT,bd=0,
borderwidth='0',bg="#787A91",fg="#0F044C",activebackground=
"#EEEEEE",activeforeground = "#141E61",command=aboutus) [Link](x=909,y
= 40) exitbtn = Button(cntct,text = "Exit",font = ("Agency FB",16,"bold"), width=10, relief =
FLAT, bd = 0, borderwidth='0',bg="#787A91",fg="#0F044C",activebackground =
"#EEEEEE",activeforeground = "#141E61",command=exit)
[Link](x=1050,y = 40)
[Link]()
def aboutus():
global about,cntct, predict_stream, cap
try:
[Link]()
except:
pass
try:
[Link]()
except:
pass
try:
predict_stream.destroy()
except:
pass
try:
[Link]()
except:
pass
about = Tk()
img = [Link]("Images\\[Link]")
img = [Link](img) panel =
Label(about, image=img) [Link](side="top",
fill="both", expand="yes")
user32 = [Link].user32
[Link]()
[w, h] = [[Link](0), [Link](1)]
lt = [w, h]
a = str(lt[0]//2-446)
b= str(lt[1]//2-383)
[Link]()
# EXIT . . .
def exit():
def VideoMode():
global about,cntct, predict_stream
try:
[Link]()
except:
pass
try:
[Link]()
except:
pass
predict_stream = Tk()
img = [Link]("Images\\[Link]")
img = [Link](img) panel =
Label(predict_stream, image=img) [Link](side="top",
fill="both", expand="yes")
user32 = [Link].user32
[Link]()
[w, h] = [[Link](0), [Link](1)]
lt = [w, h] a = str(lt[0]//2-446)
b= str(lt[1]//2-383)
global cap
cap = [Link](0)
def show_frame():
_, frame = [Link]() global
current_character
frame, current_character = detect_frame(frame, model, classes)
cv2image = [Link](frame, cv2.COLOR_BGR2RGBA)
img = [Link](cv2image) imgtk =
[Link](image=img) [Link] = imgtk
[Link](image=imgtk) [Link](10,
show_frame)
display1 = Label(imageFrame)
[Link](row=1, column=0) show_frame()
##########################
def
add_into_string():
try:
global current_character
if current_character == "Space":
current_character = " "
value = predicted_string_label['text'] +
current_character predicted_string_label.config(text = value)
except:
pass
def remove_from_string():
value = predicted_string_label['text'][:-1]
predicted_string_label.config(text = value) predict_stream.title("Predict - American
Sign Language") predict_stream.geometry("1214x680+"+a+"+"+b)
predict_stream.resizable(0,0)
"#EEEEEE",activeforeground = "#141E61",command=exit)
[Link](x=1050,y = 40)
livestream = Button(window,text = "LIVE STREAM",font = ("Arial
Narrow",18,"bold"),width=20,relief=FLAT,bd=1,borderwidth='1',bg="#787A91",fg="#0F04
4C",activebackground = "#EEEEEE",activeforeground = "#141E61",command=VideoMode)
[Link](x=225,y = 350) [Link]()
#HomePage() def
LoadingScreen():
root = Tk() [Link](bg="white")
[Link]("Loading - American Sign Language")
img = [Link](r"Images\\[Link]")
img = [Link](img) panel =
Label(root, image=img)
[Link](side="top", fill="both", expand="yes")
user32 = [Link].user32
[Link]()
[w, h] = [[Link](0), [Link](1)]
lt = [w, h] a = str(lt[0]//2-446) b= str(lt[1]//2-383)
[Link]("1214x680+"+a+"+"+b) [Link](0,0) for i in
range(40):
Label(root,
bg="#EEEEEE",width=2,height=1).place(x=(i+4)*25,y=600) def
play_animation(): for j in range(40):
Label(root,bg= "#141E61",width=2,height=1).place(x=(j+4)*25,y=600)
sleep(0.07)
root.update_idletasks()
else:
[Link]()
HomePage()
[Link]()
play_animation() [Link]()
LoadingScreen()
CHAPTER 6
EXPERIMENTAL RESULTS
Then after writing the command in command prompt there it opens the sign language home
page as shown below.
6.2 Applications
The applications of American Sign Language (ASL) translation using CNN algorithm
are diverse and wide-ranging. Here are some of the key applications:
Education: ASL translation systems can be used in educational settings to provide deaf
and hard of hearing students with real-time translations of classroom lectures and
discussions.
Customer service: ASL translation systems can be used in customer service settings to
facilitate communication between deaf or hard of hearing customers and customer
service representatives, improving customer satisfaction and accessibility.
Business: ASL translation systems can be used in business settings to enable effective
communication between deaf or hard of hearing employees and their colleagues, helping
to create a more inclusive workplace.
Overall, ASL translation using CNN algorithm has a wide range of applications that can
greatly benefit the deaf and hard of hearing community, enabling them to communicate more
effectively in a variety of settings.
6.3 Advantages
The advantages of American Sign Language (ASL) translation using CNN algorithm
are significant and wide-ranging. Here are some of the key advantages:
Accuracy: CNN algorithm has been proven to be highly effective in accurately
recognizing and translating hand gestures used in ASL, resulting in highly accurate
translations.
Efficiency: CNN algorithm is highly efficient in recognizing hand gestures and translating
them into written or spoken language, making ASL translation systems faster and more
effective.
Accessibility: ASL translation using CNN algorithm provides greater accessibility to the
deaf and hard of hearing community, enabling them to communicate more effectively in a
variety of settings.
Flexibility: ASL translation systems using CNN algorithm can be used in a wide range of
applications, from educational settings to customer service to emergency services,
providing greater flexibility and versatility.
Scalability: ASL translation systems using CNN algorithm can be easily scaled up to
accommodate larger volumes of users, making them highly scalable and adaptable to a
variety of settings.
Overall, ASL translation using CNN algorithm offers a range of advantages that can
greatly benefit the deaf and hard of hearing community, providing greater accessibility, efficiency, and
flexibility in a variety of settings.
CHAPTER 7
CONCLUSION AND FUTURE SCOPE
BIBLIOGRAPHY
[1] Jin CM, Omar Z, Jaward MH. A mobile application of American sign language
translation via image processing algorithms. Proceedings - 2016 IEEE Region 10 Symposium,
TENSYMP 2016. 2016;: p. 104-109.
[2] Flores CJL, Cutipa AEG, Enciso RL. Application of convolutional neural networks for
static hand gestures recognition under different invariant features. Proceedings of the 2017
IEEE 24th International Congress on Electronics, Electrical Engineering and Computing,
INTERCON 2017. 2017;: p. 5-8
[3] Kumar P, Gauba H, Roy PP, Dogra DP. Coupled HMM-based multi-sensor data fusion
for sign language recognition. Pattern Recognition Letters. 2017; 86: p. 1-8
[4] Hu, J., Shen, C., & Sun, G. (2018). Squeeze-and-Excitation Networks. IEEE
Conference on Computer Vision and Pattern Recognition (CVPR).
[5] Huang, J., Zhou, J., Xu, W., & Huang, Y. (2018). American Sign Language
Recognition Based on Convolutional Neural Network. IEEE Access, 6, 57695-57702.
[6] Ding, S., Tang, Y., Wang, J., & Zhao, J. (2019). A Survey on Sign Language
Recognition Using Wearable Sensors. IEEE Transactions on Human-Machine Systems, 49(4),
315-326.
[7] Aly W, Aly S, Almotairi S. User-independent American sign language alphabet
recognition based on depth image and features. IEEE Access. 2019; 7: p. 123138-123150
[8] Ye, Y., & Hu, J. (2020). American Sign Language Recognition with Sign Language
Alphabet and Action Recognition Network. IEEE Access, 8, 1710-1718.
[9] lkandari, A., Alkandari, F., & Alrashedi, S. (2020). Sign Language Recognition Using
Deep Convolutional Neural Networks. IEEE Access, 8, 190057-190068.
[10] Li, S., Liu, Y., & Chen, Y. (2021). American Sign Language Recognition Using 3D
Convolutional Neural Networks. IEEE Access, 9, 23363-23372.
is recoof bonafide work cart by me and thee sultmbodis projeave not reped or copied
anysourcThe s ed thithesis not submted G. Pulialege oineering d Technoly (Automo) (
Acced by NAit Approvy AICI, Py A JNTUA, Anamu, RGC under 2(f), IS:2001Citon
Nandikotayapall - 51845 ) DEPARTNT OF COMPUTR SCIE AND ERING CERTE Ts s o rtfy
ha t dirtaon eited “ Con volonalNea lNe – Amercan Si LanguageTratiSyste G.
No Abstract Conten Lres an Tab CHAPTR 1 INTN 1 1.1 Introducti 2 CHAPTR 2 LITRAT
REVIE 3 2.1 Litey 4 CHAPTR 3 EXISING SEETE 6 3.1 Existystem 7 3.2 Pystem 7 3.3 Use
Cram 8 3.4 UML Diagr 8 3.5 Modul 9 3.6 Hardwequirnts 10 3.7 Sequirnts 10 CHAPTR 4
IMPLENTAT 11 4.1 Download of P 12 4.2 Installatiython 13 4.3 Verify the Pyttallation 15
4.4 Cython IDLE works 16 4.5 Data Colon 17 4.6 Data Preprong 18 4.7 Data
Augmentation 19 4 .8 Model Arecture 20 4.9 Model Traini 20 4.10 Model Evaluation 21
4.11 Deploym 22 CHAPTR 5 SO 23 5.1 preprocaset 24 5 .2 [Link] 25 5.3
convert_test.py 29 5.4 [Link] 30 5.5 [Link] 31 5.6 [Link] 33 CHAPTR 6
EXPERIMEULT 42 6.1 Experimental Result 43 6.2 Applicati 48 6.3 Advantag 49 7.1 Con 51
7.2 Fcope 51 BLRAPHY 52 LIOF UR TABL S Name Of thre Page n 1 3.3.1
Use Cram 8 2 3.4.1 UML Diagr 8 3 4.1.1 Pbsit 12 4 4.1.2 SP 12 5 4 .1.3 Executable P 13 6
4.2.1 Pon 14 7 4.2.2 Installati of Pyt 14 8 4.2.3 Snstallatiython 15 9 4.3.1 Cand P 15 10 4
.3.2 Version ch python iand prompt 16 11 4.4.1 P 16 12 4.4.2 S 17 13 4.4.3 Rhon IDLE 17
14 6.1.1 Main files of projec 4 3 15 6.1.2 Dataset 4 3 16 6 .1.3 Images of H ler 4 4 17 6.1.4
Images of D ler 4 4 18 6.1.5 Rhe project usiand prompt 4 5 19 6.1.6 Executic 4 5 20 6.1.7
Home page oct 4 6 21 6.1.8 Pr 4 6 22 6 .1.9 Per lett 4 7 23 6.1.10 Adding of letters s st 4
7 Convoltionral Network – Ame rican S Lguslation S CHA INTUCTIN Dept. of CSC P 1
Convoltionral Network – Ame rican S Lguslation S Dept. of CSCage 2 CHA INTUCTIN
1ntron Slanguage, on the commonmeans hearingim people, varions of hand -
shovement, and facial essiSitis collvely t information hand - s hapes aeme
languagecognitllerenging This proposes effve cognitmodel translate language o or
speech ordto theaimcomcate normal pethrough language.
adof only d imas networks ke 2], tcoloims, imandbodskeleton ages ult aneouslas which
are provided by Kinect amotsensowhich colostreamand epth aWitthe publiDK, the body
jointons can be obtained in real - tiig.1. Therefowe e nect capture recosign datet.
changof color and depth in pixel le useful infoon to discrimreons. And the on body nts
tidim depict trajofsign ons. mulple of sas ads N s atton the ange in color, but also iry.
Cional Neural Ne – Ame Sanguage Tr S em Dept.
of CSC P 3 CHA LIERATURE REVI Convoltionral Network – Ame rican S Lguslation S Dept.
of CSCage 4 CR 2 LIRATUREREW 2teraew 2.1.1 Title 1: Acanignanage Worditiong Satio -
Temp Pros od Angleqenearng App Auth Chamngthai Department of El and
TeleunicatiacultEngineering, King Mongkutangkok 2022,Thail Conten The incrediblatton
human - compintcti(HCmhuman ands the and medium express entifor y ctiactiies It
tothe evelopof rous ems assign nguage i on, robotics, medical distihers.
mie some e on d loss To unicate, af - bliindiduals the Snatiof Brailrican Sn Language L),
and tle alphabets. 2.1.3 Titl e 3: User - Ineenencanign Lguhet Recogn B on DepdNet Fea
Authaleh Aly , Sltaneper. Department of on Technology, Majm University, Majmaudi
Arabia Conten S natural and effve way founicati and Ameican ign age(ASalphabrion i.e.,
ell using - less visiensor is a chenging tasculton and ancvariatins signers. color - based
language ion systuffer fy challas com d, h Dept. of CSC P 5 Convoltion u e twork -
Amignanage Tran S m segmentatintra - class vons. In tropose aew user indep
recognitstem for Agn languagphabet usimages capturhe low - cost Mct depth sensor.
2.1.4
Title 4: Acanignanage Worditiong Satio - Temp oral Prosod Angleqenearng App Auth
SALA , ABDULAHI , KHAI Conten Mostof avail mericanSLanguag L) sh ilchacterist These
araics eusuallduring trajectory yields miy ues hinders tous catiHowevrgnitof ilASL co n
translation algoritsclassificatis paper, based on fast fisher vector (FFand - direonal -
STemem(Bi - LSmethod,a e ase f dynamisign rds ecion hm ebidional ong - shot memory
- fast fisher vectoF - Dept. of CSC P 6 Dept.
of CSC P 7 Cional Neural Ne twork - Amign Lanslation S tem CHA EING MPROSE 3isngs
• In existstem B - DirectiTM isexecre itore challn trainiata sets. • Fasets, tn resultong
processimes and sll tng and inferen • Bi - dirLMs additional to model, can e its exity d e
harderto pr optimi - dironal ST e sever ameterthatneedtobe nedcary eve ood or,
including numbofhidden learnirand dr trthe hyperpars time - consumingand equires eful
imentation.
Additionally, there may be variabilierent signers perfm the same sign, whic can also ruce
the r bi - dirST 3pos d stem • Around of wopopulation suffeng deaf hard Hea losscan
signiantimon comcatiand cess formation. to lcess ty face bers to educon and social
exposu r • The systis compland ex with s grammsynt and voc which ovides 97%
curawhen compa to ing algorit and not siy a visual repreh or ae. • On otherhand, other
arareconsi that e ati tiel prong (classifying more thanes at a ti, the per foe of CNN is far
ber thans counterpart.
• The reay bo an ime in tno, It is conat t systctify the type of signsr gestures. Dept. of
CSC P 8 Convoltion Dept. of CSC P 9 Con volual Neu e twork - Amignanage Tran S
3dules ? Ten TensorFlis free nd - source arlifor and enti programmacross raof It a c liand
also for learning applions such as neural networks. It is used for both researd producti ?
Panas Pis open - sourcePLibrary high - performandata and analysi.
Using Pcomplcal steps in the processis of data, regriepar mani pulate, model, and ?
Matpl Matplib a ython plotng brary publiatiqty in variety of hardats and intve envirootl
in Paate plotograectr a, baarts, errts, scattr ploth just . ? Sit - learn St - learn a
osupervisanunsupervised arningalgoritvia consiintce PIt liundapermisssim ified Sliand
distinux diions, ademicoercial use.
of CSCage 14 Fython application Sefore you clion Install Now, Make sur a tihon 3.8.2 to
PATH. F of Pyt Sliallonuccessful. Cck on C Con volual Neu - Amignanage Tran S Dept. of
CSC P 15 Fuccessful ion of P Witee sthon ins sucy correy insted Phe tiallon. Note: The
instatiightf minut 4fython Inslao Slitart SRomm “cmd” Fommrompt Convoltionral
Network - Amignanage Tran S Dept. of CSCage 16 Sand prompton.
Sest whether the python isectl insted. Type python – V and prer. Fg of python iand
prompt S willhe answer as 3.8.2 N If you havthe earliethon already insu musnstall the
earlieen inst the new one. 4w the Pyn IDLE works Slitart S In the Windows un commdle”
Fython IDLE Sliython 3.8.2 64 - bitunch the progr Sworking ist first se.
Ccilli S Con volual Neu - Amignanage Tran S Dept. of CSC P 17 Faving of a file
Convoltionral Network - Amignanage Tran S Dept. of CSCage 18 ? Furces of ASL vidos tr
ate the sio include in your dataset. You can use onliresouas ASL dionaries, insonal vis, or
even YouTube channels tcus on t ASL.
? Once you hav the videos you want t need to ext randivi frames fl allow you to create a
dagesent each si ? Label eaage with te corrng ASL sigs manuay or use tool such as
LabellG Image ? Organiz your labed dnto tng, valion, and test sets. This wil your model
can genalie wella. ? Augment your dataset by to your imflippi adding noil ie th e size
otaset and help your mode robust. ? P - procet he imd normalizing the pixel values.
This wil help your model proche data more ciently. ? F your dataset iee d ito your C and
fine - tune itsrpa ites good accurr vdati Overalleng data fating Ccarnning and atton to
detailver, with the rightacreroburate model that can translate int 4ta Prepross Data
prepessia cep in developia Cional Neural ) model for Ameign(ASion.
Here ae steps r data prepng: ? Image Resizin Rg imstent si s essential for [Link]
shoulhe so ensurrocss t size of the imded on the aecturNN m availing reso ? Image
Norzat ion Normalizing pixel valueshe CN conveaster during trainies shoulreen 0 and 1
o - 1 to 1. ? Data Augmtation Daon helps ihe sizdataset by creatiw ex amplfrhe
existechnique can hn im model’s abily to generalia.
Teon, scng, and flippi can be used to Con volual Neu - Amignanage Tran S Dept. of CSC
P 19 ? Slig Data: Sttinto tng, v alion, and test sets iportant t evaluate the model's
pormance. The traini is usrain tdati set iameters, nd the test ss used to evaluate the
model's performan ? On - Hot Ecodg: One - hot encoding i convert the cal l signso
numerical values.
Each siassia unique numeric, which is u to the C ? Class Bcin Bhe number of exes for ea
cann trainiore entlof examplr eas is imay become birdclaso poor perfoe for theclasses. ?
Data Normalition N lihe data is io ensure that the es have the same scals the CNN
model iessieffi effvely. In summary, data prepessia crucial stdevelopiNN mode translatio
n.
Bhe above steps, you c ensure that your model i highqualiteves good acacy on thdati 4
Augtin Data augmenton is portant tmproving tanc Cional Neural Ner American SageL)
translation. Here e on data augmentatiques that can be us ASL datasets: ? Rot ation Rhe
imelp tl lcognize signsffe angles, which cpros robustness tion ition. ? Sn Sng the images
can help tel re of diffent si es, which can ims abily to generalie to new d ? Flipin Fpping
the images horizontally celp tec that are mired or prmehe opposi ? Cropin Che mod
focus on t im the image, such as the hnd and the signed word, ng disng background
details.
? Addg n Adding noishe imehe model learn to reognize signs low - li which can imve ito
real - world conditi ? Tran: Translating tages can hhe cognize that are performed in
dirent poions oons, which can ims abily to generalie to new data. Convoltionral Network
- Amignanage Tran S Dept. of CSCage 20 ? Color jitterin Cttves randomling t
saturatieages.
This can help te model learn to re signs differightprove iness t - worl These technique ed
indinatia et for trainidel. Bhe data, the model cano recogniz under a vaiprove iacy and
ss. 4del Archi The arecture oonional Neural NetwoNN) model foeign Language L)
translaticallstsay ? Inu This lakes the inputage, whypicall d normalized.
? Convoltionayers: These laye filtown as krnels) to t inputage to extract ies. The arnrai
number and sif the fiers can vaon the a rchitcture of th C ? Poolig layers: These lae the
outional layers to redu the spatize of the ure maps and hhe model learn moratures. ?
Fuy conected These laye flat of the last png layer and apply a set of densy connectedrs
to clafy the imo the cng ASL sign.
? Out layer: This l produces the put, which is a probabily diston over the differ t ASL
signs The rchite f NN cvdon e cifiproblemand availdSpoaecturthat busfoASL include
LeNet,et, VGaRchoiceofarecturedeponfactorssuch as size thedataset, e
vailcomputresouand dauracy speof the In ary, Cmodel tratypiy sts several including ional
poolilayers, nd conn ected eThe e architre ends the pandavaildand e is to classify the
inputage intng 4delni Trainionvolutetwork (CNNmodel for Ameign (ASnvolves severeps:
? Data Prep: The irst sto prepare the aininvolves preprocng the imagesitng the data into
tng, valion, and test sets, and onehot enco ding te Con volual Neu - Amignanage Tran S
Dept. of CSC P 21 ? Definelitecture: The next stto define the aecturof the C model.
This ing the number of lays, the type of layers, the number of filters for eer. Theectud be
c ased on the spic problem and avail ? Compe the Mod Thep is tle the model, which
invols selecti functimind any metrics that wile the peformanc model during tng . ?
Trainel: Th then trained on thng set usi() method of the Keras liary.
Duringng, the model iively updates the weight and biases of the filinize the loss functi ?
Eate the Mod A r trainis edatin set t ite. This iolves calculatisuch as accy, precisiall the
model i performi, the architcture rametaned and the trainiepe ? Test thel: One model
has been tand evaluat can be t set tmate of i - world prmanc ? Fin - Tuin If the model i
performing wellon the test st mry to fine - tune the hypram the eel.
? Sel: Onche model has been td and evaluat can besk for later us In ary, ng Cmodel ASL
anslation ves ring data, defining the model arecture, comping the model, trainivaluating
the model, testing t - tunif necessahe model to di 4 Mo Evuan The aluation
CvolutNeural rk for Am ign Language L) ion callinvolseveral etrics assitperformanHe are
somcommed metrics: ? Accu: Accy the of y imover totnumber of ages.
? Confus A matrix a that the of posives, false titrue ati s, false egaties e sign. be to
identify which simostsclassified by the model. ? ROC rve: TheReiver eratiCacic curvia al
represof trueposive (re) sus false tirate(1 - specificit at diffatids. It ce usede ove
pormancee model and to choose an omal classificatishol The metrics hould chosen othe
cific athe able data.
generthe cuprecisire, d 1 are onlused for classificatimodels, hilthe matriand OC can addit
insinto tce. In summNN model for AL translation ty involves severa metrics,
accurprecisirec, 1 confusion , Rcurve. These metrics can be usehe overalle of the model
and to identify areas for im 4 Deplment Deploymonvolonal Neural NetwoC Americign
Language (ASnvolves making the model avairoduct Here e geneep invol ? Eort thel:
Thfirst so export the trained model from teent environment.
This usy involn a format that ce loaded by the productisuch as a KDF5 filow Saved ? S
the Prodctionnmen The prodon environment neede set up t r un the deployed model.
This mve insting the necessarencies, configuring harde resettrity pro ? L the Mod Once
ton environmes set up, the exported e loaded intong the appropriate librfra ? Inu The
inputo be pcesatch the format xpected by the model. This mve scalion, a - hot encoding.
? Runiction The preprocess data cn then be fed intded model, and the model can gene
phe inputta. ? Returnlts: Thdictions can then be rned to tegted wit syst Con volual Neu -
Amignanage Tran S Dept. of CSC P 23 The deploymNion requires cful consion of the
productient, data inputt. It isportant t model isate, ent, and scalable to met te int d Dept.
of CSC P 42 Cional Neural N e twork - Amign Lanslation Conv olual Neu - Amignanage
Tran S m Dept. of CSCage 43 C ol uti e ural Ne - Amign Lanslation S CR 6 ML RES
6perilul In this project usiNN we aecognizing Agn languagovement and to train Cre
usiollages shown in below scre Fain files of project The above pictucts about tanguage it
- Dept. of CSC P 44 F .1.3 Images of H let Fter Convoltion eu - Amignanage Tran S Dept.
of CSCage 45 The below is he first prong in command pro: Funning tng commmpt Then
afte the coand in command prompt ithe sign page as shown b Fon of project
Convoltionral N e twork - Amignanage Tran latio nyst em Dept. of CSC P 46 Fproject The
above is taand fan proche recognit sign l Frediction of a letter Convoltion eu -
Amignanage Tran S Dept.
of CSCage 47 Frediction of another letter F .1.10 Adding of letters as st The abovee the
picturs of the sign languagecion which are recog a and ed the which being to webcam
itdisplthe desired letter. Dept. of CSC P 48 Convoltionral Network – Ame rican S
Lguslation S 6catio The applions ofican S(ASing Chm are divee - rging.
Here ae ocons: ? Eu: ASL translation systn educatings tf and hard of hnts wi - titiectur
discussi ? Health ASL trtiems can be used hcarings t effve comm icatibetween
hhcafeonals and deaf oof hearing patihat teive the bsible. ? Customer servic ASL ystems
can ber see setto facilate common eaf of ers stom service resentatives, isfon and abily.
? Bsin ASL translation sbusiings tve common between ard of he mpleng to create ve. ?
Etertaint: ASL traystems can be uhe entertainment iry to provide deard of hviduals
witccess tve perce ? Ecy s: ASystems canrgency srvices setto facil itunicatieafearing
indiduals and emergenesponders, eeive hey need in mely manner.
devent systhas the to y the af hof commy, increase accbily and common opti As arch
devent thiarea nuitis kely we eve more stiandaccu translatisystemergWitthe
availitchnologirces,bilies for imanslation are endle can xpect tright future for thi logy. 7
The e AmeSLangu(AL) usingCN hm is andpromisih nued ancemnts
deeplearningandmachine arning, is kely ASL slation ems even oracate, and userfriendly.
Onal ar of dent is the use oeal - ti ASL tems. Cy, ASL nslation ems the to cord upla of r
sign whichis transla ted o ritten spokenlanguagHowevewith developmof - titranslation
ems, would able commmore fluidland y r - ticonvons. could f nd ohe indito these es
ease d cy. Additiy, is need r ASL translation sthat arecificalled to rcts anons ig language.
ASL ariesby and itis portant developtra nslation ems that are arate and urallive.
Overallthe e eof translation g NN hm promisiand with nued rch developmwe n to
signiaadvancements thihat wil greatlhe deardcommy. Dept. of CSC P 52 Convoltionral
Network – Ame rican S Lguslation S BIIOGRH [1] Jin CMH.
A mobicatirican siage translation viage prong algoritP - 2016 IEEE Rion 10 Sium TENS p.
104 - 109. [2] FJL, Cpa AEG, Enciso Ron ional neural n static hand gestures rgnitent
iatures. Peedi IEEE 24th Internonal Cectronics, ned Cing, INTERC p. 5 - 8 [3] Kumar P,
Gauba H, RPoupled HMM - based mul - senso for sign recognion. Pern Recogniters.
2017; 86: p. 1 - 8 [4] Hu, J., S., & Squee - and - Exciton Networks.
IE Ce on Ceron and Pern R ecoion (CVP). [5] Huang, J., Zhou, J., Xu, W., & Huang, Y. ign
Langu Rion Bonvolutwork. IEEEcess, 6, 5 - 57702. [6] Ding, S J., &Sign Languag R ion
Usiable Sensors. IEEE Tracti - Matems, 49(4), 315 - 326. [7] Aly W, Aly Sairi . User -
independerican siage alph recognitepmage and feEE Ac 7: p. 123138 - 123150 [8] Ye, Y.,
& Hu, J.
(2020ign Langu Rion witign Language Alphabet and Actiecion Network. IEEE Access, 8,
1710 - 1718. [9] lkandari, A.,ari, F., & Alrashedi, S Secognitng Deep Cional Neu etworks. e
- 190068. [10] Li, Shen, Y. (2021). Amign eion Usi Cional Neural Neess, 9, - 23372.
INTERNET SOURCES:
-------------------------------------------------------------------------------------------
0% - Empty
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
0% - [Link]
International Journal of
INTELLIGENT SYSTEMS AND APPLICATIONS IN
IJISAE 1
ENGINEERING
ISSN:2147-6799 wwvv.l][Link] Original Research Paper
- Recommendation systems, Deep neural networb Collaborative filtering Multi-model deep learning, Explicit feedback
1
Prafessar, Dept. af CSE, G. PuJÆaiah College af Engvaeermg and
Technology, UumaaÆ, Andhra Pradesh, India
Email: mmdraktonan@[Link]
2
M. Tech Studert, Dept. af CSE, G. PulJaiah College af Engvaeermg and
Technology, UumaaÆ, Andhra Pradesh, India
E-mail: raiyerramaÆa@gmaiÆ.cam
3
Asst. Professor, HOD-CSE, Dept. af CSE, G. PuJÆaiah College aj
Eng meering and Technology, KurnaaÆ, Andhra Pradesh, India
E-mail: srddcshmicse@[Link].m
4
Sr. Professor and Dean Innmmians , Dept. af CSE, G. PuJÆaiah College ofEngiæering and Technology, UumaaÆ, Andhra Pradesh ,India Email:
dean@gpcä.[Link]
5
Asst. Professor, Dept. afCSE, G. PuJÆaiah College af Engineering and
Technology, UumaaÆ, Andhra
E-mail: Nasulu4u@gmaiÆ.cam
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-12 |
for more effective methods to enhance the performance recommendation algorithms. The methodology includes
of recommendation systems[2]. data preprocessing, splitting the dataset into training and
testing sets, using a deep autoencoder neural network to
The main challenges in enhancing the performance of
learn latent features of users and items, and combining
recommendation systems are the limited ability of
learned with collaborative filtering to generate
traditional collaborative filtering methods to learn from
recommendations. The proposed algorithm has the
user preferences and the issues of scalability and data
potential to be applied in various fields such as
availability. Data sparsity and cold start problems pose
ecommerce, social netvvorks, and online advertising.
significant challenges that must be addressed to provide
accurate recommendations to users. The multi-model deep In their paper[4] present a novel neural network approach
learning approach proposed in this study addresses these to collaborative filtering for implicit feedback data. The
challenges by comb ining user and item fi_lnctions to authors acknowledge that traditional collaborative filtering
produce a hybrid recommendation system with invroved methods have limitations in dealing with inv licit feedback
performance. data, such as data sparsity and cold-start problems. To
overcome these limitations, the authors proposed a neural
The main objective of this study is to propose a multimodel
nebvork-based approach that considers not only user-item
deep learning approach that comb ines traditional
interactions but also auxiliary information such as user and
collaborative filtering methods with deep neural nebvorks
item attributes.
to enhance the performance of recommendation systems.
The proposed approach aims to address the issues of The paper [5] presents a comprehensive survey of
scalability and data availability and overcome the collaborative filtering (CF) techniques for invlicit
limitations of traditional collaborative filtering methods in feedback datasets. The authors provide an overview of
learning from user preferences. The effectiveness of the the prob lem, including the challenges and limitations
proposed approach is evaluated by analyzing three models associated with the implicit feedback datasets. They
that produce a wide range of outcomes from a single review a variety of CF methods, including matrix
real-world dataset The objective is to demonstrate the factorization, neighborhood-based CF, and deep
potential of the MMDL approach to enhance learning-based CF. The survey also covers recent
recommendation systems with explicit feedback. advances in the field, such as the use of graph-based
approaches and hybrid models that combine multiple CF
This study proposes a solution to address research issues
techniques. The authors discuss the evaluation metrics
related to noisy invlicit feedback signals using I)NNs. To
used to measure the performance of CF methods,
enhance the efficiency of the collaborative filtering
including accuracy, coverage, and diversity. They also
algorithm, the Deep Auto-Encoder Neural Netvvork
highlight the invortance of addressing the cold-start
(DeepAEC) and One-Dimensional Traditional Neural
problem and provide an overview of the techniques
Network (ID-CNN) approaches are combined in a
proposed to tackle it. The paper concludes with a
multimodel deep learning method. We evaluate
discussion of the fi_lture directions in the field, including
models on a single real-world dataset and show how
the need for more research on the interpretability of CF
well DeepAEC and ID-CNN function as collective filtering
methods and the integ•ation of context-awareness into
strategies.
recommendation systems. Overall, the paper provides a
The remainder of the paper is structured into the following convrehensive overview of the current state of the art in
key sections: Section 2 presents the literature review, CF for invlicit feedback datasets and is a valuable
Section 3 outlines the proposed method, and Section 4 resource for researchers and practitioners in the field
presents the results of the testing conducted. Section 5
The author proposed a novel approach to invrove the
presents conclusion of the paper.
accuracy of collaborative filtering recommendation
2. Literature Survey systems a combination of deep neural netvvorks [6]. The
authors utilized a dataset from the MovieLens website to
Collaborative filtering (CF) is a popular approach for
evaluate their proposed algorithm, which achieved
building recommender systems in which user-item
superior performance over traditional collaborative
interactions are used to generate personalized
filtering methods and other state-of-the-art
recommendations. Recently, there has been increasing
recommendation algorithms. The methodology involved
interest in enhancing CF with deep learning techniques.
data preprocessing, splitting the dataset into training and
In [3], a hybrid approach that combines collaborative testing sets, and applying a deep neural network to learn
filtering and deep autoencoder neural network is proposed the latent features of users and items. The authors used a
to enhance recommendation systems' accuracy. The combination of different types of neural networks,
proposed algorithm is evaluated using a dataset from including autoencoders and convolutional neural
MovieLens website and found to outperform traditional networks, to extract the relevant features from the data.
collaborative filtering methods and other state-of-the-art The learned features were then combined with
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 14
collaborative filtering to generate recommendations. The 3. Proposed Work
proposed algorithm has the potential to be applied to
To evaluate the effectiveness of Deep Auto-Encoder
various domains such as e-commerce, social networks,
Neural Network (DeepAEC) and One-Dimensional
and online advertising. It can also help address the
Traditional Neural Network (ID-CNN) approaches as
common limitations of collaborative filtering such as the
collective filtering strategies in the context of noisy
cold-start problem and sparsity in user-item interaction
invlicit feedback signals. This objective aims to assess
data. Overall, this study presents a promising approach
how well the proposed multi-model deep learning
for invroving the accuracy and scalability of
recommendation systems through the combination of
deep neural networks and collaborative filtering
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 15
(MMDL) method works in enhancing the efficiency of
the collaborative filtering algorithm in the presence of
noisy invlicit feedback signals
user The
passage
Rating
also notes
that the
matrix is
typically
sparse
because
of no Dark
Knight
users only
Movies rate a
1 5 4
small
user number of
movies,
4 resulting in
mostly
Rating 5 envty
cells. This
can pose a
Movies challenge
3 for
Fig. 2: Rating Matrix -2 Simple demonstration of inv licit and exp licit collaborati
feedback differences observed ve filtering
way to represent user-movie interactions in a approaches, which rely on finding similarities between
collaborative filtering approach. users and movies based on their ratings.
The invlicit feedback matrix R is defined as Rmn, where How to (_•alculate ratings:
m and n respectively represent a group of users and
The notation used in the explanation is as follows:
objects (movies). The entry rui in the matrix represents
the rating that user i has given to movie u. If the user has R: rating assigned by user U to item I
not rated the movie, the enh•y is left blank. A value of 1 is
used to indicate that there is a user-movie interaction, U: the user who is assigning the rating
which means that the user has rated the movie. l: the item being rated
The latent features of the users are represented by the n: the number of users who are similar to user
vector xu of dimension n[7]. The matrix x is defined as U and used to calculate the average rating
x[Rma], which means that the latent features of the users
h : rating given to item I by user i
are derived from the entries in the inv licit feedback
matrix R. Similarly, the latent features of the movies are sim(U, i): similarity between user U and user i
represented by the matrix x[Rnb].
The equation used to calculate the average rating is:
However, the lack of negative feedback can make training
recommendation systems difficult because it is not clear R (l/n) * * sim(U, i)) (1)
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 17
In this equation, E represents the sum of the values in Let M be a m x k matrix, where m is the
the brackets. So, for each user i who is similar to user total number of movies and k is the
U, we calculate the product of their rating for item I
(Ti)and their similarity to user U (sim(U, i)). We then total number of features.
add up all of these products and divide by the total
Let mi be the feature vector for movie i,
number of similar users (n) to get the average rating
assigned to item I by users similar to user U. This where -mi = [mil, mi2, •
average rating is then used as the predicted rating (R) Let mij be the value of the jth feature of
that user U would
movie i, where m ij is a scalar value.
assigl to iteml.
2. The u. data file: This file contains the
usersubmitted ratings for the movies in the dataset The
mathematical notation for the [Link] file can be
represented as follows:
genre, etc. The mathematical notation for the u. item file adjusting the embedding vector so that the predicted value
is as close as possible to the actual value. To improve
can be represented as follows:
accuracy, more epochs can be used during the training
process. The loss function used in this case is the
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 18
difference between the actual and predicted ratings for the Encoder: The encoder fi_lnction f takes the feedback
entire training set. matrix R and it to a hidden representation h of size k. f
(R)
Once the model has been trained, it can be used to make
Decoder: The decoder fi_lnction g takes the hidden
predictions on the test set. The predicted rating is
representation h and maps it back to the reconstructed
generated using the trained model, based on the user and
feedback matrix R'. g(h)
movie identifiers[13]. In this exanvle, the first 10 rows of
the test set are used to generate predicted values for user Loss function: The goal of the autoencoder-based
and movie IDs. However, this process can be extended to recommendation system is to minimize the difference
predict values for the entire test set Based on the highest between the actual feedback matrix R and the
predicted rating for a specific user, movie reconstructed feedback matrix R'. This is achieved by
recommendations can be made. using a loss function that measures the difference
3.3 Autoencoder for deep neural networks between R and R'. L(R, R') where . I l A
An autoencoder can also be used for recommendation 2 is the Frobenius norm between R and R'.
systems with ezvlicit feedback, where users provide Training: The autoencoder is trained by minimizing the
explicit ratings or feedback for items. In this context the loss function L(R, R') using an optimization algorithm
autoencoder is trained to learn an efficient representation such as stochastic gradient descent (SGD)[14], [15].
of the user- item feedback data.
The objective of the autoencoder-based
The mathematical notation for an autoencoder-based recommendation system is to learn an efficient
recommendation system with explicit feedback can be representation of the useritem feedback data that
expressed as follows: captures the most invortant features of the feedback.
Let R be the user-item feedback matrix of size n x m, This representation can then be used to recommend
where n is the number of users and m is the number of items to users based on their past feedback. By reducing
items. the dimensionality of the feedback data, autoencoders
can heh) to mitigate the sparsity prob
lem that is common in
recommendation systems with
explicit feedback.
(3)
The output of the ID convolutional
Fig. 3. Model of the encoder and decoder layer for user i and filter j is given
3.4 ID convolution neural network architecture by:
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 19
(5) Fully connected layer:
Step 4 : Fully connected layer: Let be the weights for neuron k in the fi_llly connected
layer. The output of the fi_llly connected layer for user i
Let gkbe the weights for neuron k in the fully connected
and filter j is given by:
layer. The output of the fully connected layer for user i
and filter j is given by: = * Pi,j
(6) Output layer:
Step 5: Output layer: Let Yi,j be the predicted rating or feedback value for user
Let Yi,j be the predicted rating or feedback value for i and item j. The output of the output layer is given by:
user i and item j. The output of the output layer is given
Yi,J 1, Zi,j, 2, .
by: Yi,J (7)
where h is the activation function for the output layer.
where h is the activation fi_lnction for the output layer.
3.5.2 DeepACE model:
3.5 Ensemble Model: ID-CNN and DeepACE
The DeepACE model takes as input a matrix of useritem
To combine the ID-CNN and DeepACE models for feedback values and produces a rating prediction for each
recommendation systems with explicit feedback, we
user-item pair. The architecture of the DeepACE model is
can use an ensemble approach where we take the
defined by the following mathematical notations :
predictions from both models and combine them to
produce a final prediction. Input layer:
Let R be the user-item feedback matrix of size n x m, Let X be the input matrix of user-item feedback values.
where n is the number of users and m is the number of
items. X = [x_1,1,x_1,2, ..., x_l, m;
ID convolutional layer:
ID convolutional layer:
Let wj be the filter weights for filter j, represented as a
Let w; be the filter weights for filter j, represented as a
sequence of weights. sequence of weights.
(9)
The output of the ID convolutional layer for user i and The output of the ID convolutional layer for filter j is
filter j is given by: given by :
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 20
Output layer: One approach to combining the ID-CNN invlemented using Python 2.7 and Keras 2.0 with
and DeepACE models is to use the output of the IDCNN TensorFlow 3.0 sewing as the backend [16], [17].
model as input to the DeepACE model. This can be done
4.2 Dataset description
by taking the flattened output of the ID-CNN layer and
using it as the input to the embedding layer of the The MovieLens dataset [18], [19], [20] is a result of
DeepACE model ongoing research by the Movielens project and is
published by the GroupLens Study at the University of
Let X be the input matrix, where each row represents a
Minnesota. It is widely used to evaluate collaborative
user and each column rqresents a movie, and R be the
filtering techniques and is available in various formats
rating matrix, where each element Rij represents the
on http ://[Link]. com. The MovieLens 100k
rating given by user i to movie j. Let fl be the function
dataset contains 100,000 movie reviews, with each user
that represents the ID-CNN model, and f 2 be the function
providing more than 20 ratings, ranging from 1 to 5. As
that represents the DeepACE model. Then, the comb ined
these ratings are explicit, we selected this dataset to
model can be represented as follows:
study how explicit feedback can be learned from inv
Z = fl(X) licit ratings. By converting each element to a binary 1 or
0, indicating whether the user has ranked the item or
E = f2(Z) not, we converted the invlicit data to explicit data. All
where Z is the output of the ID-CNN model, and E is the test processes return predicted ratings, accepting input
embedding matrix of the DeepACE model. parameters of (user, item) pairs. After the data is read in,
it is inserted into the rating matrix*s user row and item
The loss function for this combined model can be a column, creating matrices of sizes (943 x 1682), (6040 x
weighted sum of the losses from the two models: 3952), and (100k) for MovieLens.
L = a Ll + (1 - a)L2 4.3. Metrics for evaluation are displayed.
where Ll is the loss function for the ID-CNN model, L 2
The root mean square enor (RMSE) for the suggested
is the loss function for the DeepACE model, and a is a
model's prediction ability can be defined as:
weighting factor that determines the relative invortance of
the two losses.
RMSE = sqrt((l/n) * — -r) A 2))
The combined model can be trained using where:
backpropagation and stochastic gradient descent to
minimize the loss function. Once trained, the model can • r is the genuine rating for a particular film
be used to make predictions for unobserved ratings.
• n is the total number of predicted films
In summary, the combined model uses a ID-CNN model
to exrtmct features from the irvut matrix, and a DeepACE • rui is the predicted value for user u for the particular
model to learn embeddings for the users and movies. By film
combining these Ovo models, we can leverage the 4.3.1 Existing Method (Recommender Net)
strengths of both approaches to invrove the accuracy of
the rec ommendations.
4. Experimental Study
This section convrises several parts. In Section 4.1, we
discuss the experimental setup for our study. In Section
4.2, we provide a description of the dataset that was used
for the experiments. Section 4.3 describes the metrics that
were used to evaluate the performance of our model
Section 4.4 presents the settings of the proposed structure
for multi-modal deep learning and the results obtained
from the experiments. Finally, in Section
4.5, we perform a performance analysis of our model.
4.1 Experimental setup.
For our experiments, we utilized a system with the
Windows operating system, four 3.10 GHz Intel Core
i52400 CPUs, and a 500 GB hard disk. Our model was
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 21
num users, num movies, training set, and test set after In the context of a CNN ID model[24] for
the Output of the top 10 recommendations shown in the recommendation systems with explicit feedback, the
ExistingSystemModelloss
0.630
Q625
0.620
0.615
0.610
Fig. 4: Existing_plotgraph
4.3.2 Neural network architecture of ID convolution is particularly adept at processing data with a grid-Ike
A convolutional neural network sometimes referred to as architecture, such as user IDs, movie IDs, and ratings,
before producing an output concatenated into a fi_llly
a CNN or ConvNet, is a subclass of neural networks that
connected network.
CNN-ID System Model loss
(1325 (1320 e15 (1310
u305
(1300
Q95
Q90
2.0
epoch
Fig. 5: CNNIDp10t_yaph
Existing Method (Recommender Net) CNN ID accuracy measures how well the model can predict the
(Convolutional Neural Network ID) is a deep learning user's rating of a movie. The loss measures the
model commonly used for processing sequential data difference between the predicted rating and the actual
such as time-series or text data. One way to evaluate the rating. A low loss indicates that the predicted rating is
performance of a CNN ID model is to plot its training close to the actual rating, while a high loss indicates the
and validation accuracy and loss over epochs. opposite.
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 22
The training accuracy and loss indicate how well the Overfitting occurs when the model starts to memorize the
model is fitting the training data during the training training data instead of learning from it. This can lead to
process, while the validation accuracy and loss indicate high training accuracy and low training loss, but poor
validation accuracy and high validation loss. In this case,
DeepAEC System Model loss the model is not generalizing well to
new data, and we need to tune the
hyper parameters or modi_W the
model architecture to prevent
overfitting[21], [22], [23] .
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 23
RMSE measurements. We used
the 100k MovieLens dataset as a
real-world dataset to assess the
model. In future work, we plan to
focus on explicit feedback
because inv licit feedback is not
sufficient to create a conv lete
recommendation system Overall,
our proposed MMDL approach
has the potential to invrove the
effectiveness of recommendation
systems by addressing the sparsity
Fig. 7. Performance convarison issue commonly associated with
The results obtained from the experiments show that the CF techniques.
DeepACE architecture was able to achieve optimized
results with low loss convression convared to existing References
works. This means that DeepACE was able to J. Zhang, X. sun, Z. xu, and H. wU (2020). A hybrid
effectively reduce the size of the data while retaining its collaborative filtering algorithm based on deep
invortant features, resulting in a compressed version of autoencoder neural network. Journal of Amb ient
the data with minimal loss of information. Intelligence and Humanized Computing, 1 1(7),
Comparing the results of DeepACE to those of existing 2639-2650.
works, it is evident that DeepACE outperformed them [2] D. Kim, D. Lee, and S. Lee. (2020). A novel neural
in terms of convression and preservation of inv ortant network approach to collaborative filtering for inv
features. The low loss compression obtained with licit feedback data. Expert Systems with
DeepACE inv lies that the convressed data can be stored Applications, 142, 112963.
and transferred with ease, saving storage and network
[3] A. Das, A. Ghosh, and B. Chakrabarti. (2021).
resources, without significantly invacting the
Collaborative filtering for inv licit feedback datasets:
performance of downstream tasks that utilize the data.
A survey. ACM Convuting Surveys, 54(1), 1-45.
Overall, the results suggest that DeepACE can be a
promising architecture for handling large datasets in [4] Y. Kim, D. Park, H Shin, and S. Kim (2022).
recommendation systems with explicit feedback, Combination of deep neural networks for
providing effective convression while retaining essential collaborative filtering recommendation.
information. Knowledge-Based Systems, 239, 107224.
5. Conclusion and Future Scope [5] Y. Liu, X. Liu, and H Xiong (2022). Attentional
collaborative filtering with user item attention and
Collaborative filtering (CF) techniques are widely used
rating bias. IEEE Transactions on Neural Networks
in recommendation systems. However, one of the main
and Learning Systems, 33(3), 637-650.
challenges with CF approaches is the sparsity of data,
which stems from the fact that data is often presented in [6] Abba Almu& Ziya'u Bello (2021). An Everimental
the form of a matrix of ratings, making it difficult to Study on the Accuracy and Efficiency of Some
scale. In this research paper, we propose a multi-modal Similarity Measures for Collaborative Filtering
deep learning approach that combines a DeepACE Recommender Systems. International Journal Of
neural network with a ID conventional neural network Convuter Engineering In Research Trends, 2(11),
to address this issue. To evaluate the performance of the 809-813.
proposed model, we convared it to the state-of-the-art
[7] Z. Zhang, X. wu, J. wu, and H Shao. (2023).
techniques. Our experiments showed that the WIN'IDI_,
Collaborative filtering recommendation based on
outperforms other well-known approaches in terms of
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 24
deep neural network with dual channel attention. Neural Convuting and Applications, 35, 72097222.
[8] J. Zhu, C. Zhang, X. Gao, and H xu. (2021). A Emotion Recognition from Speech Audio
two-stage collaborative filtering recommendation S ignal", Security and Communication Nebvorks,
algorithm based on deep autoencoder neural vol. 2022, Article ID 8777026, 10 pages, 2022.
network. International of Distributed [Link]
Sensor Netwolks, 17(5), 15501477211016347.
[17] Chen, W., Wang, Y., & Zhang, X. (2020). A Novel
[9] L. Zhang Y. Li, and X. Li. (2021). Hybrid Multi-Model Deep Learning Approach for
collaborative filtering with neural network and Collaborative Filtering. Journal of Intelligent &
fuzzy logic for recommendation International Fuzzy Systems, 39(5), 6565-6574. doi:
Journal of Distributed Sensor Networks, 17(10), 10.3233/JIFS-189559
15501477211050750.
[18] Maloth, B. , Suman, J. , Saritha, G. , &
[10] Kirankumar, A. , Reddy, P. G. K. , Reddy, A. R. C. , Chandrasekhar, A. (2012). Non linear programming
Shivaji, B. , & Reddy, D. J. (2014). A Logic-based convutation outsourcing in the cloud. Int. J.
Friend Reference Semantic System for an online Convut. Sci. frig Technol., 2(3).
Social Networks. International Journal Of [19] Chen, W., Wang, Y., & Zhang, X. (2021). Multi-
Computer Engineering In Research Trends, 1 (6), Model Deep Learning Approach to Collaborative
501-506. Filtering. Journal of Intelligent & Fuzzy Systems,
[11] Chen, W., wang Y., & Zhang X. (2019). 41 (1), 1301-1310. doi: 10.3233/JIFS-201898
Hlhancing Collaborative Filtering with Multi- [20] S uneel, Chenna Venkata, K. Prasanna, and M.
Model Deep Learning Approach. In Proceedings of Rudra Kumar. "Frequent data partitioning using
the 2019 3rd International Conference on Cloud parallel milling item and
Computing and Big Data Analysis (pp. 219-224). MapReduce. " International Journal of Scientific
ACM. doi: 10.1145/3320254.3320281 Research in Convuter Science, Engineering and
[12] Chen, W., wang Y., & Zhang X. (2019). A Multi- Information Technology 2.4 (2017).
Model Deep Learning Approach to Hlhance [21] Rudra Kumar, M. , Rashmi Pathak, and Vinit
Collaborative Filtering. In Proceedings of the 2019 Kumar Gunjan. "Diagnosis and Medicine
IEEE International Conference on Big Data (pp. Prediction for CC) NAD-19 Using Machine Learning
3741-3746). IEEE. doi: Approach. " Convutational Intelligence in Machine
10.1109/BigData47090.2019.9005983 Learning: Select Proceedings of ICCIML 2021.
[13] Chen, W., wang Y., & Zhang, X. (2020). Singapore: Springer Nature Singapore, 2022. 123-
Collaborative Filtering with Multi-Model Deep 133.
Learning Approach. In Proceedings of the 2020 4th [22] N. Satish Kumar & Sujan Babu Vadde (2015).
International Conference on Cloud Convuting and Typicality Based Content-Boosted Collaborative
Big Data Analysis (pp. 115-120). ACM. doi: F i Itering Recommendation Framework.
10.1145/3371671.3371692 International Of Convuter Engineering In
[14] A. Avinash, & [Link]. (2016). Location-Aware Research Trends, 2(11), 809-813.
And Personalized Collaborative Filtering For Web [23] Chen, W., Wang, Y., & Zhang, X. (2022).
Service Recommendation. International Journal of Collaborative Filtering Enhanced by Multi-Model
Computer frigineering In Research Trends, 3(5), Deep Learning Approach. Journal of Intelligent &
356-360. Fuzzy Systems, 43(2), 1891-1900. doi:
[15] Rudra Kumar, M. , Rashmi Pathak, and Vinit 10.3233/JIFS-219870.
Kumar Gunjan. "Machine Learning-Based Project [24] Kumar, P. ., M. K. ., Rao, C. R. S.
Resource Allocation Fitment Analysis System Bhavsingh, M. ., & Srilakshmi, M. (2023). A
(ML-PRAFS). " Convutational Intelligence in Convarative Analysis of Collaborative Filtering
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 25
Machine Learning: Select Proceedings of ICC-INTL S imilarity Measurements for Recommendation
2021. Singapore: Springer Nature Singapore, 2022. Systems. International Journal on Recent and
Innovation Trends in Convuting and
[1 6] M. M. Venkata Chalapathi, M. Rudra Kumar, Communication, 11(3s),
Neeraj Sharma, S. Shitharth, 'Ensemble Learning by [Link] VI 1 i3s.6180.
High-Dimensional Acoustic Features for
International Journal of Intelligent Systems and Applications in Engineering IJISAE, 2023, Il(6s), 01-26 | 26