0% found this document useful (0 votes)

38 views31 pages

Machine Learning Speech Aid Project

The document describes a machine learning based speech aid project for silent communication. It presents the project to Visvesvaraya Technological University. The project uses EMG signals collected from the submental triangle region during silent speech to recognize letters using machine learning algorithms. It trains models on a dataset containing EMG signals for letters and evaluates the performance of algorithms like SVM, KNN, decision trees etc. to select the best for real-time use. It also details plans to design hardware to interface muscle sensors, process signals and integrate the trained model into a system for silent speech recognition and communication assistance.

Uploaded by

Laxman Jakati

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

38 views31 pages

Machine Learning Speech Aid Project

Uploaded by

Laxman Jakati

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PPTX, PDF, TXT or read online on Scribd

VISVESVARAYA TECHNOLOGICAL UNIVERSITY

BELAGAVI, KARNATAKA – 590018

2022-2023
Project
Presentation on
“MACHINE LEARNING BASED SPEECH AID FOR
SILENT COMMUNICATION”
Presented By:

LAKSHMANA 1SK19EC016
ANAND 1SK20EC400
KIRAN KUMAR S 1SK20EC409
MAHESH S 1SK20EC410
Under the guidance of:

Dr. N Sathisha
Assistant Professor
Department of Electronics and Communication Engineering
Government Sri Krishnarajendra Silver Jubilee Technological Institute
K. R. Circle, Bangalore - 560001
Contents
 Introduction  Hardware Setup
 Literature Review  Results
 Aim & Objectives  Limitations and Future works
 Methodology  Conclusion
 Dataset  References
 Pattern Recognition
 Flow chart
 Hardware and Software Requirements
INTRODUCTION
1. Effective communication is essential for humans to express their emotions, convey their
opinions and ideas to others. The main objective of communication is to ensure that the
receiver comprehends the message that the speaker intends to convey.
2. Individuals with certain medical conditions such as paralysis, stroke, Amyotrophic Lateral
Sclerosis (ALS) and cerebral palsy lose their ability to speak and move their bodies.
3. Currently, there are conventional speech interfaces available in the market to assist these
individuals. However, these devices are often expensive and complex to use. Some of
these devices even require surgical implantation, adding to the complexity and cost.
4. It is essential to explore and develop advanced technologies that can assist these
individuals in communicating effectively without the need for invasive procedures or
Cont….
Prerequisites and important definitions for Silent
Communication:
 Silent speech: Silent speech is a minimally or internally
articulating words without producing sounds or moving the
mouth.
 Electromyograph (EMG) signals: Electromyography
signals are electrical signals generated by muscle
contractions.
 sEMG signals :EMG signals collected from surface
electrodes into real speech.
 Submental Triangle is a key location for EMG-based SSI
systems as it contains the mylohyoid muscle, which is an
important muscle involved in speech production.
Cont….
Silent speech

EMG Signals

Processing and
Pattern recognition

Computer Algorithms

Speech / Text
Literature Survey
SL NO TITLE OF THE PAPER AUTHOR AND YEAR OF OBSERVATION AND OUTCOMES
PUBLICATION

01 Parallel Inception CNN Jinghan Wu, Tao Zhao, Deep learning architecture named parallel
approach for Facials EMG Yakun Zhang inception convolutional neural network
based Silent Speech 2021 (PICCN) for up-to-date feature extraction.
Recognition Accuracy of 89% was achieved by this
proposed system.

02 A wearable graphene strain Dafydd Ravenscroft, Project aimed at designing a wearable

gauge sensor with haptic Ioanniks prattis, Tharun strain sensor, twisted and coiled polymer
feedback for silent Kandukarti and a graphene-based actuator and
communication 2021 developed a dataset for classification.
Haptic feedback was implemented to
inform the user or the listener on successful
translation.
Literature Survey
SL NO TITLE OF THE PAPER AUTHOR AND YEAR OBSERVATION AND OUTCOMES
OF PUBLICATION
03 A review on silent speech Jose E, Gonzalez Lopez This review focuses on providing new
interface for speech restoration 2020 alternative and augmentative
communication methods for the persons
with severe speech disorders.

04 Silent Speech Recognition by Andrzej [Link], Piotr This paper also describes the experimental
Surface Electromyography Pruchnicki, Przemysław setup and evaluation of the system using a
Plaskota, Piotr dataset of sEMG signals recorded from
Staroniewicz, Stefan eight healthy individuals performing silent
Brachmański and Maciej speech tasks. The results showed that the
Walczyński system achieved high accuracy in
recognizing silent speech commands, with
an average recognition rate of 91.5%.
Aim and Objectives
Aim:
To design ML based speech recognition aid for silent communication for disabled and
paralyzed people.

Objectives
 To design a low-cost speech aid system, compare to existing system.
 Use the available CSV data to train multiple ML models and compare each ML model
accuracy values to identify best EMG classifies and show result on web interface.
 Electromyography signals are traced from the sensor (muscle bio amp candy sensor/
myoware muscle sensor).
 To build hardware that can be interfaced with software and design a real-time system that
Methodology

Part A
1. Collect EMG CSV data from the internet.

2. Splitting available CSV data into Training, testing and validation datasets

3. Training ML model on dataset (which is Split into train and test datasets) ML model is trained 7
prominent classification algorithms such as, Support vector machine, K neighbor, Decision tree,
linear discriminant, quadratic discriminant and Naïve bayes.

4. Evaluate the performance of different machine learning algorithms and comparing their testing and
training accuracy of each model on split dataset.

5. Evaluating the performance by calculating the accuracy, precision and recall ability, and also the
F1 score for training and testing the model on all ML algorithms.
Cont…

6. Determining the best model for pattern recognition, based on the accuracy of individual
model.

7. For all these building a web application which shows the result on web interface

8. Showing the Step 4 result on streamlit based web application for all trained ML models.

9. Building the Functional switch button on Streamlit web interface so as to interface with the
hardware that will be built so as to predict the signals that occurs in real time.
Cont…

Part B

1. Design and implement a hardware system for the speech recognition process

2. Designing a simple system using muscle bioamp candy sensor that detects the muscle activity
and then assigning a different threshold value for letter A, E, I, O, U and Blank threshold
values.

3. Noting down the threshold values implementing signal processing and feature extraction using
models like CNN for real time signals and saving the Realtime CSV data to SD, so this will be
helpful to integrate with hardware built in Part A.

4. Making new dataset, training and testing new data set and testing the model on already trained
ML Models and test for real-time accuracy on all Machine learning Models.
DATASET
 EMG data was collected from the internet.

 The dataset contains EMG signals for the letters A, E, I,

O, U, and B, with a total of 1021 instances and 3000
attribute values.
 To split the dataset into training, testing, and validation
sets, we used the scikit-learn library in Python.
 Dataset randomly divided the data into 70% training,
20% testing, and 10% validation sets. This ensures that
the model is trained on a sufficiently large dataset while
also being tested and validated on new and unseen data.
Pattern Recognition

Algorithms Divisions

Linear Discriminant Linear Line

Quadratic Discriminant Quadratic Line

Native Bayes Independent Predictors

Tree Split into large sets

K Neighbors Closest data point

Ensemble Combining models

Support vector Machine Line with boundary

13
Flow Chart: Part A

Silent Speech EMG Split Dataset to Train

Train ML models
CSV Data and Test

Designing a web
interface to show result
Determining the best Evaluate Each Model
and should be
ML algorithm Performance
interfaced with part B
work
Flow Chart: Part B

Building a hardware Detect the signals Testing Sensor for

using Bio using sensor for muscle particular values and
amp/Myoware sensor Contractions at region observe threshold for
for Silent Speech of Submental triangle particular values

Using and testing Part Pattern recognition of

B work with Part A work Developing new csv EMG signals at same
and check for real time and testing this with threshold values using
working of the device. trained ML model models like CNN
Hardware/Software Requirements
Software requirements
1. Arduino IDE
2. Python+Google Colab

Hardware requirements
3. Arduino Mega 2560.
4. Bread board power supply stick 5v/3.3v.
5. Muscle Bioamp Candy Sensor
6. Myoware muscle sensor development kit.
7. 16 pin LCD
8. SD card.
9. Electrodes
10. Led's.
Hardware Setup
Results

ML Model train
Results

ML Model Train Test

Results

Model Evaluation
Results

Building WEB interface

Results

Building WEB interface

Limitations and Future work

Limitations:

 Sensor Limitations: The Muscle Bioamp candy sensor used in the project is only suitable
for hard muscle contractions and is not helpful in detecting speech signals. This limits the
ability of the system to accurately detect and recognize speech signals.

 Cost of Sensor: The cost of the Myoware muscle sensor is high, which can be a limitation
for implementing the system in real-world scenarios.
Limitations and Future work

Future work:

 Integration with a better sensor
 Real-time speech recognition
 Integration with other modalities
 Improved feature extraction techniques: The current system uses CNN for feature
extraction. Future work can involve exploring other feature extraction techniques that can
improve the accuracy and performance of the system.
 Development of a user-friendly interface
Conclusion

 Our methodology involved collecting EMG data for the letters A, E, I, O, U, and B, and
splitting this data into training, testing, and validation datasets.
 We trained seven different prominent classification algorithms, including Support Vector
Machine, K-Nearest Neighbors, Decision Tree, Linear Discriminant, Quadratic
Discriminant, and Naïve Bayes, on the split dataset.
 After training the machine learning models, we evaluated their performance by calculating
accuracy, precision, recall, and F1 scores for both training and testing datasets.
 We determined the best model for pattern recognition based on the accuracy of individual
models.
Conclusion

 We then built a web application that shows the results on a web interface, which can be used to
switch the hardware on and off to take new real time sensor values.
 In addition to the machine learning work, we designed and implemented a hardware system for the
speech recognition process.
 However, we faced some limitations in our project. The muscle bioamp candy sensor we used was
not suitable for detecting speech signals and was only helpful for detecting hard muscle
contractions.
 In conclusion, our project has demonstrated the potential of machine learning algorithms in
recognizing speech using EMG signals. Our results showed that the Random Forest Machine model
was the best performing model with an training accuracy (100%), and testing accuracy (79.41%).
References

[1]. Jinghan Wu, Tao Zhao, Yakun Zhang, “parallel inception CNN approach for Facial Semg BASED
Silent Speech Recognition”.
[2]. Dafydd Ravenscroft, Ioanniks prattis, Tharun Kandukarti “A wearable graphene strain gauge
sensor with haptic feedback for silent communication”.
[3]. Alaskr “convolutional neural network application in biomedical signals” (2018).
[4]. Arslan “Observations on the characteristics of EMG signals recorded at the different depths”2020.
[5] Benaroya, E-L., Chollet, G., Denby, B., Dreyfus, G., Stone, M., Development of a Silent Speech
Interface Driven by Ultrasound and Optical Images of the Tongue and Lips, Speech Communication
(2009)
[6]. D. E. King. Dlib-ml: A machine learning toolkit. The Journal of Machine Learning Research, 2009

THANK YOU

Silent Speech Recognition with EMG Signals
No ratings yet
Silent Speech Recognition with EMG Signals
2 pages
EMG Signal Classification for Silent Speech
No ratings yet
EMG Signal Classification for Silent Speech
36 pages
Improved Model for Voicing Silent Speech
No ratings yet
Improved Model for Voicing Silent Speech
7 pages
Silent Sound Technology for Communication
No ratings yet
Silent Sound Technology for Communication
23 pages
Gamified Sign Language Learning Project
No ratings yet
Gamified Sign Language Learning Project
23 pages
Multilingual Speech Trainer Kit
No ratings yet
Multilingual Speech Trainer Kit
5 pages
Speech Recognition System in Python
No ratings yet
Speech Recognition System in Python
49 pages
Irjet V9i334
No ratings yet
Irjet V9i334
5 pages
Talk Hands Paper
No ratings yet
Talk Hands Paper
6 pages
ML Health Monitoring System Report
No ratings yet
ML Health Monitoring System Report
55 pages
Sign Language Recognition with CNN
No ratings yet
Sign Language Recognition with CNN
18 pages
A CNN-Based EMG Silent Speech Recognition Framework For Wearable Body Area Networks
No ratings yet
A CNN-Based EMG Silent Speech Recognition Framework For Wearable Body Area Networks
2 pages
ASL Recognition with Machine Learning
No ratings yet
ASL Recognition with Machine Learning
8 pages
Speech Recognition Project Overview
No ratings yet
Speech Recognition Project Overview
9 pages
Computer-Aided Speech Therapy for DLD
No ratings yet
Computer-Aided Speech Therapy for DLD
6 pages
Speech Recognition System in MATLAB
No ratings yet
Speech Recognition System in MATLAB
16 pages
ASL Gesture Recognition Application
100% (1)
ASL Gesture Recognition Application
3 pages
Subvocal EMG Recognition Post-Laryngectomy
No ratings yet
Subvocal EMG Recognition Post-Laryngectomy
13 pages
Sign-to-Speech Software Specification
No ratings yet
Sign-to-Speech Software Specification
13 pages
Real-Time Sign Language Converter
No ratings yet
Real-Time Sign Language Converter
4 pages
AI Speech Recognition Overview and Applications
No ratings yet
AI Speech Recognition Overview and Applications
14 pages
The EMG-UKA Corpus For Electromyographic Speech Processing
No ratings yet
The EMG-UKA Corpus For Electromyographic Speech Processing
6 pages
Sign Language to Text Converter SRS
100% (2)
Sign Language to Text Converter SRS
19 pages
Speech To Image Conversion: Shaik Karishma, Siddu Devi Naga Susmitha, Nanditha Katari, G. Sirisha
No ratings yet
Speech To Image Conversion: Shaik Karishma, Siddu Devi Naga Susmitha, Nanditha Katari, G. Sirisha
5 pages
Kannada ASR for Aphasia Therapy
No ratings yet
Kannada ASR for Aphasia Therapy
4 pages
Speech Recognition Project Report 2019-20
No ratings yet
Speech Recognition Project Report 2019-20
40 pages
Batch9 Project Report April 2 ChangesNeeded
No ratings yet
Batch9 Project Report April 2 ChangesNeeded
95 pages
Silent Speech Interface Overview
100% (2)
Silent Speech Interface Overview
15 pages
Enhancing Virtual Personal Assistants
No ratings yet
Enhancing Virtual Personal Assistants
4 pages
Speech Recognition and Correction Project
100% (1)
Speech Recognition and Correction Project
27 pages
Silent Sound Technology Overview
100% (1)
Silent Sound Technology Overview
4 pages
Internship Progress on Speech Recognition
No ratings yet
Internship Progress on Speech Recognition
4 pages
ASL Sign Language Detection App Overview
No ratings yet
ASL Sign Language Detection App Overview
13 pages
Sign Language Recognition App Using CNN
No ratings yet
Sign Language Recognition App Using CNN
26 pages
ML-Based Real-Time ISL Interpreter
No ratings yet
ML-Based Real-Time ISL Interpreter
48 pages
Sign Language Recognition Using CNN
No ratings yet
Sign Language Recognition Using CNN
79 pages
Voice-Controlled C Programming Editor
No ratings yet
Voice-Controlled C Programming Editor
2 pages
Block Diagram of Speech Recognition System
No ratings yet
Block Diagram of Speech Recognition System
5 pages
Sign Language Recognition Project Report
No ratings yet
Sign Language Recognition Project Report
35 pages
Deep Learning for Speech-Impaired Kids
No ratings yet
Deep Learning for Speech-Impaired Kids
10 pages
Assistive Device For The Translation From Mexican
No ratings yet
Assistive Device For The Translation From Mexican
14 pages
Silent Speech
No ratings yet
Silent Speech
10 pages
Speech Therapy App for Aphasia Patients
No ratings yet
Speech Therapy App for Aphasia Patients
7 pages
Digital Voicing of Silent Speech Using EMG
No ratings yet
Digital Voicing of Silent Speech Using EMG
10 pages
Real-Time Sign Language Detection Project
No ratings yet
Real-Time Sign Language Detection Project
12 pages
Overview of Speech Recognition Systems
100% (2)
Overview of Speech Recognition Systems
19 pages
Lip Movement Detection for Communication
No ratings yet
Lip Movement Detection for Communication
5 pages
Hand Gesture Recognition System
No ratings yet
Hand Gesture Recognition System
5 pages
Example Research Proposal2
No ratings yet
Example Research Proposal2
7 pages
Understanding Speech Recognition Technology
100% (1)
Understanding Speech Recognition Technology
20 pages
Silent Sound Technology Overview
No ratings yet
Silent Sound Technology Overview
4 pages
Speech Recognition Application Report
No ratings yet
Speech Recognition Application Report
83 pages
Review of Speech Recognition Methods
No ratings yet
Review of Speech Recognition Methods
7 pages
Sign Language Detection Project Report
No ratings yet
Sign Language Detection Project Report
51 pages
Project File
No ratings yet
Project File
18 pages
Deep Learning for Speech Recognition
No ratings yet
Deep Learning for Speech Recognition
8 pages
Silent Sound Technology Overview
No ratings yet
Silent Sound Technology Overview
19 pages
AI-Based Silent Speech Recognition System
No ratings yet
AI-Based Silent Speech Recognition System
3 pages
Google
No ratings yet
Google
24 pages
Fine-Tuning ResNet18 for CIFAR10 Accuracy
No ratings yet
Fine-Tuning ResNet18 for CIFAR10 Accuracy
9 pages
Monte Carlo Analysis in Finance
No ratings yet
Monte Carlo Analysis in Finance
52 pages
IGCSE Additional Maths Syllabus 2025-2027
No ratings yet
IGCSE Additional Maths Syllabus 2025-2027
9 pages
Engineering Mathematics III Overview
No ratings yet
Engineering Mathematics III Overview
31 pages
Data and Network Security Quiz Answers
No ratings yet
Data and Network Security Quiz Answers
4 pages
Hybrid Swarm Optimization with PSO
No ratings yet
Hybrid Swarm Optimization with PSO
5 pages
Permutations and Combinations
No ratings yet
Permutations and Combinations
17 pages
Mathematics Periodic Test Paper - XII
No ratings yet
Mathematics Periodic Test Paper - XII
10 pages
TweepFake: Detecting Deepfake Tweets
No ratings yet
TweepFake: Detecting Deepfake Tweets
19 pages
Skillovilla Data Analytics Course Overview
No ratings yet
Skillovilla Data Analytics Course Overview
40 pages
Wavelet Detection in Biosignal Processing
No ratings yet
Wavelet Detection in Biosignal Processing
26 pages
NYU Calculus 3 Problem Set 3 Solutions
No ratings yet
NYU Calculus 3 Problem Set 3 Solutions
3 pages
Simulated Annealing in Optimization
No ratings yet
Simulated Annealing in Optimization
16 pages
Efficient Concurrent Filters for Economic Trends
No ratings yet
Efficient Concurrent Filters for Economic Trends
83 pages
Understanding Quantile Regression Methods
No ratings yet
Understanding Quantile Regression Methods
16 pages
Graph-Based SLAM Overview
No ratings yet
Graph-Based SLAM Overview
13 pages
Understanding Queue Types and Operations
No ratings yet
Understanding Queue Types and Operations
13 pages
A Computer Vision-Based Automatic System For Egg G
No ratings yet
A Computer Vision-Based Automatic System For Egg G
19 pages
R's Impact on Insurance Data Analytics
No ratings yet
R's Impact on Insurance Data Analytics
8 pages
Advances in IoT and Security With Computational Intelligence
No ratings yet
Advances in IoT and Security With Computational Intelligence
411 pages
ML Model for Depression & Anxiety Prediction
No ratings yet
ML Model for Depression & Anxiety Prediction
9 pages
Multilevel Monte Carlo Method Overview
No ratings yet
Multilevel Monte Carlo Method Overview
27 pages
Non-Parametric News Impact Curve Model
No ratings yet
Non-Parametric News Impact Curve Model
45 pages
Triple Booster Indicator Script
No ratings yet
Triple Booster Indicator Script
2 pages
PRAM Algorithms for Parallel Computing
No ratings yet
PRAM Algorithms for Parallel Computing
15 pages
Graph Theory for Image Clustering
No ratings yet
Graph Theory for Image Clustering
5 pages
Combatting Fake News with ML Techniques
No ratings yet
Combatting Fake News with ML Techniques
7 pages
AI Project Cycle and Ethical Frameworks
No ratings yet
AI Project Cycle and Ethical Frameworks
10 pages
Reinforcement Learning for Helicopter Aerobatics
No ratings yet
Reinforcement Learning for Helicopter Aerobatics
8 pages
Data Analytics Techniques Overview
No ratings yet
Data Analytics Techniques Overview
16 pages

Machine Learning Speech Aid Project

Uploaded by

Machine Learning Speech Aid Project

Uploaded by

VISVESVARAYA TECHNOLOGICAL UNIVERSITY

BELAGAVI, KARNATAKA – 590018

02 A wearable graphene strain Dafydd Ravenscroft, Project aimed at designing a wearable

 The dataset contains EMG signals for the letters A, E, I,

Linear Discriminant Linear Line

Quadratic Discriminant Quadratic Line

Native Bayes Independent Predictors

Tree Split into large sets

K Neighbors Closest data point

Ensemble Combining models

Support vector Machine Line with boundary

Silent Speech EMG Split Dataset to Train

Building a hardware Detect the signals Testing Sensor for

Using and testing Part Pattern recognition of

ML Model Train Test

Building WEB interface

Building WEB interface

You might also like