B.S.A.
COLLEGE OF ENGINEERING AND TECHNOLOGY, MATHURA
Department of Computer Science & Engineering
Mini Project Report on
Simple Voice Assistant Using Python
Submitted By:
Vimal Agrawal (Roll No. 2400650100134)
Saurabh Kumar (Roll No. 2400650100114)
B.Tech (CSE), 2nd Year
Session: 2025–26
Faculty Guide: Mrs. Sonal Yadav
DECLARATION
I hereby declare that the mini project entitled “Simple Voice Assistant Using Python”
submitted to B.S.A. College of Engineering and Technology, Mathura, is my original work
carried out under the guidance of Mrs. Sonal Yadav. This report has not been submitted
earlier for the award of any degree or diploma in any institution.
Date: 21-11-2025
Signature of Students: Vimal Agrawal, Saurabh Kumar
CERTIFICATE
This is to certify that the mini project report titled “Simple Voice Assistant Using Python”
has been successfully completed by Vimal Agrawal, student of B.Tech (Computer Science &
Engineering), 2nd Year, during the academic session 2025–26 under my supervision.
The work embodied in this report is original and fulfils the requirements for the Mini
Project as prescribed by B.S.A. College of Engineering and Technology, Mathura.
Date: 21-11-2025 (Signature of Faculty Guide) ________________
Name of Faculty Guide: Mrs. Sonal Yadav
ACKNOWLEDGEMENT
I would like to express my sincere gratitude to Mrs. Sonal Yadav, my project guide, for her
constant support, guidance and encouragement throughout the development of this mini
project, “Simple Voice Assistant Using Python”.
I am also thankful to the Department of Computer Science & Engineering, B.S.A. College of
Engineering and Technology, Mathura, for providing the necessary facilities, lab
environment and resources. I extend my thanks to my friends and classmates for their
valuable suggestions and feedback, which helped me improve this project.
ABSTRACT
This mini project presents the design and implementation of a simple voice assistant using
Python. The main objective of this system is to allow a user to perform basic tasks on a
computer using voice commands instead of manual input. The assistant can listen through
the microphone, convert speech to text, understand simple commands, and perform actions
such as opening applications or websites (e.g., Google, Instagram), searching the web, and
responding using text-to-speech.
The project uses Python libraries such as speech_recognition for speech-to-text conversion,
pyttsx3 for text-to-speech output, webbrowser for opening URLs, and other standard
modules for integrating system operations. The overall workflow of the assistant is based
on the cycle: Listen → Recognize → Process → Respond.
This work demonstrates the practical application of basic concepts of Artificial Intelligence,
Human–Computer Interaction, and automation. It provides a hands-free, user-friendly way
to execute simple tasks and serves as a foundation for building more advanced intelligent
voice assistants in the future.
TABLE OF CONTENTS
CHAPTER 1: INTRODUCTION
CHAPTER 2: OBJECTIVES
CHAPTER 3: PROBLEM STATEMENT
CHAPTER 4: PROPOSED SOLUTION
CHAPTER 5: SYSTEM DESIGN OVERVIEW
CHAPTER 6: METHODOLOGY
CHAPTER 7: IMPLEMENTATION
CHAPTER 8: ALGORITHM / FLOW OF WORKING
CHAPTER 9: TECHNOLOGIES USED
CHAPTER 10: TESTING & DEBUGGING
CHAPTER 11: RESULTS & OUTPUT
CHAPTER 12: SCREENSHOTS & INTERFACE DESCRIPTION
CHAPTER 13: FUTURE IMPROVEMENTS
CHAPTER 14: REAL WORLD APPLICATIONS
CHAPTER 15: CONCLUSION
CHAPTER 16: REFERENCES
CHAPTER 1: INTRODUCTION
1.1 Overview
A voice assistant is a software application that can understand spoken commands and
respond through voice or actions. In this mini project, a simple voice assistant is developed
using Python. The assistant listens to the user through a microphone, converts the speech
into text, processes the recognized command, and then either executes an action (like
opening Google, Instagram, or another website) or speaks a response back to the user.
This project helps in understanding concepts such as speech recognition, text-to-speech,
automation, and event-driven programming.
CHAPTER 2: OBJECTIVES
2.1 Primary Objectives
- To design and implement a simple Python-based voice assistant.
- To allow the user to perform tasks using voice commands such as opening websites and
performing web searches.
- To integrate speech recognition and text-to-speech in a single application.
2.2 Academic Objectives
- To gain practical knowledge of Python libraries like speech_recognition, pyttsx3, and
webbrowser.
- To understand the flow of Listen → Recognize → Process → Respond.
- To improve debugging, problem solving, and modular programming skills.
CHAPTER 3: PROBLEM STATEMENT
3.1 Problem Definition
Most users interact with computers using a keyboard and mouse, which can be
time-consuming for repetitive tasks. There is a need for a simple, lightweight voice-based
interface that can automate some of these tasks and make the system more convenient to
use.
The problem addressed in this project is: 'How can a basic Python application be designed to
understand voice commands and perform simple system tasks automatically?'
CHAPTER 4: PROPOSED SOLUTION
4.1 Design Approach
The proposed solution is a Python script that acts as a basic voice assistant:
- Uses a microphone as input.
- speech_recognition converts spoken words into text.
- Recognized text is checked for keywords like 'open google' or 'open instagram'.
- Based on the command, appropriate actions are performed using webbrowser and
pyttsx3.
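The keyword check described above can be sketched as a small lookup function. This is only an illustrative sketch: the names SITES and match_command, the exact URLs, and the "search for" phrase are assumptions made for the example and are not taken from the project code.

```python
from urllib.parse import quote_plus

# Hypothetical command table (assumed for illustration):
# spoken phrase -> URL that the assistant should open.
SITES = {
    "open google": "https://www.google.com",
    "open instagram": "https://www.instagram.com",
}


def match_command(text):
    """Return the URL for a recognized phrase, or None if nothing matches."""
    text = text.lower().strip()

    # Simple keyword check: the phrase may appear anywhere in the sentence.
    for phrase, url in SITES.items():
        if phrase in text:
            return url

    # Assumed phrasing for web search: "search for <query>".
    prefix = "search for "
    if text.startswith(prefix):
        query = text[len(prefix):]
        return "https://www.google.com/search?q=" + quote_plus(query)

    return None
```

The returned URL could then be passed to webbrowser.open(), while pyttsx3 speaks a short confirmation. Returning None lets the caller decide how to respond to an unknown command.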
CHAPTER 5: SYSTEM DESIGN OVERVIEW
5.1 Architecture
The system is divided into three layers:
1. Input Layer (Speech Capture)
2. Processing Layer (Command Recognition & Logic)
3. Output Layer (Action + Response)
This layered design makes the system easy to understand and maintain.
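One possible way to express the three layers in code is shown below. It is a sketch under assumptions, not the project's actual structure: the Action class and the functions process and handle_once are hypothetical names. The idea is that the processing layer only decides what should happen, while the input and output layers are passed in as functions.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class Action:
    """Decision made by the processing layer (hypothetical helper type)."""
    reply: str                  # sentence to speak back to the user
    url: Optional[str] = None   # website to open, if any


def process(text: str) -> Action:
    """Processing layer: pure logic, no microphone, browser or speaker."""
    if "open google" in text.lower():
        return Action("Opening Google", "https://www.google.com")
    return Action("Sorry, I did not understand that")


def handle_once(capture: Callable[[], str],
                open_url: Callable[[str], object],
                respond: Callable[[str], object]) -> Action:
    """Pass one command through the three layers."""
    text = capture()            # 1. input layer
    action = process(text)      # 2. processing layer
    if action.url:
        open_url(action.url)    # 3. output layer: action
    respond(action.reply)       # 3. output layer: response
    return action
```

For a quick check without a microphone, the layers can be replaced by simple stand-ins, for example handle_once(lambda: input("> "), webbrowser.open, print) after importing webbrowser.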
CHAPTER 6: METHODOLOGY
6.1 Development Method
- Start with a basic Python script to take input from the microphone.
- Add speech recognition and test recognized text.
- Implement command processing to detect keywords.
- Integrate text-to-speech for spoken responses.
- Test commands and refine accuracy.
CHAPTER 7: IMPLEMENTATION
7.1 Implementation Details
- Use speech_recognition.Recognizer() to recognize audio.
- Use pyttsx3.init() for offline text-to-speech.
- Implement take_command() and process_command() functions.
- Use webbrowser to open websites based on commands.
- Run the assistant in a loop to continuously listen and respond.
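A minimal sketch of the two speech helpers is given below. It assumes the SpeechRecognition and pyttsx3 packages (plus PyAudio for microphone access) are installed, and it assumes Google's free web recognizer (recognize_google) as the backend, since the report does not state which recognizer was used. The timeout values are also assumptions.

```python
def take_command(timeout=5, phrase_time_limit=5):
    """Listen once on the default microphone and return lower-cased text.

    Returns an empty string when nothing usable was heard.
    """
    # Imported inside the function so the rest of the script still loads
    # on a machine where the audio packages are not installed.
    import speech_recognition as sr

    recognizer = sr.Recognizer()
    try:
        with sr.Microphone() as source:
            # Sample background noise briefly to calibrate the threshold.
            recognizer.adjust_for_ambient_noise(source, duration=0.5)
            audio = recognizer.listen(
                source, timeout=timeout, phrase_time_limit=phrase_time_limit
            )
        # Assumed backend: Google's web API, which needs an internet connection.
        return recognizer.recognize_google(audio).lower()
    except (sr.WaitTimeoutError,    # nobody spoke before the timeout
            sr.UnknownValueError,   # speech could not be understood
            sr.RequestError,        # recognition service unreachable
            OSError):               # no usable microphone found
        return ""


def speak(text):
    """Say the text aloud using the offline pyttsx3 engine."""
    import pyttsx3

    engine = pyttsx3.init()
    engine.say(text)
    engine.runAndWait()   # blocks until the sentence has been spoken
```

A typical call would be text = take_command(), followed by speak("You said " + text) when text is not empty. Creating the engine once and reusing it would be a reasonable refinement for a long-running loop.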
CHAPTER 8: ALGORITHM / FLOW OF WORKING
Algorithm: Simple Voice Assistant (Python)
1. Start
2. Initialize text-to-speech and recognizer.
3. Speak a welcome message.
4. Loop:
   a. Listen to user.
   b. Convert speech to text.
   c. Process recognized text and perform action.
   d. Speak response.
5. End.
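The algorithm above can be sketched as a single loop function. The function name run_assistant, the welcome and goodbye sentences, and the exit words are assumptions added for illustration; in particular, the algorithm does not say how the loop ends, so an exit command is assumed here. The listen, act and speak steps are passed in as functions so the loop can be tried without a microphone.

```python
def run_assistant(listen, act, speak, exit_words=("exit", "quit", "stop")):
    """Run the Listen -> Recognize -> Process -> Respond cycle.

    listen() returns the recognized text (steps 4a and 4b),
    act(text) performs the action and returns a reply (step 4c),
    speak(reply) says the reply aloud (step 4d).
    """
    speak("Hello, how can I help you?")      # step 3: welcome message

    while True:                              # step 4: main loop
        text = listen()
        if not text:
            continue                         # nothing recognized, listen again
        if text in exit_words:               # assumed way to reach step 5
            speak("Goodbye")
            break
        reply = act(text)
        speak(reply)
```

With keyboard input in place of the microphone, the loop can be tried as run_assistant(input, lambda t: "You said " + t, print).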
CHAPTER 9: TECHNOLOGIES USED
Technologies Used:
- Programming Language: Python
- Libraries: speech_recognition, pyttsx3, webbrowser, os, requests (optional)
- Hardware: Computer/Laptop, Microphone/Headset
- Software: Python 3.x, code editor, required Python libraries.
CHAPTER 10: TESTING & DEBUGGING
Testing & Debugging:
- Tested commands like 'open google', 'open instagram'.
- Used print statements for debugging recognized text.
- Handled exceptions for microphone errors and unclear speech.
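Command handling can also be checked without a microphone or a real browser. The sketch below uses unittest.mock from the standard library to replace webbrowser.open during the check. The process_command body shown here is a hypothetical stand-in written for this example, not the project's actual function.

```python
import webbrowser
from unittest import mock


def process_command(text):
    """Hypothetical stand-in for the assistant's command processor."""
    if "open google" in text:
        webbrowser.open("https://www.google.com")
        return "Opening Google"
    if "open instagram" in text:
        webbrowser.open("https://www.instagram.com")
        return "Opening Instagram"
    return "Sorry, I did not understand that"


def check_commands():
    """Verify the commands while webbrowser.open is replaced by a mock."""
    with mock.patch("webbrowser.open") as fake_open:
        # A known command should open exactly one URL.
        assert process_command("open google") == "Opening Google"
        fake_open.assert_called_once_with("https://www.google.com")

        # An unknown command should not open anything.
        fake_open.reset_mock()
        reply = process_command("what is the weather")
        assert reply == "Sorry, I did not understand that"
        fake_open.assert_not_called()
    return True
```

Calling check_commands() raises an AssertionError if a command behaves unexpectedly, which makes it easier to repeat the same checks after every change than reading print output by hand.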
CHAPTER 11: RESULTS & OUTPUT
Results & Output:
- The assistant successfully listens, recognizes, and executes basic commands.
- Opens websites and speaks confirmations.
- Provides a hands-free way to perform simple tasks.
CHAPTER 12: SCREENSHOTS & INTERFACE DESCRIPTION
Screenshots & Interface Description:
- Screenshot of code editor with program.
- Screenshot of terminal/console showing recognized commands.
- Screenshot of browser window opened by the assistant.
CHAPTER 13: FUTURE IMPROVEMENTS
Future Improvements:
- Add more commands like time, weather, music, etc.
- Integrate GUI for better user interaction.
- Use advanced NLP and APIs for smarter responses.
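As one possible direction for adding commands, the sketch below registers handlers in a dictionary so that a new command such as "time" can be added without changing the matching logic. All names here (HANDLERS, command, tell_time, dispatch) are hypothetical and only illustrate the idea.

```python
from datetime import datetime

# Hypothetical registry: keyword -> function that returns a spoken reply.
HANDLERS = {}


def command(keyword):
    """Decorator that registers a handler under the given keyword."""
    def register(func):
        HANDLERS[keyword] = func
        return func
    return register


@command("time")
def tell_time(now=None):
    """Reply with the current time in 24-hour format."""
    now = now or datetime.now()
    return "The time is " + now.strftime("%H:%M")


def dispatch(text):
    """Call the first handler whose keyword appears in the text."""
    lowered = text.lower()
    for keyword, handler in HANDLERS.items():
        if keyword in lowered:
            return handler()
    return None
```

A weather or music command could be registered in the same way, each as a separate function with its own keyword.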
CHAPTER 14: REAL WORLD APPLICATIONS
Real World Applications:
- Personal desktop assistant.
- Assistance for users with physical disabilities.
- Base for home automation and advanced AI assistants.
CHAPTER 15: CONCLUSION
Conclusion:
This mini project demonstrates the creation of a simple voice assistant using Python. It
combines speech recognition and text-to-speech to perform practical tasks and helps in
understanding automation, event-driven programming, and modular code design.
CHAPTER 16: REFERENCES
References:
1. Python Documentation — [Link]
2. SpeechRecognition Documentation
3. pyttsx3 Documentation
4. GeeksforGeeks — Python speech recognition and TTS tutorials
5. Various YouTube tutorials and Stack Overflow discussions.