PyDub: Audio Processing in Python

Introduction to PyDub
SPOKEN LANGUAGE PROCESSING IN PYTHON

Daniel Bourke
Machine Learning Engineer/YouTube Creator
Installing PyDub
$ pip install pydub

If using files other than .wav, install ffmpeg via [Link]

PyDub's main class, AudioSegment
# Import PyDub main class
from pydub import AudioSegment

# Import an audio file

wav_file = AudioSegment.from_file(file="wav_file.wav", format="wav")

# Format parameter only for readability

wav_file = AudioSegment.from_file(file="wav_file.wav")

type(wav_file)

pydub.audio_segment.AudioSegment

Playing an audio file
# Install simpleaudio for wav playback
$ pip install simpleaudio

# Import play function

from pydub.playback import play

# Import audio file

wav_file = AudioSegment.from_file(file="wav_file.wav")

# Play audio file

play(wav_file)

Audio parameters
# Import audio files
wav_file = AudioSegment.from_file(file="wav_file.wav")
two_speakers = AudioSegment.from_file(file="two_speakers.wav")
# Check number of channels
wav_file.channels, two_speakers.channels

1, 2

wav_file.frame_rate

48000

Audio parameters
# Find the number of bytes per sample
wav_file.sample_width

# Find the max amplitude

wav_file.max

8488

Audio parameters
# Duration of audio file in milliseconds
len(wav_file)

3284
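
The channel count, frame rate, sample width and duration are all derived from what is stored in the WAV file itself. As a rough cross-check that does not rely on PyDub, the sketch below writes a short sine tone with Python's standard `wave` module and reads the same values back. The helper names `make_tone_wav` and `describe_wav`, and the tone settings, are illustrative assumptions rather than part of the course material.

```python
import io
import math
import struct
import wave


def make_tone_wav(duration_s=0.5, frame_rate=16000, freq_hz=440.0, amplitude=8000):
    """Return the bytes of a mono, 16-bit WAV file containing a sine tone."""
    n_frames = int(duration_s * frame_rate)
    samples = [
        int(amplitude * math.sin(2 * math.pi * freq_hz * i / frame_rate))
        for i in range(n_frames)
    ]
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_out:
        wav_out.setnchannels(1)       # mono
        wav_out.setsampwidth(2)       # 2 bytes per sample (16-bit)
        wav_out.setframerate(frame_rate)
        wav_out.writeframes(struct.pack(f"<{n_frames}h", *samples))
    return buffer.getvalue()


def describe_wav(wav_bytes):
    """Read the basic audio parameters back from WAV bytes."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav_in:
        n_frames = wav_in.getnframes()
        frame_rate = wav_in.getframerate()
        return {
            "channels": wav_in.getnchannels(),
            "frame_rate": frame_rate,
            "sample_width": wav_in.getsampwidth(),
            # Duration in milliseconds, the unit len() reports for an AudioSegment
            "duration_ms": round(1000 * n_frames / frame_rate),
        }


params = describe_wav(make_tone_wav())
print(params)
```

The duration is simply the number of frames divided by the frame rate, converted to milliseconds.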

Changing audio parameters
# Change ATTRIBUTENAME of AudioSegment to x
changed_audio_segment = audio_segment.set_ATTRIBUTENAME(x)

# Change sample width to 1

wav_file_width_1 = wav_file.set_sample_width(1)
wav_file_width_1.sample_width
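
Sample width is the number of bytes per sample, so lowering it shrinks the range of amplitude values a sample can hold. A minimal sketch of that relationship, assuming signed integer samples (the function name `amplitude_range` is hypothetical):

```python
def amplitude_range(sample_width):
    """Return (min, max) for signed integer samples of the given byte width."""
    bits = 8 * sample_width
    return -(2 ** (bits - 1)), 2 ** (bits - 1) - 1


# Compare 1-byte, 2-byte and 4-byte samples
for width in (1, 2, 4):
    print(width, amplitude_range(width))
```

Going from a width of 2 to a width of 1 leaves only 256 distinct levels, which is why reducing the sample width usually lowers audio quality.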

Changing audio parameters
# Change sample rate
wav_file_16k = wav_file.set_frame_rate(16000)
wav_file_16k.frame_rate

16000

# Change number of channels

wav_file_1_channel = wav_file.set_channels(1)
wav_file_1_channel.channels

Let's practice!

Manipulating audio files with PyDub

Turning it down to 11
# Import audio file
wav_file = AudioSegment.from_file("wav_file.wav")
# Minus 60 dB
quiet_wav_file = wav_file - 60

# Try to recognize quiet audio

recognizer.recognize_google(quiet_wav_file)

UnknownValueError:

Increasing the volume
# Increase the volume by 10 dB
louder_wav_file = wav_file + 10

# Try to recognize
recognizer.recognize_google(louder_wav_file)

this is a wav file
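
Adding or subtracting a number changes the volume in decibels, which is a logarithmic unit. The sketch below shows the usual conversion from a dB gain to an amplitude multiplier and applies it to a few made-up 16-bit samples. The function names and sample values are assumptions for illustration, and this is not PyDub's own code.

```python
def db_to_amplitude_ratio(gain_db):
    """Convert a gain in decibels to an amplitude multiplier."""
    return 10 ** (gain_db / 20)


def apply_gain(samples, gain_db, sample_width=2):
    """Scale integer samples by a dB gain, clipping to the valid range."""
    ratio = db_to_amplitude_ratio(gain_db)
    limit = 2 ** (8 * sample_width - 1)
    return [max(-limit, min(limit - 1, round(s * ratio))) for s in samples]


samples = [0, 1000, -2000, 8488]
quiet = apply_gain(samples, -60)   # amplitudes shrink to 1/1000
louder = apply_gain(samples, 10)   # amplitudes grow by roughly 3.16x
print(quiet)
print(louder)
```

At -60 dB almost nothing of the signal is left, which is consistent with the recognizer raising an error on the quiet file.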

This all sounds the same
# Import AudioSegment, normalize and play
from pydub import AudioSegment
from pydub.effects import normalize
from pydub.playback import play

# Import uneven sound audio file

loud_quiet = AudioSegment.from_file("loud_quiet.wav")
# Normalize the sound levels
normalized_loud_quiet = normalize(loud_quiet)

# Check the sound

play(normalized_loud_quiet)
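
Conceptually, peak normalization finds the loudest sample and applies one gain to the whole clip so that this peak lands just below the maximum possible amplitude. The sketch below illustrates that idea on a plain list of samples. The function name `peak_normalize` and the 0.1 dB headroom value are assumptions for illustration and should not be read as PyDub's implementation.

```python
def peak_normalize(samples, sample_width=2, headroom_db=0.1):
    """Scale samples so the loudest one sits headroom_db below full scale."""
    max_possible = 2 ** (8 * sample_width - 1)
    peak = max(abs(s) for s in samples)
    if peak == 0:
        return list(samples)  # silence: nothing to scale
    target_peak = max_possible * 10 ** (-headroom_db / 20)
    scale = target_peak / peak
    # Clip to the signed integer range after scaling
    return [
        max(-max_possible, min(max_possible - 1, int(round(s * scale))))
        for s in samples
    ]


quiet_clip = [0, 100, -200, 50]
normalized_clip = peak_normalize(quiet_clip)
print(normalized_clip)
```

Because a single gain is applied to every sample, the relative levels inside the clip stay the same; only the overall loudness changes.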

Remixing your audio files
# Import audio with static at start
static_at_start = AudioSegment.from_file("static_at_start.wav")

# Remove the static via slicing

no_static_at_start = static_at_start[5000:]

# Check the new sound

play(no_static_at_start)
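
Slice indices on an AudioSegment are in milliseconds, so `[5000:]` drops the first five seconds. A minimal sketch of how a millisecond slice maps onto frame positions, using a plain list in place of real audio (`slice_ms` and the 8000 Hz rate are hypothetical):

```python
def slice_ms(frames, frame_rate, start_ms=0, end_ms=None):
    """Return the frames between start_ms and end_ms (milliseconds)."""
    start = int(start_ms * frame_rate / 1000)
    end = len(frames) if end_ms is None else int(end_ms * frame_rate / 1000)
    return frames[start:end]


frame_rate = 8000
frames = list(range(6 * frame_rate))          # six seconds of placeholder frames
trimmed = slice_ms(frames, frame_rate, 5000)  # keep everything after 5 seconds
print(len(trimmed), trimmed[0])
```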

Remixing your audio files
# Import two audio files
wav_file_1 = AudioSegment.from_file("wav_file_1.wav")
wav_file_2 = AudioSegment.from_file("wav_file_2.wav")

# Combine the two audio files

wav_file_3 = wav_file_1 + wav_file_2

# Check the sound

play(wav_file_3)

# Combine two wav files and make the combination louder

louder_wav_file_3 = wav_file_1 + wav_file_2 + 10
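
The `+` operator means two different things here: with another audio segment it concatenates, and with a number it changes the volume. Because Python evaluates `+` from left to right, `wav_file_1 + wav_file_2 + 10` joins the files first and then boosts the result. The toy class below mimics that dispatch on labels instead of audio; `ToyClip` is a hypothetical stand-in, not a PyDub class.

```python
class ToyClip:
    """Tracks (name, gain_db) pairs instead of real audio data."""

    def __init__(self, parts):
        self.parts = list(parts)

    def __add__(self, other):
        if isinstance(other, ToyClip):
            # clip + clip: concatenate
            return ToyClip(self.parts + other.parts)
        # clip + number: change the gain of everything in this clip
        return ToyClip([(name, gain + other) for name, gain in self.parts])


clip_1 = ToyClip([("wav_file_1", 0)])
clip_2 = ToyClip([("wav_file_2", 0)])

both_louder = clip_1 + clip_2 + 10       # (clip_1 + clip_2) + 10
second_louder = clip_1 + (clip_2 + 10)   # only the second clip is boosted
print(both_louder.parts)
print(second_louder.parts)
```

Parentheses make the intent explicit when only one of the files should be made louder.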

Splitting your audio
# Import phone call audio
phone_call = AudioSegment.from_file("phone_call.wav")
# Find number of channels
phone_call.channels

# Split stereo to mono

phone_call_channels = phone_call.split_to_mono()
phone_call_channels

[<pydub.audio_segment.AudioSegment>, <pydub.audio_segment.AudioSegment>]

Splitting your audio
# Find number of channels of first list item
phone_call_channels[0].channels

# Recognize the first channel

phone_call_channel_1 = phone_call_channels[0]
recognizer.recognize_google(phone_call_channel_1)

the pydub library is really useful
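
Stereo audio is commonly stored with left and right samples interleaved, so splitting to mono amounts to taking every other sample. A minimal sketch on a plain list, assuming that interleaved layout (`split_stereo` is a hypothetical helper):

```python
def split_stereo(interleaved):
    """Split [L0, R0, L1, R1, ...] into separate left and right lists."""
    if len(interleaved) % 2:
        raise ValueError("stereo data must contain an even number of samples")
    return interleaved[0::2], interleaved[1::2]


left, right = split_stereo([1, -1, 2, -2, 3, -3])
print(left, right)
```

Each resulting list holds one speaker's channel, which is what makes transcribing the two sides of a phone call separately possible.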

Let's code!

Converting and saving audio files with PyDub

Exporting audio files
from pydub import AudioSegment

# Import audio file

wav_file = AudioSegment.from_file("wav_file.wav")
# Increase by 10 decibels
louder_wav_file = wav_file + 10
# Export louder audio file
louder_wav_file.export(out_f="louder_wav_file.wav", format="wav")

<_io.BufferedRandom name='louder_wav_file.wav'>

Reformatting and exporting multiple audio files
import os

def make_wav(wrong_folder_path, right_folder_path):
    # Loop through wrongly formatted files
    for file in os.scandir(wrong_folder_path):
        # Only work with files with audio extensions we're fixing
        if file.path.endswith(".mp3") or file.path.endswith(".flac"):
            # Create the new .wav filename
            out_file = right_folder_path + os.path.splitext(os.path.basename(file.path))[0] + ".wav"
            # Read in the audio file and export it in wav format
            AudioSegment.from_file(file.path).export(out_file, format="wav")
            print(f"Creating {out_file}")

Reformatting and exporting multiple audio files
# Call our new function
make_wav("data/wrong_formats/", "data/right_format/")

Creating data/right_format/wav_file.wav
Creating data/right_format/flac_file.wav
Creating data/right_format/mp3_file.wav
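
Before converting a whole folder, it can help to check which files would be picked up and where they would be written. The sketch below does only that path planning with the standard library, on a temporary folder of empty placeholder files. The name `plan_conversions`, the file names, and the use of `os.path.join` are assumptions for illustration; no audio is read or written.

```python
import os
import tempfile

AUDIO_EXTENSIONS = (".mp3", ".flac")


def plan_conversions(wrong_folder_path, right_folder_path):
    """Return (source, destination) path pairs without converting anything."""
    pairs = []
    for entry in sorted(os.scandir(wrong_folder_path), key=lambda e: e.name):
        # Skip directories and files with other extensions
        if entry.is_file() and entry.name.lower().endswith(AUDIO_EXTENSIONS):
            stem = os.path.splitext(entry.name)[0]
            destination = os.path.join(right_folder_path, stem + ".wav")
            pairs.append((entry.path, destination))
    return pairs


with tempfile.TemporaryDirectory() as tmp:
    # Create empty placeholder files to scan
    for name in ("mp3_file.mp3", "flac_file.flac", "notes.txt"):
        open(os.path.join(tmp, name), "w").close()
    planned = plan_conversions(tmp, "data/right_format")

destinations = [dst for _, dst in planned]
print(destinations)
```

Using `os.path.join` avoids depending on a trailing slash in the output folder name, which plain string concatenation requires.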

Manipulating and exporting
def make_no_static_louder(static_quiet, louder_no_static):
    # Loop through files with static and quiet (already in wav format)
    for file in os.scandir(static_quiet):
        # Create new file path
        out_file = louder_no_static + os.path.splitext(os.path.basename(file.path))[0] + ".wav"
        # Read the audio file
        audio_file = AudioSegment.from_file(file.path)
        # Remove the first 3.1 seconds (3100 ms), add 10 decibels, and export
        (audio_file[3100:] + 10).export(out_file, format="wav")
        print(f"Creating {out_file}")

Manipulating and exporting
# Remove static and make louder
make_no_static_louder("data/static_quiet/", "data/louder_no_static/")

Creating data/louder_no_static/[Link]
Creating data/louder_no_static/[Link]
Creating data/louder_no_static/[Link]

Your turn!