0% found this document useful (0 votes)

3 views5 pages

Guitar Effects Estimation with DDSP

Summary of SRIP 2024

Uploaded by

puckandpaint

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

3 views5 pages

Guitar Effects Estimation with DDSP

Summary of SRIP 2024

Uploaded by

puckandpaint

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Wittemann 1

Title:

Blind estimation of guitar AFX's using DDSP by Luke Wittemann

Abstract:

Given the ubiquity of audio effects in the creation and production of music, there exists a
necessity to efficiently estimate the effects used in order to recreate a sound or tone. While
traditional machine learning techniques have produced some promising results, a massive
amount of properly labeled data is needed and extrapolation of unseen configurations still
leaves something to be desired. By using DDSP, a developing field which integrates
differentiable modules of traditional DSP techniques into neural networks, both the complexity of
the models and the vastness of the required data can be reduced appreciably. Additionally, the
nature of DDSP is such that it can be used for a large variety of tasks ranging from classification
to timbre transfer between datasets. This study plans to explore the possibility of DDSP to
estimate an entire guitar effects chain, starting with the pickup(s) used and electronics settings,
continuing to varying numbers of cascaded effects.

Progress:

Thankfully, I got the opportunity to prepare for the summer during a quarter of ECE 199
at UCSD. Coming from an audio signal processing background, I wanted to research something
that was a fusion of existing digital signal processing(DSP) techniques with more modern
machine learning(ML) ones. I used the quarter to narrow down my area of research by reading
numerous research papers in order to survey areas where work has and has not been done.
One area I was considering during this time was effects estimation, where you have a system
capable of identifying effects such as reverb or equalization used to create a given audio
sample. It wasn’t until I came across Google’s DDSP project that I really saw a clear path
forward. DDSP is a set of differentiable digital synthesizers and effects whose parameters can
become trainable features inside a neural network. It was this ability to combine a trainable
source instrument in the form of an auto-encoder and differentiable effects that finally inspired
me to choose the estimation of an entire guitar signal chain as my area of research. This is
because, in essence, the different sounds that a guitar’s different pickups produce can be
thought of as distinct instruments in the way while playing the same note, the presence and
distribution of the harmonics are the prime factor in differentiation of the pickups. The DDSP
autoencoder takes audio and learns how to recreate it using the sum of a harmonic synthesizer
and an additive noise synthesizer. From my experience, the harmonic synth captures the
majority of the harmonic content of notes while the noise synth captures more cacophonous and
unpredictable moments such as the very start of a plucked note. By default, the autoencoder
used a loss function that minimizes spectral magnitude and log magnitude error.
Wittemann 2

Before I could start training, I needed to record some sample audio. I used the USB out
on my Boss Katana to send the audio to my computer, where it was recorded using REAPER.
The amp was left on the clean channel with the EQ and gain controls set at 12 o’clock. Though
the Fender Stratocaster I used to record the samples had five “positions”, I only recorded tracks
for the individual pickup positions(1,3,5). This is because positions 2 and 4 are simply sums of
the adjacent positions(2 is the sum 1 and 3 while 4 is the sum of 3 and 5), so these could be
created after the fact using samples from the three distinct pickup positions. For my first training
attempt, I recorded a little over 20 minutes of audio for each pickup position, trying my best to
play the same thing, melodically and dynamically, for all three pickups.

When first trying to use DDSP, I started off on Google Colab trying to run DDSP’s sample
notebooks. To my surprise, these didn’t run at all. I eventually found a work around in the form
of specifying certain python packages to be installed before running it. This worked, but now the
model training was being done without a GPU, meaning it would take days to generate each
model. After some digging, I found that this issue was an incompatibility between Google
Colab’s current version of Linux(Ubuntu 22.04) and the Nvidia CUDA toolkit, so I decided to try
training the models through a Linux WSL on my personal PC using an RTX 3080. It was during
this time that I gained a familiarity with both basic Linux commands and the underlying structure
of Python and how its package system works. After much trial and error, I finally found the
winning formula of Linux, CUDA, Python, and Python package versions to start training models.
Now, with my CUDA-enabled desktop, I was able to train models in 1-2 hours instead of the 1-2
days on Google Colab. I evaluated the amount of training steps hyper-parameter by looking at
the spectral loss, the magnitude of the noise synthesis(one of the most direct signs of overfitting
for this model), and by simply listening to the resynthesized audio. This led me to the conclusion
that, using my dataset, around 5000 training steps was optimal to minimize both loss and
overfitting.

Sample from training data(left) and resynthesis after training(right)

Wittemann 3

To evaluate the models, I used the three trained models to resynthesize a never before
seen(not in the training data) audio clip. Then, the recreation that was the most similar to the
sample would be the guess. Initially, I used a simple magnitude or log magnitude spectral error
as a means of comparison, but found this to be inconsistent. The magnitude error created a
tendency for the signal with the closest average spectral power to the original to be the guess
while the log magnitude error would be too ignorant of more minute spectral differences
between the recreations and the original sample. My first thought was to use a spectral
threshold such that the frequencies between harmonics which contained little to no information
were ignored in spectral comparisons. While initially promising using hand selected threshold
values, this ended up working poorly. This is because automating the threshold value with a
statistic such as the spectral average proved to be inconsistent at best. I also noticed there were
times in the original sample where the harmonics would die off in a way that was not being
represented by the recreations. Using a threshold based off of the original, these moments
where the models were clearly failing would be ignored. In other words, some spots where all
spectrums were near zero would be ignored as intended, but there were also areas where the
spectrums did differ and this was also being ignored unintentionally.

Next, I decided to threshold the spectral comparisons not by a magnitude, but by the
fundamental frequency confidence provided by the DDSP encoder. I did this because the start
of notes, which would largely be represented by noise synthesis, is not useful information in
comparing the models as the original and recreations matched almost exactly. The spot where
valid comparisons could be made was after the string vibration settled to a fundamental
frequency and its harmonics. Thankfully, this corresponds almost exactly to when the F0
confidence from the encoder reached a certain level(0.8-0.95 out of 1 max in practice). Again,
this seemed effective at first but had its own pitfalls. While more accurate than the initial mag
and log mag comparisons, the F0-confidence gated comparisons still had the same tendencies
to focus too much on average spectral energy or ignore smaller spectral differences. It was at
this point that I went back and recorded a second set of training and validation samples, this
time focusing more on actual guitar playing instead of just playing every note on the fretboard at
different loudnesses. While this did indeed improve the models, the pickup positions were still
just too similar for accurate differentiation using this method and the erroneous non-decaying
harmonics of the recreations were still present. Thankfully, it was at this time that I got the
opportunity to present my project at UCSD’s SRC 2024 and for my PI Tara Javidi and all of her
research students. This gave me a chance to step back a little and get some valuable feedback.
It was in the meeting with my PI that one of her students recommended using the DDSP losses
used for model training as opposed to my own.

For training, the DDSP autoencoder used a loss that is an evenly weighted linear
combination of the spectral magnitude loss and the spectral log magnitude loss. As soon as I
moved to this, my guessing accuracy started to become more consistent. Not previously
mentioned is the fact that for each recreation, both the overall pitch and loudness(high level
abstraction of mag and log mag) could be adjusted manually to tune the results. Previously, I
would have to set and sometimes adjust these manually to make the comparisons between the
recreations more valid. To prevent these adjustable parameters from interfering with the
guessing, I developed a system to only make a recreation which had the pitch aligned with the
Wittemann 4

original sample and the loudness error minimized between the recreation and the original. With
the optimized resynthesis, I tested the guessing with the numerous different losses provided by
DDSP. These losses included magnitude, log magnitude, loudness, delta time, delta frequency,
and a cumulative sum of frequencies. Using the optimized resynthesis, fourteen out of the
fifteen 45-second samples(5 for each pickup) were able to be guessed by at least one of these
metrics; with the cumulative sum of frequencies loss being the most individually accurate metric,
being correct nine out of the fifteen samples.

Pitch and loudness optimized sample resynthesis using trained DDSP instrument model. Note on mask(left),
loudness(center), and pitch/fundamental frequency(right)

Losses: L1 L2
→

Sample: Ma Lg Lo Ti Fr Cs M L Lo Ti Fr Cs

1.1 1 1 1

1.2 1 1 1 1 1 1 1

1.3 1 1 1 1 1 1 1 1 1 1 1

1.4 1 1 1 1 1 1 1 1 1

1.5 1 1 1 1 1 1

3.1 1 1 1 1 1 1

3.2 1 1

3.3 1 1 1 1

3.4 1

3.5

5.1 1 1 1 1 1 1 1 1 1

5.2 1 1 1

5.3 1 1

5.4 1 1 1 1 1 1 1

5.5 1 1 1 1 1 1 1 1 1

Total 8 6 3 7 8 9 8 6 3 5 8 8

Loss testing results using pitch and L2 loudness optimized resynth where a 1 represents a correct guess using only
that metric
Wittemann 5

While I would have loved to continue the effects estimation, the summer is over and I am
happy that I still got to make good progress on what I see as the most proprietary part of the
project. If this work were to be continued, I estimate the 16 kHz sample rate as the single
biggest current bottleneck to performance. This is because the guitar, especially with effects, is
capable of outputting frequencies right up the the limit of human perception. It is doubtless that
the 16 kHz sample rate cuts out a significant portion of the signals information in order to
integrate it more efficiently into the neural network. Another issue with recreating audio using
this method was the non-decaying harmonics, or the tendency for the harmonics inside the
recreations to not decay like they do in the original samples, sometimes not at all. Whether this
was a problem with the model or with my implementation, I was never able to elucidate. In either
case, I believe a system could be implemented to clip the non-decaying harmonics using the
loudness signal, as there is a strong correlation between the decay of the loudness and the
decay of the harmonics.

Conclusion

Though I was unable to attain my original goal of complete effects estimation, spending
my summer on this project under UCSD’s SRIP was as rewarding as it was challenging. Before
this summer, I had never used python, linux, or a professional DAW. Throughout the project, I
used python and tensorflow to train instrument models using my own CUDA-enabled desktop
through a linux WSL. For the training and testing data, I personally recorded hours of sample
audio using REAPER. To analyze the accuracy of these models, I developed a system which
normalized the loudness and pitch of the different models’ recreations such that they could be
compared spectrally to an original sample. While this method has its disadvantages, this project
has confirmed the possibility for instrument models to be integrated into effects estimation
systems in order to reduce the amounts of required training data. At this point, I would like to
thank Tara Javidi at UCSD for sponsoring my research under the SRIP and continually providing
insightful feedback.

Common questions

The main challenges in the blind estimation of guitar audio effects using DDSP include the requirement for a massive amount of properly labeled data in traditional machine learning and the lack of accurate extrapolation for unseen configurations . The document also highlights issues with non-decaying harmonics in recreations and limitations imposed by a 16 kHz sample rate, which cuts out a significant portion of the signal's information . Additionally, comparisons based on spectral magnitude or log magnitude proved inconsistent, and threshold methods were ineffective due to the focus on average spectral energy or ignorance of smaller spectral differences .

To address non-decaying harmonics in DDSP model recreations, the document suggests using DDSP losses designed for model training. A proposed system to clip the non-decaying harmonics could be implemented using the loudness signal, given the correlation between the decay of loudness and harmonics . Additionally, refining the dataset by including more realistic guitar playing improved model accuracy, although further differentiation using existing methods was challenging .

The research improved model training efficiency by overcoming initial compatibility issues between Google Colab, its Linux version, and the Nvidia CUDA toolkit. By transitioning to a Linux WSL on a personal PC with an RTX 3080, the training time was significantly reduced from 1-2 days to 1-2 hours . Additionally, advances in using Python, tensorflow, and CUDA-enabled systems facilitated faster and more efficient training of the models .

DDSP facilitates the estimation of an entire guitar effects chain by using differentiable modules that can serve as trainable features within a neural network. This approach allows the model to consider the guitar's different pickup sounds as distinct instruments, capturing specific harmonic content that is crucial for differentiating effects within a chain. The system integrates auto-encoders combining harmonic and noise synthesizers to recreate audio accurately by focusing on spectral magnitude and log magnitude errors, despite some implementation challenges .

The document indicates that a 16 kHz sample rate is a significant limitation because it cuts out a considerable portion of a guitar's signal information, especially when effects are in use. This undercuts the ability to fully capture and reproduce the range of frequencies the guitar is capable of outputting, which can detract from the accuracy of audio recreations and estimations using DDSP .

The researcher initially attempted to refine the threshold method by suggesting a spectral threshold where frequencies between harmonics with little information would be ignored. However, this approach was ineffective as automating threshold values inconsistently ignored vital differences. A refined method focused on thresholding by fundamental frequency confidence, which initially seemed promising but failed to capture minor spectral differences across recreations .

The DDSP model integrates differentiable modules of traditional DSP techniques into neural networks, thereby reducing both the complexity of models and the vastness of the required data compared to traditional machine learning methods . This integration allows DDSP to efficiently handle various tasks such as classification and timbre transfer between datasets. By using differentiable digital synthesizers and effects, DDSP reduces the need for extensive labeled data and provides better adaptability to different configurations .

The researcher evaluated the effectiveness of the models by resynthesizing audio clips not included in the training data and comparing them to original samples using spectral loss and the magnitude of noise synthesis. Issues with spectral magnitude or log magnitude consistency illuminated potential areas for improvement. Later, the evaluation included normalization of loudness and pitch to ensure fair spectral comparisons, emphasizing the minimal loss and preventing overfitting through a set optimal number of training steps (around 5000).

The research advanced through receiving valuable feedback during a presentation at UCSD’s SRC 2024. Constructive critiques from PI Tara Javidi and her research students provided insights into refining methodologies. One significant suggestion was to utilize DDSP losses for model training to address inconsistencies in audio recreation, thus enhancing the accuracy of effects estimation .

In the DDSP autoencoder, the harmonic synth captures the majority of the harmonic content of notes, which is essential for differentiating the sound of different guitar pickups. Meanwhile, the noise synth captures more cacophonous, unpredictable moments such as the very start of a plucked note. This division allows the autoencoder to accurately recreate audio by minimizing spectral magnitude and log magnitude errors .

Estimating Guitar Effect Parameters
No ratings yet
Estimating Guitar Effect Parameters
7 pages
Guitar Effects Classification with CNNs
No ratings yet
Guitar Effects Classification with CNNs
12 pages
Ddsp-Based Neural Waveform Synthesis of Polyphonic Guitar Performance From String-Wise Midi Input
No ratings yet
Ddsp-Based Neural Waveform Synthesis of Polyphonic Guitar Performance From String-Wise Midi Input
5 pages
Electric Guitar Transcription Dataset & Model
No ratings yet
Electric Guitar Transcription Dataset & Model
8 pages
HMM-Based Music Dataset Creation Method
No ratings yet
HMM-Based Music Dataset Creation Method
7 pages
Real-Time DDSP Synthesizer Plugin
No ratings yet
Real-Time DDSP Synthesizer Plugin
11 pages
MATLAB Digital Guitar Effects Guide
No ratings yet
MATLAB Digital Guitar Effects Guide
8 pages
Guitar Effects System Design Guide
50% (2)
Guitar Effects System Design Guide
11 pages
Guitar vs Bass Audio Classification Techniques
No ratings yet
Guitar vs Bass Audio Classification Techniques
18 pages
Faust Strings
No ratings yet
Faust Strings
35 pages
FPGA Design and Implementation of Electric Guitar Audio Effects - Project Report
100% (1)
FPGA Design and Implementation of Electric Guitar Audio Effects - Project Report
38 pages
LL: Listening and Learning in An Interactive Improvisation System
No ratings yet
LL: Listening and Learning in An Interactive Improvisation System
7 pages
Guitar Effect Recognition Using Neural Networks
No ratings yet
Guitar Effect Recognition Using Neural Networks
7 pages
Dattorro's Guide to Digital Reverberation
No ratings yet
Dattorro's Guide to Digital Reverberation
25 pages
Dattorro's Digital Audio Effects Guide
No ratings yet
Dattorro's Digital Audio Effects Guide
25 pages
Song 2 Vec
No ratings yet
Song 2 Vec
6 pages
3403 Midi DDSP Detailed Control of
No ratings yet
3403 Midi DDSP Detailed Control of
27 pages
SoundHack Plugins and Externals Overview
No ratings yet
SoundHack Plugins and Externals Overview
15 pages
Machine Learning for Guitar Tablature
No ratings yet
Machine Learning for Guitar Tablature
22 pages
Differentiable Modal Synthesis For Physical Modeling of Planar String Sound and Motion Simulation
No ratings yet
Differentiable Modal Synthesis For Physical Modeling of Planar String Sound and Motion Simulation
16 pages
Wavelet-Based Pitch Detection Algorithm
No ratings yet
Wavelet-Based Pitch Detection Algorithm
5 pages
Deep Learning for Guitar Amp Emulation
No ratings yet
Deep Learning for Guitar Amp Emulation
18 pages
Out (1) SECURITY 1
No ratings yet
Out (1) SECURITY 1
56 pages
Multi-Instrument Music Synthesis With Spectrogram Diffusion
No ratings yet
Multi-Instrument Music Synthesis With Spectrogram Diffusion
12 pages
2020 Christian Steinmetz
No ratings yet
2020 Christian Steinmetz
103 pages
Nantes Universit E, Ecole Centrale Nantes, CNRS, LS2N, UMR 6004, F-44000 Nantes, France
No ratings yet
Nantes Universit E, Ecole Centrale Nantes, CNRS, LS2N, UMR 6004, F-44000 Nantes, France
5 pages
Deep Learning for Sound Parameter Estimation
No ratings yet
Deep Learning for Sound Parameter Estimation
6 pages
Gorlow 16a
No ratings yet
Gorlow 16a
14 pages
Instrument Timbre Transformation Thesis
No ratings yet
Instrument Timbre Transformation Thesis
63 pages
Deep Learning for Intelligent Audio Mixing
No ratings yet
Deep Learning for Intelligent Audio Mixing
4 pages
DDSP Piano JAES Final
No ratings yet
DDSP Piano JAES Final
15 pages
Mit PDF
No ratings yet
Mit PDF
81 pages
Deep Learning for Music Instrument Classification
No ratings yet
Deep Learning for Music Instrument Classification
6 pages
Guitar Playing Mode Classification System
No ratings yet
Guitar Playing Mode Classification System
14 pages
Music Metadata Inference Algorithms
No ratings yet
Music Metadata Inference Algorithms
6 pages
Digital Guitar Multi-Effects Processor Using Dual Microcontr
No ratings yet
Digital Guitar Multi-Effects Processor Using Dual Microcontr
8 pages
Digital Echo Generator Design in DSP
No ratings yet
Digital Echo Generator Design in DSP
1 page
Classical Guitar Synthesizer in SuperCollider
No ratings yet
Classical Guitar Synthesizer in SuperCollider
4 pages
AI Deep Learning Guitar Plug-in Study
No ratings yet
AI Deep Learning Guitar Plug-in Study
10 pages
AI-Assisted Music Tab Generation
No ratings yet
AI-Assisted Music Tab Generation
15 pages
Digital Echo Generator Design Guide
No ratings yet
Digital Echo Generator Design Guide
1 page
AI Music Generation for Modern Media
No ratings yet
AI Music Generation for Modern Media
102 pages
AI Guitar Plug-in Study: Audio Preferences
No ratings yet
AI Guitar Plug-in Study: Audio Preferences
12 pages
Musical Instrument Identification Using Deep Learning Approach - 70075
No ratings yet
Musical Instrument Identification Using Deep Learning Approach - 70075
18 pages
Neural Network for Audio-Score Alignment
No ratings yet
Neural Network for Audio-Score Alignment
4 pages
Nonlinear Distortion Restoration in Audio
No ratings yet
Nonlinear Distortion Restoration in Audio
13 pages
Automatic Guitar Tablature Generation
No ratings yet
Automatic Guitar Tablature Generation
6 pages
The Augmented Tonoscope Explained
100% (1)
The Augmented Tonoscope Explained
175 pages
Deep Learning for Music Source Separation
No ratings yet
Deep Learning for Music Source Separation
6 pages
Novel Method for Chord Recognition
No ratings yet
Novel Method for Chord Recognition
9 pages
Automated Synthesis of Musical Instruments
No ratings yet
Automated Synthesis of Musical Instruments
20 pages
Dynamics and Meter in Luzon Folk Songs
100% (1)
Dynamics and Meter in Luzon Folk Songs
13 pages
Brejeiro by Ernesto Nazareth
No ratings yet
Brejeiro by Ernesto Nazareth
5 pages
The Jazz Discharge Party Hat - Full Score
100% (2)
The Jazz Discharge Party Hat - Full Score
7 pages
Klezmer Violin Techniques Explained
100% (5)
Klezmer Violin Techniques Explained
3 pages
Sabor a Mi Guitar Tabs and Chords
No ratings yet
Sabor a Mi Guitar Tabs and Chords
5 pages
Fretboard Memorization Worksheets
No ratings yet
Fretboard Memorization Worksheets
4 pages
Sikuti Dance and Luhya Ornaments in Culture
No ratings yet
Sikuti Dance and Luhya Ornaments in Culture
23 pages
Genio Essence User Manual
No ratings yet
Genio Essence User Manual
20 pages
Tamil Guitar Chords for Beginners
No ratings yet
Tamil Guitar Chords for Beginners
5 pages
Gajendra Verma's Saajna Re Chords
No ratings yet
Gajendra Verma's Saajna Re Chords
4 pages
Easy Piano Sheet: Last Christmas
No ratings yet
Easy Piano Sheet: Last Christmas
1 page
James Horner Score The Darkside of The Moon - Apollo 13
No ratings yet
James Horner Score The Darkside of The Moon - Apollo 13
18 pages
William F. Seefeldt: Musical Legacy
No ratings yet
William F. Seefeldt: Musical Legacy
2 pages
East Asian Music Lesson Plans
No ratings yet
East Asian Music Lesson Plans
3 pages
Side-Slipping Chords 1
No ratings yet
Side-Slipping Chords 1
1 page
S90XS Voice List Overview
No ratings yet
S90XS Voice List Overview
27 pages
German Bow Evolution and Makers
No ratings yet
German Bow Evolution and Makers
14 pages
Kishori Amonkar: Legacy of a Vocalist
No ratings yet
Kishori Amonkar: Legacy of a Vocalist
7 pages
MT10 - Paper 1 - TE
No ratings yet
MT10 - Paper 1 - TE
22 pages
Alvin Lucier's Sound and Space Exploration
No ratings yet
Alvin Lucier's Sound and Space Exploration
6 pages
Marty McFly's Unexpected Call
No ratings yet
Marty McFly's Unexpected Call
250 pages
Acoustic Analysis of the Peruvian Quena
No ratings yet
Acoustic Analysis of the Peruvian Quena
9 pages
Clear Brook HS Audition Schedule
No ratings yet
Clear Brook HS Audition Schedule
2 pages
Beginner's Guide to Playing Pandeiro
No ratings yet
Beginner's Guide to Playing Pandeiro
24 pages
Understanding Bambuco: Origins and Dance
No ratings yet
Understanding Bambuco: Origins and Dance
10 pages
Wind of Change: Hope and Unity
No ratings yet
Wind of Change: Hope and Unity
2 pages
Preview: Celtic Alleluia: Sending Forth
No ratings yet
Preview: Celtic Alleluia: Sending Forth
19 pages
Senior Recital: A Musical Journey
No ratings yet
Senior Recital: A Musical Journey
100 pages
House of The Rising Sun Guitar Tabs and Chords
No ratings yet
House of The Rising Sun Guitar Tabs and Chords
5 pages
Mahler S Symphony No. 9
No ratings yet
Mahler S Symphony No. 9
17 pages

Guitar Effects Estimation with DDSP

Uploaded by

Guitar Effects Estimation with DDSP

Uploaded by

Wittemann 1

Blind estimation of guitar AFX's using DDSP by Luke Wittemann

Sample from training data(left) and resynthesis after training(right)

Common questions

What are the main challenges faced in the blind estimation of guitar audio effects using DDSP according to the document?

What solutions or methods were explored to address the problem of non-decaying harmonics in DDSP model recreations?

What were the main technological advancements used in the document's research that improved efficiency in model training?

How does the distinctive approach of DDSP facilitate the estimation of an entire guitar effects chain?

What does the document indicate about the limitations imposed by sample rate in DDSP-based audio estimation?

In what ways did the researcher attempt to refine the threshold method for spectral comparison in the document?

How does the DDSP model improve upon traditional machine learning methods for audio effects estimation?

How did the researcher evaluate the effectiveness of their guitar effects estimation models, as per the document?

What feedback and collaborative opportunities contributed to advancing the research described in the document?

What role do harmonic synth and noise synth play in the DDSP autoencoder according to the document?

You might also like