0% found this document useful (0 votes)

27 views25 pages

RF Signal Classification via Deep Learning

Uploaded by

Abhishek Raj Soni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

27 views25 pages

RF Signal Classification via Deep Learning

Uploaded by

Abhishek Raj Soni

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Internship Project Report and Documentation: RF Signal

Classification using Deep Learning (06 July 2020 to 21 Aug 2020)

Yoke Kai Wen

August 20, 2020

Contents
1 Intent 2
1.1 Motivation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Project organisation and main questions to answer . . . . . . . . . . . . . . . . . . . 2

2 Literature Studies 2
2.1 DeepSig and radioML datasets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3
2.2 Using CNN to extract features from I/Q time-series . . . . . . . . . . . . . . . . . . 4
2.3 Representing I/Q time-series as amplitude-phase time-series . . . . . . . . . . . . . . 6
2.4 Representing I/Q time-series as constellation images . . . . . . . . . . . . . . . . . . 7

3 General implementation methodology 8

3.1 Google Colaboratory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3.2 Training parameters . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
3.3 Performance metrics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 9

4 Part 1: Reproduction of previous architectures on radioML dataset 9

4.1 Methodology . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
4.2 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 11

5 Part 2: Evaluation of models on a more realistic dataset 16

5.1 Methodology: Dataset creation using Matlab . . . . . . . . . . . . . . . . . . . . . . 16
5.2 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 20

6 Conclusion and Future work 23

6.1 Model architecture: CLDNN, ResNet . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
6.2 Input features . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
6.3 Model performance in different datasets . . . . . . . . . . . . . . . . . . . . . . . . . 24
6.4 Future work . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 24

7 References 24

8 Appendix: How to access code and data files 25

1
1 Intent
1.1 Motivation
The aim of this project is to classify RF signals by their modulation type using deep learning
models. Modulation recognition is an important component of spectrum management and an es-
sential first step towards realising the vision of cognitive radios to better ensure access to radio
frequencies required for military and civilian communications. Traditional methods of modulation
recognition involve feature extraction and design by domain experts which takes a lot of time and
effort, whereas deep learning methods have been shown to be capable of extracting features auto-
matically from raw I/Q data and surpassing the performance of traditional methods. Therefore,
in this project, I focus on using deep learning methods for modulation classification.

1.2 Project organisation and main questions to answer

My project can be split into two parts. In the first part, I work with the 2016.10A radioML dataset
[1] (described in the next section), and try to reproduce previous work on modulation classifica-
tion described in Table 1. The main questions I want to answer in this part are: What is the best
neural architecture and input feature type for the task of modulation classification, and why do
they work? In the second part, I create another RF dataset that simulates more realistic and
harsher channel conditions than the radioML dataset, and test the models that performed well
in the previous part on this new and more difficult dataset. The main questions I want to answer
here are: How well can DL models adapt to RF signals from different channel conditions, and do
the conclusions drawn from the first part apply to this dataset as well?

2 Literature Studies
Several neural architectures (employing various combinations of convolutional and recurrent lay-
ers) and RF signal data representations (I/Q time-series, amplitude-phase time-series and con-
stellation diagrams) have been proposed for the task of modulation recognition. In this section, I
highlight works that I have heavily relied on, as summarised in Table 1.

Architecture Input feature Dataset [1] Source

2016.04C, [2]
2x128 I/Q time-series
Basic CNN 2016.10A radioML [3]
2x128 amp-phase time-series 2016.10A radioML [4]
Inception 2x128 I/Q time-series 2016.10A radioML [3]
2x128 I/Q time-series 2016.10A radioML [3]
ResNet
2x1024 I/Q time-series 2018.01A radioML [5]
CLDNN 2x128 I/Q time-series 2016.10A radioML [3]
LSTM 2x128 amp-phase time-series 2016.10A radioML [6]
Grayscale constellation
unknown [7]
AlexNet image (227x227x1)
Coloured constellation
[8]
image (128x128x3), unknown
[9]
(227x227x3)

Table 1: Table summarizing neural architectures and signal feature types focused on in this
project

2
2.1 DeepSig and radioML datasets
DeepSig Inc. ([Link] and its researchers are the pioneers in the field of RF
signal processing using machine learning, having written many papers on this topic [2] [3] [5] and
published several RF datasets [1]. The three RF radioML datasets are available here: https://
[Link]/datasets. The 2016.04C and 2016.10A datasets contain 11 types of modulation
schemes ranging across SNRs from -20dB to 18dB, with each data sample being an I/Q time-
series with 128 time-steps, represented as a 2x128 array. Realistic channel imperfections such as
moderate LO drift, light multipath fading and AWGN are included in the datasets (generated
by GNU Radio) and the detailed process for dataset generation can be found in O’Shea et al.’s
paper [1]. There are 8 digital modulation classes (BPSK, QPSK, 8PSK, PAM4, QAM16, QAM64,
GFSK, CPFSK) and 3 analogue modulation classes (WBFM, AM-DSB, AM-SSB) (see Figure 1). The
work from DeepSig forms the basis for this project.

Figure 1: Constellation diagrams of 11 modulation schemes at SNR=18dB of the 2016.10A ra-

dioML dataset [1]

The 2016 radioML datasets have been used by other papers as a benchmark and therefore I also
use them in the project. I explore all three radioML datasets in more detail in my dataset -
[Link] notebook, and have found flaws in the 2016.04C and 2018.01A datasets
that make them unusable, hence only the 2016.10A radioML dataset is used in this project. The
description of each dataset is presented in Table 2.

3
2016.04C 2016.10A 2018.01A
num classes 11 24
Digital: OOK, 4ASK, 8ASK, BPSK,
QPSK, 8PSK, 16PSK, 32PSK,
Digital: BPSK, QPSK, 8PSK, 16APSK, 32APSK, 64APSK,
QAM16, QAM64, CPFSK, 128APSK, 16QAM, 32QAM,
Class types GFSK, PAM4 64QAM, 128QAM, 256QAM,
Analogue: WBFM, AM-SSB, GMSK, OQPSK
AM-DSB Analogue: AM-SSB-WC,
AM-SSB-SC, AM-DSB-WC,
AM-DSB-SC, FM
SNR range -20dB to 18dB
num samps
705 1000 >4000
per (mod, snr)
samp
2x128 2x1024
format
Problems Almost impossible to differentiate between analogue modulations
(analogue because of pauses in the voice recording of the source dataset
modulation) that analogue modulations were generated from.
Other very noisy,
- Class labels are wrong
problems not normalised

Table 2: Table summarizing descriptions of radioML datasets

2.2 Using CNN to extract features from I/Q time-series

O’Shea et al. [2] [3] [5] propose several variations of Convolutional Neural Networks for the task
of modulation classification. 1D convolutional layers have proven to be helpful for time-series
analysis in tasks such as human activity recognition and financial time-series forecasting, so it
makes sense that 1D convolutional layers would also be helpful for extracting features from the
I/Q time-series. In [2] and [3], O’Shea et al. suggest that convolutional filters could be analogous
to matched filters at the receiver which help to maximise the SNR of the received signal at spe-
cific points. I attempt to visualise the filters and feature maps of the CNN layers but have not
managed to figure out if they do actually work like matched filters. Nonetheless, the CNN archi-
tectures are quite successful in modulation classification, and below I describe four variations of
CNN from [2] [3] [5] that have been inspired by architectures applied in computer vision, such as
ResNet and Inception.

2.2.1 Basic CNN [2]

In 2016, O’Shea et al. [2] designed a basic CNN with two convolutional layers followed by two
dense layers (architecture shown in Figure 2) to prove the point that even a simple neural archi-
tecture like this outperforms traditional expert feature based methods, which consequently led to
more research in using deep learning methods for modulation classification. The choice of filter
sizes (1x3, 2x3) and filter numbers in each layer were determined through trial and error.

4
Figure 2: Architecture of basic CNN with two convolutional layers and two dense layers [2]

2.2.2 CNN based on Inception [3]

The inception module contains filters of varying sizes in each layer, allowing processing of spatial
information at various scales, which are then aggregated when the filter outputs are concatenated
together. O’Shea et al. [3] adopts the same idea, but varies the filter sizes used in the inception
module (see Figure 3). In [3], they mention that by expert knowledge they hypothesize 8-tap fil-
ters to be the most optimal, but I was still not very sure why and I was thinking it could be re-
lated to rule-of-thumbs for determining FIR filter order based on desired filter attributes such as
bandwidth, roll-off, attenuation. [3] use the 2016.10A dataset and find the Inception modules not
significantly helpful in improving classification accuracy.

Figure 3: Architecture of 1D inception module [3]

2.2.3 CNN based on ResNet [5]

Residual networks have done very well in computer vision tasks because they allow networks
to go deep without facing the problem of vanishing gradients by having skip connections be-
tween layers, which also allows features to operate at multiple scales and depths throughout the
network. This idea was adopted in [3] and [5] with the residual unit and residual stack module
shown in Figure 4, where each convolutional layer contains 1x3 filters. The former paper did not

5
manage to achieve exceptional results with ResNet modules and thought that it was because
CNNs were limited in how much they could learn from radio signals so it did not matter how
deep the network can go. A year later, the same researchers [5] produced and used the 2018.01A
radioML dataset and showed that its modified ResNet architecture outperformed previous neural
architectures. These modifications include the activation functions, dropout layers and initialisa-
tion functions, but I am not sure of the exact details.

Figure 4: Architecture of 1D Residual Stack module [5]

2.2.4 CLDNN: CNN followed by LSTM [3]

Convolutional Long short-term Deep Neural Networks (CLDNN) have been used extensively for
voice-processing research. Unlike the previous CNN architectures, CLDNN adds a recurrent unit
(LSTM) after the convolutional layers to extract temporal features. [3] also adds a skip connec-
tion after the first convolutional layer to the concatenated layer so that the LSTM can process
the waveform in a rawer state (see Figure 5).

Figure 5: Architecture of CLDNN [3]

2.3 Representing I/Q time-series as amplitude-phase time-series

Some papers [4] [6] used amplitude-phase time-series as input to their deep learning models rather
than using I/Q time-series directly, and reported better results. [4] also tried using the frequency
spectra as inputs but this performed badly so I will not discuss it here.

6
2.3.1 Basic CNN with amplitude-phase time-series as input [4]
[4] used a similar CNN architecture as in [2] (Figure 2) and tried training it with both I/Q and
amplitude-phase time-series from the 2016.10A radioML dataset. It was found that training with
amplitude-phase time series improves classification accuracy only at high SNR, while training
with I/Q data resulted in higher accuracy at low SNR, but it was not clear why this was so.

2.3.2 LSTM with amplitude-phase time-series as input [6]

Recurrent Neural Networks (RNN) are commonly used for learning persistent features in time-
series data, and the LSTM (Long Short Term Memory network) is a type of RNN that is able
to retain a longer history and thus capable of learning longer term patterns. [6] showed that a
simple network consisting of two LSTM layers and two fully connected layers (see architecture in
Figure 6), trained with amplitude-phase time-series data, was able to achieve very good classifica-
tion accuracy on the 2016.10A radioML dataset. The number of layers and number of cells were
determined experimentally.

Figure 6: Architecture of LSTM network comprising 2 LSTM layers and 2 dense layers [6]

2.4 Representing I/Q time-series as constellation images

Several papers [7] [8] [9] proposed to transform the I/Q raw data into constellation images to be
fed into a CNN image classifier. Constellation diagrams are widely used as a 2-D representation
of a modulated signal by mapping signal samples into scatter points on the complex plane. [7]
simply transforms the I/Q data into a grayscale image, while [9] and [8] propose a data conver-
sion method by mapping point density to a colours from a colour scale, with an example shown
in Figure 7. All three papers used CNN models based on variants of AlexNet for constellation
image classification. [8] compared the constellation model with time-series IQ models on two spe-
cific QAM modulations, and found that the constellation model was able to achieve 100% ac-
curacy across a range of SNRs and outperforms the IQ models significantly. [9] compared [7]’s
single-channel constellation images with its RGB constellation images and found that training
with the coloured version resulted in better accuracy. These papers did not make the RF datasets

7
they used available so it is difficult to benchmark them against other classification models, and it
was also not clear how many data points were used per image.

Figure 7: Data conversion from I/Q time-series to constellation image coloured by density, with
an example QPSK signal [9]

3 General implementation methodology

In this section, I describe in general the implementation process such as computing resources,
training parameters and performance metrics used.

3.1 Google Colaboratory

All code was written and executed on Google Colaboratory using the Keras library. Google Co-
lab provides for free a 12GB NVIDIA Tesla K80 GPU, Intel(R) Xeon(R) CPU @ 2.30GHz and
12GB RAM. The GPU was sufficient - each epoch for time-series model takes less than 30s and
for the constellation model around a minute. However, the 12GB RAM was quite limiting be-
cause loading the RF datasets (from my Google Drive) into Colab consumes a lot of memory,
especially when I convert the time-series data into constellation images, so care has to be taken
to delete variables when not needed. Also, Google Drive provides only 15GB free storage, while
each RF dataset can take up a few GB, so I had to delete and re-upload files onto Google Drive
frequently.

3.2 Training parameters

3.2.1 Dataset used
Only the 8 digital modulation classes from the 2016.10A radioML dataset [1] are used for train-
ing and evaluation, while the analogue modulations are removed because it is impossible to dis-
tinguish between them anyway due to pauses in the audio source they originated from. Although
this makes it difficult to compare my implementation’s results with those from previous papers

8
that used all 11 classes in the radioML dataset, I think the tradeoff for a better reflection of model
accuracy is worth it.

3.2.2 Train-validation-test splits

67% of the radioML dataset was randomly selected for training, 13% for validation and 20% for
testing. I initially considered doing 5-Fold validation for a fairer evaluation but it was taking
too long and consuming too much memory and so I gave up. Since I had 1000 samples for each
(modulation, SNR), after random selection and doing train-val-test splits, I have roughly 670 for
training, 130 for validation and 200 for testing for each (modulation, SNR).

3.2.3 Batch size

Batch sizes of 1024 were used for time-series data inputs and batch sizes of 64 for constellation
image data inputs due to memory constraints.

3.2.4 Training epochs

I set a maximum training epochs of 100, and activated Early Stopping with a patience of 10
epochs for time-series and 4 epochs for constellation images, meaning that training stops when
the validation loss has not improved for n consecutive epochs. Usually for time-series training,
the model overfits before 70 epochs while for constellation image training, the model overfits be-
fore 40 epochs.

3.2.5 Optimisers and other hyperparameters

In this project, I did not spend a lot of time on optimising hyperparameters, and most of the hy-
perparameters were default values in Keras except when otherwise specified in the papers. I used
the Adam optimiser for all models as it is known to be relatively easy to configure because the
default configuration parameters do well on most problems. The time-series model uses a learn-
ing rate of 10−3 while the constellation model uses a learning rate of 10−4 because 10−3 did not
work for the constellation model.

3.3 Performance metrics

I use the overall classification accuracy ( number classif
total
ied correctly
) across all modulation classes
and SNRs as the main metric. However, this is not very helpful because performance varies vastly
across the SNR range, so I also calculate three more metrics for each SNR range: low SNR (-
20dB to -12dB), medium SNR (-10dB to 4dB) and high SNR (6dB to 18dB) classification ac-
curacy.
I also plot confusion matrices at different SNRs to gain more insight into which modulation classes
usually get confused and why.

4 Part 1: Reproduction of previous architectures on radioML

dataset
Part 1 focuses on implementing promising model architectures and feature types in Table 1, com-
paring them and drawing insights into the best architecture and feature type for modulation clas-
sification. Then, I try to incorporate these insights into a combined model. I chose these models

9
because they represent a wide range of architectures and there was also enough information in
the papers for me to reproduce them.

4.1 Methodology
I created three Colab notebooks for Part 1: (1) the time-series [Link] note-
book for training different architectures (see Figure 1) on time-series features and analysing the
effects of training with amplitude-phase time-series instead of I/Q time-series ; (2) the constel-
lation [Link] notebook for experimenting with different representations of
constellation images and their effects on classification accuracy; (3) the model evaluation ra-
[Link] notebook for evaluating and comparing the best performing models, highlighting
insights and presenting a combined model that integrates these insights. All of the implementa-
tion details can be found in the notebooks, so below I only explain key things to note.

4.1.1 Conversion of I/Q data to amplitude-phase data

Note that although the I and Q components were already normalised, after obtaining the ampli-
tude and phase data from the I, Q components via the standard formula, it is still necessary to
normalise them, otherwise the model will perform poorly. The normalisation process (see code
below) follows [6].
1 import numpy as np
2 def iq2ampphase ( inphase , quad ) :
3 amplitude = np . sqrt ( np . square ( inphase ) + np . square ( quad ) )
4 amp_norm = np . linalg . norm ( amplitude ) # L2 norm
5 amplitude = amplitude / amp_norm # normalise
6 phase = np . arctan ( np . divide ( quad , inphase ) )
7 phase = 2.*( phase - np . min ( phase ) ) / np . ptp ( phase ) -1 # rescale phase to range
[ -1 , 1]
8 return amplitude , phase

4.1.2 Conversion of I/Q data to constellation images

Conversion to constellation image involves dividing the I and Q axes (in the desired region of the
complex plane) into nbins and counting the number of points that fall into each bin, and then
normalising the counts to a range of -1 to 1. A colormap from matplotlib is then applied, with
cmap=’gray’ for single channel and cmap=’hot’ for three channels. I explored constellations
with nbins = 32, 48, 64, 96 and single, triple channels.
1 xyrange = 0.02 # 0.05 for matlab data , this sets the region in complex plane to be
captured
2 counts , xedges , yedges = np . histogram2d ( inphase , quadrature , bins =b , range = [[ -
xyrange , xyrange ] , [ - xyrange , xyrange ]])
3 def arr2img ( arr , chnum ) :
4 norm = plt . Normalize ( vmin = arr . min () , vmax = arr . max () )
5 if chnum == 1:
6 cmap = plt . cm . gray
7 image4d = cmap ( norm ( arr ) ) # RGBA
8 img = image4d [: ,: ,0] # All RGBA channels identical
9 elif chnum == 3:
10 cmap = plt . cm . hot # or can choose any other colormap
11 image4d = cmap ( norm ( arr ) ) # RGBA
12 img = image4d [: ,: ,:3] # ignore A channel
13
14 return img

10
Figure 8 shows the constellation images at different resolutions and colourations. Initially, I as-
sumed that higher resolution and colour images would work the best, in accordance to [8] and [9],
especially when we have high order modulations like 64QAM. By human eye, it is clear that the
coloured ones look more distinctive, so I assumed the CNN would work similarly. Ideally, I want
to pick the lowest resolution that works accurately to save on memory consumption.

Figure 8: Examples of constellation images at different resolutions and colourations: (a) 32 bins,
grayscale; (b) 48 bins, grayscale; (c) 64 bins, grayscale; (d) 32 bins, colour; (e) 48 bins, colour; (f)
64 bins, colour

4.2 Results
In this section, I present the evaluation results of the different models and different features.
They were trained, validated and evaluated on the same train-val-test split, so all the test data
for evaluation were unseen by all the models. For the constellation models, I only trained with
higher SNR data due to memory and speed issues. I also divided the SNRs into three groups:
low (¡-10dB), medium (-10 to 5dB), high (¿5dB) for easier comparison and evaluation. Table 3
summarises the classification accuracies of all models tried on the radioML dataset at different
SNR ranges.

11
Models Classification accuray (%) at:
Architecture Input Overall High snr Med snr Low snr
IQ 55.78 81.24 60.70 13.14
Basic CNN
AP 54.92 86.67 53.90 12.92
Inception IQ 44.52 64.61 46.81 13.35
IQ 55.84 82.36 59.61 13.52
ResNet
AP 54.09 86.52 51.91 12.96
IQ 56.73 81.70 62.23 13.86
CLDNN
AP 58.98 93.60 58.67 11.92
Simple LSTM AP 55.55 87.84 54.46 12.90
32x32x1 - 90.93 51.69 -
Constellation 32x32x3 - 89.37 50.67 -
(AlexNet) 48x48x1 - 91.82 51.83 -
48x48x3 - 92.14 51.69 -

Table 3: Table showing classification accuracies of different models at three SNR ranges

From Table 3, all the models did worse or similar to random guessing at low SNR, and high SNR
classification accuracy was much higher than at medium SNR. We also see that CLDNN-AP
is the best performing at high SNR with constellation models following closely behind, while
CLDNN-IQ is the best performing at medium SNR. In the next sections, I analyse model per-
formance more closely.

4.2.1 Performance of models trained on time-series inputs

I trained five types of architectures on time-series inputs: Basic CNN, Inception, ResNet, CLDNN
and LSTM (for full model architectures, see my notebook), with each model trained separately
on IQ and amplitude-phase (AP) data, except for Inception and LSTM. Inception performed so
poorly on the raw IQ data that I did not train it with AP data; LSTM-IQ performed extremely
poorly on IQ data so I did not present its results. The classification accuracies over the entire
SNR range is shown in Figure 9.

12
Figure 9: Classification accuracy over SNR range for time-series models

A few significant observations:

1. CLDNN model trained on amplitude-phase time-series outperformed the rest of the models
significantly at high SNR. This could mean that a combination of CNN layers followed by
LSTM layers is promising for the task of modulation classification.

2. Across all models, training with AP data yielded significantly higher ( 5%) accuracy at
high SNRs.

3. Across all models, training with IQ data made them more resistant to low SNR conditions
compared to training with AP data.

To get some insight into why the AP models are performing better than the IQ models at high
SNR, I looked at the confusion matrices of cldnn-iq and cldnn-ap. It seems like AP data helps
the model to differentiate QAMs and higher order PSKs better than the raw IQ data. 8PSK and
QPSK were perfectly differentiated by cldnn-ap while there remained some confusion for differen-
tiating QAMs. This seems to indicate that amplitude-phase time-series are more distinctive fea-
tures for modulation classification, but they are more easily affected by noise conditions. These
findings about the effect of amplitude-phase features are largely consistent with [4].

13
Figure 10: Confusion matrices of (a) cldnn-iq and (b) cldnn-ap at high SNR

4.2.2 Performance of constellation models on images of different resolutions and

colourations
As expected, higher resolution results in slightly better performance, but it was surprising that
adding colour to the constellations did not improve performance significantly and even worsens
performance in some cases. Furthermore, the constellations were still not able to perfectly dif-
ferentiate the QAMs unlike what was achieved in [8] [9], and only performed slightly better than
the time-series model for QAMs differentiation. Perhaps, it was because there were only 128 data
points in each constellation which corresponds to approximately 16 symbols given a sample rate
of 8 samples per symbol in the radioML dataset, so not enough details were captured.

Figure 11: (a) Classification accuracy over SNR range for constellation models trained with dif-
ferent resolutions and colourations; (b) Confusion matrix for constellation model trained on reso-
lution 32, single colour images

14
Even after I tried training on resolution 64 and resolution 96 constellations to differentiate be-
tween QAMs, I never managed to achieve 100% accuracy and only close to 90% for the highest
SNRs.

4.2.3 Overall insights for Part 1

For the radioML dataset, it seems that the constellation feature does not help that much, and
the CLDNN-AP model is good at high SNR while the CLDNN-IQ model is better at low SNRs.
Therefore, combining the CLDNN-AP and CLDNN-IQ model seems like a good idea to get both
advantages. In fact, [8] also suggests a similar approach by first having an IQ model to separate
easier modulations and subsequently using a constellation model to differentiate more difficult
modulations (see Figure 12). In my case, I used CLDNN-IQ for the first model, and CLDNN-AP
for the second model. The resultant classification accuracy of the combined model is compared
with the other individual models in Figure 13.

Figure 12: Approach of two cascaded models, IQ followed by constellation to separate classes of
different difficulties [8]

15
Figure 13: Classification accuracy over SNR range for cldnn-iq, cldnn-ap, and constellation mod-
els

As seen in Figure 13, the combined model manages to incorporate advantages of each model and
performs well at both low and high SNRs.

5 Part 2: Evaluation of models on a more realistic dataset

Part 2 focuses on creating a dataset simulating a more realistic and difficult channel environ-
ment than the one in radioML, and testing how the best performing models and feature types
from Part 1 perform on this dataset. Ideally, I would have created datasets simulating differ-
ent types of conditions, such as foliage, urban and rain and seeing how the models respond to
each condition, but due to time constraints, I simply followed this Matlab tutorial ([Link]
[Link]/help/deeplearning/ug/[Link]) for re-
alistic channel parameters. These are not specific to any particular environment.

5.1 Methodology: Dataset creation using Matlab

I used Matlab’s communications toolbox to generate a more difficult RF dataset with the same
8 digital modulations in radioML, but this time each data sample has 2x1024 points but still
roughly the same symbol rate ( 8 samples per symbol), and with 1000 samples for each (mod,
snr) and snr ranging from -10dB to 30dB. Below, I explain the process of dataset creation, im-
portant channel parameters used and their justification.

5.1.1 Dataset generation process for M-ary modulation scheme

The general workflow for generating I/Q samples from an M-ary modulation scheme is shown
below:

1. Source msg: Generate random symbols from 0 to M-1 with a uniform distribution.

16
2. Convert to I/Q complex form using modulation functions inbuilt into Matlab’s communica-
tions toolbox.

3. Upsample to get 8 samples per symbol.

4. Filter with root-raised cosine filter with roll-off factor 0.35.

5. Apply channel effects.

6. Normalise and extract 1024 samples

5.1.2 Important parameters and channel effects

Important parameters and channels effects are shown in Table 4, and are input into Matlab’s
[Link] System object, with the exception of max clock shift which was separately
accounted for by readjusting frequency offset and sampling rate. To briefly explain the channel
impairments: Rician fading assumes a direct line-of-sight propagation path superposed together
with other reflected paths that follow Rayleigh fading processes; clock shift is from inaccuracies
of Local Oscillators (LO) resulting from heat or another conditions leading to clock inaccuracies;
Doppler shifts try to capture effects from moving emitters and receivers that cause frequency dis-
tortion.

Parameters Value Justification

Sampling freq 200kHz
Carrier freq 902MHz
3 paths with: - Path delay of E-5 s typical of outdoors RF propagation,
Rician
Delay [0, 9E-6, 1.7E-5] (s) path length difference of ∼km
multipath
Gains [0, -2, -10] (dB) - Gains arbitrary
fading
Kfactor = 4 - Kfactor 4 typical for Rician fading, 0 for Rayleigh fading
Max clock - Affects frequency offset: f offset = -fc*(clkshift/1M)
Light - 0.001, Heavy - 5
shift (ppm) - Affects sampling rate: fs new = fs*(1+clkshift/1M)
Max Doppler
4 -4Hz correspond to walking speed given f c=902MHz
shift (Hz)
AWGN (dB) -10 to 30

Table 4: Table showing important parameters and channel effects

5.1.3 Creation of easy, medium and hard datasets

I decided to create three datasets of varying levels of difficulty, with the hard dataset being the
one simulating the realistic and difficult channel conditions, while the easy and medium datasets
had more perfect channel conditions (see complete description in Table 5, and dataset visualisa-
tions can be found in my notebook). The rationale for this is that I wanted to observe how well a
model trained on one dataset would perform on a dataset that it had not train on, as this would
be the typical situation in real-life.

17
Max Max Doppler
AWGN Rician Fading
Clock Offset Shift
Hard Present 5ppm 4Hz
Medium -10 to 30dB Present 0.001ppm None
Easy None None None

Table 5: Table showing characteristics of hard, medium and easy datasets

In Figures 14, 15 and 16, I show the constellation diagrams of the 8 different digital modula-
tions at SNR=30dB for the hard, medium and easy datasets respectively. I observe that the hard
dataset is extremely noisy, with BPSK being completely unrecognisable , while the medium and
easy datasets look much more discernable. However, even the easy dataset at 30dB, which does
not undergo any channel impairments other than AWGN, looks noisier than radioML’s dataset at
18dB (see Figure 1) which was supposed to have been affected by channel effects. This suggests
the importance of a benchmark dataset as different ways of dataset creation, even for supposedly
the same SNR level, result in datasets that are very different.

Figure 14: Constellation diagrams of different modulations at SNR=30dB for Matlab hard
dataset

18
Figure 15: Constellation diagrams of different modulations at SNR=30dB for Matlab medium
dataset

Figure 16: Constellation diagrams of different modulations at SNR=30dB for Matlab easy
dataset

19
5.2 Results
5.2.1 Performance of models on hard (realistic) dataset
On the hard dataset, I tried the models that performed well on the radioML dataset, with full
details in my model evaluation [Link] notebook. Figure 17 compares the classifica-
tion accuracies of the ResNet-IQ, ResNet-AP and constellation model on the hard dataset. I
did not put in CLDNN for comparison because surprisingly, CLDNN did not perform as well as
ResNet on the hard dataset unike in the radioML dataset. Perhaps it was because the CLDNN
architecture was already tuned for data samples of 2x128 dimension and thus not able to exploit
the greater amount of information in 2x1024 data samples from the hard dataset, while ResNet,
which had many skip connections in between layers, was able to glean more patterns from the
longer input data. Perhaps combining ResNet with LSTM layers in a way similar to CLDNN
would yield even better results.

Figure 17: Classification accuracies over SNR for models on hard dataset

From Figure 17, the amplitude-phase (AP) data does not produce a big improvement in classifi-
cation accuracy unlike in the radioML dataset. I suspect it is because the hard dataset is already
quite noisy and amplitude-phase data is not resistant to noise. I expected the constellation model
to achieve better results than in the radioML dataset because the hard dataset had longer data
samples, but it only performs slightly better than ResNet at high SNR. A closer look at the con-
fusion matrices (see Figure 18) show that while the constellation model was much better than
ResNet-IQ at differentiating QAMs and PSKs, it was still nowhere close to 100% accuracy. Also,
similar to the radioML dataset, the constellation model was not as good as the time-series mod-
els in telling apart the easier modulations (BPSK, 4PAM, GFSK, CPFSK).

20
Figure 18: Confusion matrices of (a) ResNet-IQ and (b) constellation model at high SNR

Nevertheless, the constellation images and IQ time-series once again prove to be complementary
features that are able to distinguish different types of modulations, and this suggests that future
models should work on combining both features for better results.

5.2.2 Cross-evaluating models with easy, medium and hard datasets

This section was done with only a quarter of the full dataset because of memory limitations,
and the model tested was a multi-input model that combined both the resnet-iq and constella-
tion model. It essentially learns the best weight combination for both features extracted by the
resnet-iq and constellation model (Figure 19). On the hard dataset, its performance was slightly
worse than ResNet-IQ for low SNRs but it managed to outperform ResNet-IQ at high SNR,
showing that it managed to synergise the two input features to some extent.

21
Figure 19: (a) Architecture of multi-input combined model (b) Classification accuracy of com-
bined model vs individual models

A model was trained for each of the easy, medium and hard dataset, and cross-evaluated on each
other to assess how well models can adapt to another dataset. Figure 20 shows the classification
accuracy over SNR of the hard, medium and easy dataset, and unsurprisingly, the hard dataset
had the worst performance.

Figure 20: Classification accuracy over SNR on hard, medium and easy datasets

Table 6 shows the classification accuracy at 30dB when each model is cross-evaluated on another

22
dataset. It is unsurprising that models trained on easier datasets were unable to perform well on
hard dataset, but it is surprising that models trained on harder datasets did not perform better
when tested on easier datasets. This does not bode well for application of this model in real-life
since the model is likely to be exposed to different noise conditions.

Tested on
Classification accuracy at 30dB/%
Easy Medium Hard
Easy 95 87 48
Trained on Medium 94 91 48
Hard 68 71 81

Table 6: Table showing classification accuracy at 30dB when models are cross-evaluated on differ-
ent datasets

5.2.3 Overall insights for Part 2

As expected, models do not perform so well on the Matlab hard dataset now that there is more
noise and distortions. The behaviour of different features and models in differentiating different
modulation type is largely similar to that of radioML dataset, with a few differences: amplitude-
phase does not seem to have that great of an effect in accuracy and ResNet performed better
than CLDNN. Constellation models were still unable to perfectly classify QAMs even at the
highest SNRs which is very surprising. When tested for its ability to adapt to different datasets,
the models performed poorly on another set it was not trained on. Overall, this is a very brief
study and more conclusions can be drawn if we have data from a variety of channel conditions.

6 Conclusion and Future work

In this section, I summarise my main findings and suggest future directions to work on.

6.1 Model architecture: CLDNN, ResNet

For time-series input features, I found that the CLDNN architecture which combined both con-
volutional and LSTM layers was the best performing on the cleaner radioML dataset (2x128
time-series). This seems to suggest that the CNN+LSTM combination is useful for time-series
analysis. On the other hand, ResNet was best performing on the Matlab hard dataset which had
longer input time-series (2x1024), which seems to suggest that that skip connections between lay-
ers help to analyse longer time-series. Therefore, I conjecture that a network combining skip con-
nections with a CNN+LSTM combination would work well on inputs with relatively large num-
ber of time-steps.

6.2 Input features

6.2.1 Time-series features
For time-series features, amplitude-phase (AP) format has been shown to improve model perfor-
mance at high SNR but worsen performance at low SNR compared to raw IQ data, which led to
me concluding that amplitude-phase time-series are more distinctive for each modulation type
but also more easily corrupted by noise. Therefore, a combination of IQ and AP models might
lead to better performance at both low and high SNR conditions.

23
6.2.2 Constellation image feature
For constellation images, I did not find that colouration improved model performance, but I did
find that higher image resolution led to better performance, but this comes with a tradeoff in
memory consumption. Also, constellation models are better at distinguishing higher order mod-
ulations such as QAMs and PSKs, but only at high SNR, and are very sensitive to noise condi-
tions. Also, constellation models are not as good as time-series models in telling apart lower or-
der modulations. This complementary behaviour of constellation and time-series models suggests
that perhaps we can use constellation models only for higher order modulations, or have a model
that combines both time-series and image features in model classification.

6.3 Model performance in different datasets

I found that general trends, such as the effects of amplitude-phase and constellation image fea-
tures, apply across different datasets. However, the models that I tried adapted poorly when
tested on a different dataset than it was trained on, and this is a big problem. A more rigorous
analysis with data in different noise conditions can be undertaken to study model performance
under different conditions.

6.4 Future work

In this project, I did not focus much on designing my own model architecture, but after analysing
common models used in literature, I find that there is potential in designing ensemble-types of
architectures that combine different models and input features for modulation classification. I
was also quite disappointed that my coloured constellations did not work as well as expected,
perhaps it could also be due to the architecture I used (which I tried to replicate from [8]), and
more work can be done here to explore different architectures for constellation image classifica-
tion. Finally, creating more datasets simulating different channel conditions and studying how
models work under these conditions would be very helpful in realising effective modulation classi-
fication in real-life applications.

7 References

References
[1] T. O’Shea and N. West, “Radio machine learning dataset generation with gnu
radio,” Proceedings of the GNU Radio Conference, vol. 1, no. 1, 2016. [Online]. Available:
[Link]

[2] T. J. O’Shea and J. Corgan, “Convolutional radio modulation recognition networks,” CoRR,
vol. abs/1602.04105, 2016. [Online]. Available: [Link]

[3] N. E. West and T. J. O’Shea, “Deep architectures for modulation recognition,” CoRR, vol.
abs/1703.09197, 2017. [Online]. Available: [Link]

[4] M. Kulin, T. Kazaz, I. Moerman, and E. D. Poorter, “End-to-end learning from spectrum data:
A deep learning approach for wireless signal identification in spectrum monitoring applications,”
CoRR, vol. abs/1712.03987, 2017. [Online]. Available: [Link]

24
[5] T. J. O’Shea, T. Roy, and T. C. Clancy, “Over the air deep learning based
radio signal classification,” CoRR, vol. abs/1712.04578, 2017. [Online]. Available:
[Link]

[6] S. Rajendran, W. Meert, D. Giustiniano, V. Lenders, and S. Pollin, “Distributed deep

learning models for wireless signal classification with low-cost spectrum sensors,” CoRR, vol.
abs/1707.08908, 2017. [Online]. Available: [Link]

[7] S. Peng, H. Jiang, H. Wang, H. Alwageed, and Y. Yao, “Modulation classification using
convolutional neural network based deep learning model,” in 2017 26th Wireless and Optical
Communication Conference (WOCC), 2017, pp. 1–5.

[8] Y. Wang, M. Liu, J. Yang, and G. Gui, “Data-driven deep learning for automatic modulation
recognition in cognitive radios,” IEEE Transactions on Vehicular Technology, vol. 68, no. 4,
pp. 4074–4077, 2019.

[9] B. Tang, Y. Tu, Z. Zhang, and Y. Lin, “Digital signal modulation classification with data
augmentation using generative adversarial nets in cognitive radio networks,” IEEE Access,
vol. 6, pp. 15 713–15 722, 2018.

8 Appendix: How to access code and data files

1. All my notebooks and scripts for data generation are on github: [Link]
RF modulation classification.

2. The 2016.10A radioML dataset can be downloaded here: [Link]

3. The Matlab RF dataset generation tutorial can be accessed here: [Link]

com/help/deeplearning/ug/[Link].

Common questions

The CLDNN architecture combines CNN and LSTM layers, where CNN layers first extract spatial features from the RF signals, and LSTM layers then capture temporal dependencies. This combination proves advantageous as it harnesses the strengths of CNNs in feature extraction and LSTMs in handling sequential data, making it suitable for time-series analysis like modulation classification .

The document suggests that combining different data representations like constellation diagrams and I/Q time-series can improve modulation classification accuracy. This approach potentially benefits from complementary strengths, such as detailed pattern recognition and resistance to noise, thereby overcoming the limitations of using a single type of data representation. Developing models that effectively integrate multiple inputs could yield better results .

Dataset characteristics, such as SNR levels and channel imperfections, significantly impact model evaluation and performance. The document highlights that benchmark datasets are crucial because they provide a consistent comparison across models. Differences in dataset creation methods can result in varying noise levels, affecting classification results even for the same SNR levels. Thus, evaluating models necessitates datasets that accurately simulate real-world conditions .

Convolutional layers in CNNs extract features from RF signals and, according to O'Shea et al., act similarly to matched filters at the receiver. These layers maximize the SNR of the received signal at specific points. The convolutional filters are believed to perform feature extraction efficiently, contributing to the model's ability to classify modulation schemes more accurately .

ResNet architectures may be more suitable for certain RF signal classification tasks due to their ability to leverage skip connections that allow effective learning from more extended input data. They address the problem of vanishing gradients and enable the network to handle complex patterns across scales and depths, which was advantageous for dealing with the 2x1024 data samples in the harder datasets compared to the ones CLDNN was tuned for .

The document indicates that RF signal classification models show limited adaptability to different datasets and noise conditions. Models trained on simpler, less noisy datasets performed poorly when tested on more challenging ones, and vice versa. Even models trained on noisier datasets did not show improved performance on easier datasets, suggesting that effective adaptability to varied real-world noise conditions requires more comprehensive training datasets and model architectures .

Differentiating QAM modulations is challenging because the nuances between the different QAM variants are subtle and require high-resolution detail that current dataset samples may not effectively capture. The document notes that even with advanced models like constellation representations, achieving high accuracy specifically for QAMs remains difficult, possibly due to data complexity and the lack of detailed features in given samples .

Higher-resolution constellation images did not significantly improve classification accuracy because the increase in resolution did not translate to a proportionate gain in detailed information captured. The constellations, comprising only 128 data points per constellation, correspond to about 16 symbols, thus limiting the information available to differentiate complex modulation schemes like QAMs even at higher resolutions .

Amplitude-phase time-series data can offer better classification accuracy at high SNR but are more sensitive to noise, making them less effective at low SNR. I/Q time-series, on the other hand, provide more resistance to noise, allowing better performance in such conditions. The trade-off is thus between higher accuracy for specific modulation types in ideal conditions and robustness in real-world noisy conditions .

CNN-based Inception modules do not significantly improve classification accuracy compared to other architectures in the context of RF signal modulation. It is hypothesized that the specific filter configurations and the inherent ability to process multi-scale information did not leverage any significant advantage for modulation schemes, possibly due to the complexity or nature of RF signal data compared to traditional image processing tasks where Inception generally performs well .

RF Signal Classification with DNNs
No ratings yet
RF Signal Classification with DNNs
7 pages
Exploring Deep Learning Architectures For RF Signal Classification
No ratings yet
Exploring Deep Learning Architectures For RF Signal Classification
6 pages
Form 4-Sdp Interim Report
No ratings yet
Form 4-Sdp Interim Report
23 pages
Project Report Sample
No ratings yet
Project Report Sample
6 pages
LSTM for Radar and Communication Signal Classification
No ratings yet
LSTM for Radar and Communication Signal Classification
6 pages
Modulation Classification with CNNs
No ratings yet
Modulation Classification with CNNs
9 pages
CNN-Based Modulation Classification Analysis
No ratings yet
CNN-Based Modulation Classification Analysis
17 pages
LSTM for Radar and Communication Signals
No ratings yet
LSTM for Radar and Communication Signals
6 pages
Radio Frequency Machine Learning - A Practical Deep Learning - Kuzdeba - 2025 - Artech House - 9781685690335 - Anna's Archive
No ratings yet
Radio Frequency Machine Learning - A Practical Deep Learning - Kuzdeba - 2025 - Artech House - 9781685690335 - Anna's Archive
265 pages
Deep CVCNN-LSTM for Modulation Recognition
No ratings yet
Deep CVCNN-LSTM for Modulation Recognition
9 pages
CNN-Based Modulation Classification on MPSoC
No ratings yet
CNN-Based Modulation Classification on MPSoC
1 page
Deep Learning for Modulation Classification
No ratings yet
Deep Learning for Modulation Classification
4 pages
Deep Learning for Modulation Classification
No ratings yet
Deep Learning for Modulation Classification
18 pages
Diffusion-Based Signal Augmentation for AMC
No ratings yet
Diffusion-Based Signal Augmentation for AMC
17 pages
Final Thesis
No ratings yet
Final Thesis
84 pages
Deep Learning For Radar Signal Detection in The 3.5 GHZ CBRS Band Raied Caromi, Alex Lackpour, Kassem Kallas, Thao Nguyen and Michael Sourya
No ratings yet
Deep Learning For Radar Signal Detection in The 3.5 GHZ CBRS Band Raied Caromi, Alex Lackpour, Kassem Kallas, Thao Nguyen and Michael Sourya
8 pages
Deep Learning for 3.5 GHz Radar Detection
No ratings yet
Deep Learning for 3.5 GHz Radar Detection
8 pages
Identifying Rogue RF Transmitters with GANs
No ratings yet
Identifying Rogue RF Transmitters with GANs
7 pages
5G Signal Classification with Deep Learning
No ratings yet
5G Signal Classification with Deep Learning
14 pages
CNN-LSTM Hybrid Architecture For Over-The-Air Automatic Modulation Classification Using SDR
No ratings yet
CNN-LSTM Hybrid Architecture For Over-The-Air Automatic Modulation Classification Using SDR
7 pages
Deep Learning Compression for RF Signals
No ratings yet
Deep Learning Compression for RF Signals
6 pages
RF Signal Classification via Deep Learning
No ratings yet
RF Signal Classification via Deep Learning
8 pages
Digital Signal Processing Using Deep Neural Networks: Brian Shevitski, Yijing Watkins, Nicole Man and Michael Girard
No ratings yet
Digital Signal Processing Using Deep Neural Networks: Brian Shevitski, Yijing Watkins, Nicole Man and Michael Girard
21 pages
Over The Air Deep Learning Based Radio Signal Classification
No ratings yet
Over The Air Deep Learning Based Radio Signal Classification
12 pages
LoRa Cybersecurity via SOM Orthogonalization
No ratings yet
LoRa Cybersecurity via SOM Orthogonalization
20 pages
RaGAN-Based Noise Reduction for Radio Signals
No ratings yet
RaGAN-Based Noise Reduction for Radio Signals
16 pages
【IEEE文献1篇】SemanticRF - Towards Robust RF Scene Retrieval Using Natural Language Supervision
No ratings yet
【IEEE文献1篇】SemanticRF - Towards Robust RF Scene Retrieval Using Natural Language Supervision
6 pages
Real-Time CNN for Signal Classification
No ratings yet
Real-Time CNN for Signal Classification
4 pages
Intelligent Radio Signal Processing Survey
No ratings yet
Intelligent Radio Signal Processing Survey
30 pages
ANSIS-Cloud: RF Signal Analysis and Modulation Classification System
No ratings yet
ANSIS-Cloud: RF Signal Analysis and Modulation Classification System
11 pages
Electronics 13 01604
No ratings yet
Electronics 13 01604
13 pages
Modulation Recognition With GNU Radio, Keras, and
No ratings yet
Modulation Recognition With GNU Radio, Keras, and
3 pages
D-MoE Paper - HaraNagaSai
No ratings yet
D-MoE Paper - HaraNagaSai
9 pages
SDR Receiver for Real-Time Modulation Classification
No ratings yet
SDR Receiver for Real-Time Modulation Classification
21 pages
Robust and Fast Automatic Modulation Classification With CNN Under Multipath Fading Channels
No ratings yet
Robust and Fast Automatic Modulation Classification With CNN Under Multipath Fading Channels
6 pages
Intelligent Spectrum Analyzer
No ratings yet
Intelligent Spectrum Analyzer
1 page
Automatic Modulation ClassificationBased On Deep Learning For SDR
No ratings yet
Automatic Modulation ClassificationBased On Deep Learning For SDR
13 pages
SDR Receiver for Real-Time Modulation Classification
No ratings yet
SDR Receiver for Real-Time Modulation Classification
21 pages
Deep Learning for Modulation Classification
No ratings yet
Deep Learning for Modulation Classification
19 pages
DRFM Jamming Detection in Radar Systems
No ratings yet
DRFM Jamming Detection in Radar Systems
94 pages
Deep Learning for Multi-Signal Detection
No ratings yet
Deep Learning for Multi-Signal Detection
20 pages
RFSensingGPT: Advancing RF Sensing in 6G
No ratings yet
RFSensingGPT: Advancing RF Sensing in 6G
14 pages
Deep Learning for Modulation Classification
No ratings yet
Deep Learning for Modulation Classification
66 pages
Deep Learning for DFRC Systems Design
No ratings yet
Deep Learning for DFRC Systems Design
17 pages
DeepRx: Convolutional Receiver for 5G
No ratings yet
DeepRx: Convolutional Receiver for 5G
32 pages
Self-Supervised Learning for RF Signals
No ratings yet
Self-Supervised Learning for RF Signals
5 pages
Deep Learning for Modulation Recognition
No ratings yet
Deep Learning for Modulation Recognition
5 pages
Deep Learning for Modulation Classification
No ratings yet
Deep Learning for Modulation Classification
19 pages
LSTM-Based IQ Symbol Processing in OFDM
No ratings yet
LSTM-Based IQ Symbol Processing in OFDM
9 pages
Adversarial Attacks in Wireless Deep Learning
No ratings yet
Adversarial Attacks in Wireless Deep Learning
30 pages
Deep Learning for Spectrum Sensing Review
No ratings yet
Deep Learning for Spectrum Sensing Review
25 pages
Wheat Disease Control Knowledge Graph
No ratings yet
Wheat Disease Control Knowledge Graph
137 pages
Simulation-Based Validation For Autonomous Driving Systems
No ratings yet
Simulation-Based Validation For Autonomous Driving Systems
13 pages
Understanding LTE With MATLAB - Zarrinkoub Houman
No ratings yet
Understanding LTE With MATLAB - Zarrinkoub Houman
8 pages
Nonlinear Loads and Harmonics Study
No ratings yet
Nonlinear Loads and Harmonics Study
4 pages
Bubble Sort Overview and Analysis
No ratings yet
Bubble Sort Overview and Analysis
10 pages
Bethesda's Social Media Strategy Insights
No ratings yet
Bethesda's Social Media Strategy Insights
2 pages
Computers and Geotechnics: Ning Luo, Richard J. Bathurst, Sina Javankhoshdel
No ratings yet
Computers and Geotechnics: Ning Luo, Richard J. Bathurst, Sina Javankhoshdel
11 pages
12.5A Silicon Controlled Rectifier Data
No ratings yet
12.5A Silicon Controlled Rectifier Data
3 pages
Power BI Overview and Features Guide
No ratings yet
Power BI Overview and Features Guide
27 pages
LED Strip Lighting Installation Guide
100% (1)
LED Strip Lighting Installation Guide
131 pages
Diagnosing Transmission Ratio Codes
100% (1)
Diagnosing Transmission Ratio Codes
8 pages
SpectrOil 100 Series Datasheet
No ratings yet
SpectrOil 100 Series Datasheet
2 pages
845H Optical Incremental Encoder Overview
No ratings yet
845H Optical Incremental Encoder Overview
4 pages
Answerkey 5.PDF
No ratings yet
Answerkey 5.PDF
6 pages
Design Trends and Collaborations August 2013
No ratings yet
Design Trends and Collaborations August 2013
100 pages
Etisalat Telecom Infrastructure Guidelines
No ratings yet
Etisalat Telecom Infrastructure Guidelines
12 pages
Resource Planning and Scheduling Techniques
No ratings yet
Resource Planning and Scheduling Techniques
29 pages
102 Combinatorial Problems for IMO Training
No ratings yet
102 Combinatorial Problems for IMO Training
1 page
HP Printers - Device Types For SAP Printing - HP® Customer Support
No ratings yet
HP Printers - Device Types For SAP Printing - HP® Customer Support
21 pages
PG Diploma in Data Analytics at IIIT-B
No ratings yet
PG Diploma in Data Analytics at IIIT-B
9 pages
NT538 Controller Instruction Manual
No ratings yet
NT538 Controller Instruction Manual
30 pages
MS Access Combo Box and Form Guide
No ratings yet
MS Access Combo Box and Form Guide
3 pages
Abaqus Input File Format Explained
No ratings yet
Abaqus Input File Format Explained
3 pages
Big Data and Data Lake Concepts Explained
No ratings yet
Big Data and Data Lake Concepts Explained
5 pages
Kumar Kashyap's Academic & Project Profile
No ratings yet
Kumar Kashyap's Academic & Project Profile
1 page
HP PC Commercial
No ratings yet
HP PC Commercial
62 pages
Backend Engineer Role at Spinny
No ratings yet
Backend Engineer Role at Spinny
2 pages
Evolution of Operating Systems
No ratings yet
Evolution of Operating Systems
21 pages
Depth Recovery in Stereo Vision Techniques
No ratings yet
Depth Recovery in Stereo Vision Techniques
32 pages
Marketing Experience Overview
No ratings yet
Marketing Experience Overview
1 page

RF Signal Classification via Deep Learning

Uploaded by

RF Signal Classification via Deep Learning

Uploaded by

Internship Project Report and Documentation: RF Signal

Classification using Deep Learning (06 July 2020 to 21 Aug 2020)

Yoke Kai Wen

August 20, 2020

3 General implementation methodology 8

4 Part 1: Reproduction of previous architectures on radioML dataset 9

5 Part 2: Evaluation of models on a more realistic dataset 16

6 Conclusion and Future work 23

8 Appendix: How to access code and data files 25

1.2 Project organisation and main questions to answer

Architecture Input feature Dataset [1] Source

Figure 1: Constellation diagrams of 11 modulation schemes at SNR=18dB of the 2016.10A ra-

Table 2: Table summarizing descriptions of radioML datasets

2.2 Using CNN to extract features from I/Q time-series

2.2.1 Basic CNN [2]

2.2.2 CNN based on Inception [3]

Figure 3: Architecture of 1D inception module [3]

2.2.3 CNN based on ResNet [5]

Figure 4: Architecture of 1D Residual Stack module [5]

2.2.4 CLDNN: CNN followed by LSTM [3]

Figure 5: Architecture of CLDNN [3]

2.3 Representing I/Q time-series as amplitude-phase time-series

2.3.2 LSTM with amplitude-phase time-series as input [6]

2.4 Representing I/Q time-series as constellation images

3 General implementation methodology

3.1 Google Colaboratory

3.2 Training parameters

3.2.2 Train-validation-test splits

3.2.3 Batch size

3.2.4 Training epochs

3.2.5 Optimisers and other hyperparameters

3.3 Performance metrics

4 Part 1: Reproduction of previous architectures on radioML

4.1.1 Conversion of I/Q data to amplitude-phase data

4.1.2 Conversion of I/Q data to constellation images

4.2.1 Performance of models trained on time-series inputs

A few significant observations:

4.2.2 Performance of constellation models on images of different resolutions and

4.2.3 Overall insights for Part 1

5 Part 2: Evaluation of models on a more realistic dataset

5.1 Methodology: Dataset creation using Matlab

5.1.1 Dataset generation process for M-ary modulation scheme

3. Upsample to get 8 samples per symbol.

4. Filter with root-raised cosine filter with roll-off factor 0.35.

5. Apply channel effects.

6. Normalise and extract 1024 samples

5.1.2 Important parameters and channel effects

Parameters Value Justification

Table 4: Table showing important parameters and channel effects

5.1.3 Creation of easy, medium and hard datasets

Table 5: Table showing characteristics of hard, medium and easy datasets

5.2.2 Cross-evaluating models with easy, medium and hard datasets

5.2.3 Overall insights for Part 2

6 Conclusion and Future work

6.1 Model architecture: CLDNN, ResNet

6.2 Input features

6.3 Model performance in different datasets

6.4 Future work

[6] S. Rajendran, W. Meert, D. Giustiniano, V. Lenders, and S. Pollin, “Distributed deep

8 Appendix: How to access code and data files

2. The 2016.10A radioML dataset can be downloaded here: [Link]

3. The Matlab RF dataset generation tutorial can be accessed here: [Link]

Common questions

How does the CLDNN architecture utilize both CNN and LSTM layers for modulation classification, and why might this be advantageous?

How does the CLDNN architecture utilize both CNN and LSTM layers for modulation classification, and why might this be advantageous?

What does the document suggest about the viability of combining different data representations (e.g., constellation diagrams and I/Q time-series) for improving modulation classification accuracy?

What does the document suggest about the viability of combining different data representations (e.g., constellation diagrams and I/Q time-series) for improving modulation classification accuracy?

What insights can be drawn about the role of dataset characteristics in evaluating deep learning models for RF signal classification?

What insights can be drawn about the role of dataset characteristics in evaluating deep learning models for RF signal classification?

How do the convolutional layers in CNNs contribute to RF signal modulation classification tasks, and what are their perceived benefits as per O'Shea et al.?

How do the convolutional layers in CNNs contribute to RF signal modulation classification tasks, and what are their perceived benefits as per O'Shea et al.?

Why might ResNet architectures be more suitable than CLDNN for RF signal classification in certain datasets, based on the discussion in the document?

Why might ResNet architectures be more suitable than CLDNN for RF signal classification in certain datasets, based on the discussion in the document?

What conclusions can be derived about the adaptability of RF signal classification models to different datasets and noise conditions?

What conclusions can be derived about the adaptability of RF signal classification models to different datasets and noise conditions?

What are the primary challenges in differentiating QAM modulations using deep learning models, according to the document findings?