Note

Go to the end to download the full example code.

Data Augmentation on BCIC IV 2a Dataset#

This tutorial shows how to train EEG deep models with data augmentation. It follows the trial-wise decoding example and also illustrates the effect of a transform on the input signals.

# Authors: Simon Brandt <simonbrandt@protonmail.com>
#          Cédric Rommel <cedric.rommel@inria.fr>
#
# License: BSD (3-clause)

Loading and preprocessing the dataset #

Loading #

from skorch.callbacks import LRScheduler
from skorch.helper import predefined_split

from braindecode import EEGClassifier
from braindecode.datasets import MOABBDataset

subject_id = 3
dataset = MOABBDataset(dataset_name="BNCI2014_001", subject_ids=[subject_id])

Preprocessing #

from numpy import multiply

from braindecode.preprocessing import (
    Preprocessor,
    exponential_moving_standardize,
    preprocess,
)

low_cut_hz = 4.0  # low cut frequency for filtering
high_cut_hz = 38.0  # high cut frequency for filtering
# Parameters for exponential moving standardization
factor_new = 1e-3
init_block_size = 1000
# Factor to convert from V to uV
factor = 1e6

preprocessors = [
    Preprocessor("pick_types", eeg=True, meg=False, stim=False),  # Keep EEG sensors
    Preprocessor(lambda data: multiply(data, factor)),  # Convert from V to uV
    Preprocessor("filter", l_freq=low_cut_hz, h_freq=high_cut_hz),  # Bandpass filter
    Preprocessor(
        exponential_moving_standardize,  # Exponential moving standardization
        factor_new=factor_new,
        init_block_size=init_block_size,
    ),
]

preprocess(dataset, preprocessors, n_jobs=-1)

BaseConcatDataset
Type	BaseConcatDataset of RawDataset
Recordings	12
Total samples	1160820
Sfreq*	250.0 Hz
Channels*	22 (22 EEG)
Ch. names*	Fz, FC3, FC1, FCz, FC2, FC4, C5, C3, C1, Cz, ... (+12 more)
Montage*	head
Duration*	386.9 s
* from first recording
Description	12 recordings × 3 columns [subject, session, run]

Extracting windows #

from braindecode.preprocessing import create_windows_from_events

trial_start_offset_seconds = -0.5
# Extract sampling frequency, check that they are same in all datasets
sfreq = dataset.datasets[0].raw.info["sfreq"]
assert all([ds.raw.info["sfreq"] == sfreq for ds in dataset.datasets])
# Calculate the trial start offset in samples.
trial_start_offset_samples = int(trial_start_offset_seconds * sfreq)

# Create windows using braindecode function for this. It needs parameters to
# define how trials should be used.
windows_dataset = create_windows_from_events(
    dataset,
    trial_start_offset_samples=trial_start_offset_samples,
    trial_stop_offset_samples=0,
    preload=True,
)

Split dataset into train and valid #

splitted = windows_dataset.split("session")
train_set = splitted["0train"]  # Session train
valid_set = splitted["1test"]  # Session evaluation

Data can be manipulated by transforms, which are callable objects. A transform is usually handled by a custom data loader, but can also be called directly on input data, as demonstrated below for illutrative purposes.

First, we need to define a Transform. Here we chose the FrequencyShift, which randomly translates all frequencies within a given range.

from braindecode.augmentation import FrequencyShift

transform = FrequencyShift(
    probability=1.0,  # defines the probability of actually modifying the input
    sfreq=sfreq,
    max_delta_freq=2.0,  # the frequency shifts are sampled now between -2 and 2 Hz
)

Manipulating one session and visualizing the transformed data #

Next, let us augment one session to show the resulting frequency shift. The data of an mne Epoch is used here to make usage of mne functions.

import numpy as np
import torch

X = np.stack([X for X, y, i in train_set.datasets[0]])
# This allows to apply the transform with a fixed shift (10 Hz) for
# visualization instead of sampling the shift randomly between -2 and 2 Hz
X_tr, _ = transform.operation(torch.as_tensor(X).float(), None, 10.0, sfreq)  # type: ignore[has-type]

The psd of the transformed session has now been shifted by 10 Hz, as one can see on the psd plot.

import matplotlib.pyplot as plt
import mne


def plot_psd(data, axis, label, color):
    psds, freqs = mne.time_frequency.psd_array_multitaper(
        data, sfreq=sfreq, fmin=0.1, fmax=100
    )
    psds = 10.0 * np.log10(psds)
    psds_mean = psds.mean(0).mean(0)
    axis.plot(freqs, psds_mean, color=color, label=label)


_, ax = plt.subplots()
plot_psd(X, ax, "original", "k")
plot_psd(X_tr.numpy(), ax, "shifted", "r")

ax.set(
    title="Multitaper PSD (gradiometers)",
    xlabel="Frequency (Hz)",
    ylabel="Power Spectral Density (dB)",
)
ax.legend()
plt.show()

Training a model with data augmentation #

Now that we know how to instantiate Transforms, it is time to learn how to use them to train a model and try to improve its generalization power. Let’s first create a model.

Create model #

The model to be trained is defined as usual.

from braindecode.models import ShallowFBCSPNet
from braindecode.util import set_random_seeds

cuda = torch.cuda.is_available()  # check if GPU is available, if True chooses to use it
device = "cuda" if cuda else "cpu"
if cuda:
    torch.backends.cudnn.benchmark = True

# Set random seed to be able to roughly reproduce results
# Note that with cudnn benchmark set to True, GPU indeterminism
# may still make results substantially different between runs.
# To obtain more consistent results at the cost of increased computation time,
# you can set `cudnn_benchmark=False` in `set_random_seeds`
# or remove `torch.backends.cudnn.benchmark = True`
seed = 20200220
set_random_seeds(seed=seed, cuda=cuda)

n_classes = 4
classes = list(range(n_classes))

# Extract number of chans and time steps from dataset
n_channels = train_set[0][0].shape[0]
n_times = train_set[0][0].shape[1]

model = ShallowFBCSPNet(
    n_chans=n_channels,
    n_outputs=n_classes,
    n_times=n_times,
    final_conv_length="auto",
)

Create an EEGClassifier with the desired augmentation #

In order to train with data augmentation, a custom data loader can be for the training. Multiple transforms can be passed to it and will be applied sequentially to the batched data within the AugmentedDataLoader object.

from braindecode.augmentation import AugmentedDataLoader, SignFlip

freq_shift = FrequencyShift(
    probability=0.5,
    sfreq=sfreq,
    max_delta_freq=2.0,  # the frequency shifts are sampled now between -2 and 2 Hz
)

sign_flip = SignFlip(probability=0.1)

transforms = [freq_shift, sign_flip]

# Send model to GPU
if cuda:
    model.cuda()

The model is now trained as in the trial-wise example. The AugmentedDataLoader is used as the train iterator and the list of transforms are passed as arguments.

lr = 0.0625 * 0.01
weight_decay = 0

batch_size = 64
n_epochs = 4

clf = EEGClassifier(
    model,
    iterator_train=AugmentedDataLoader,  # This tells EEGClassifier to use a custom DataLoader
    iterator_train__transforms=transforms,  # This sets the augmentations to use
    criterion=torch.nn.CrossEntropyLoss,
    optimizer=torch.optim.AdamW,
    train_split=predefined_split(valid_set),  # using valid_set for validation
    optimizer__lr=lr,
    optimizer__weight_decay=weight_decay,
    batch_size=batch_size,
    callbacks=[
        "accuracy",
        ("lr_scheduler", LRScheduler("CosineAnnealingLR", T_max=n_epochs - 1)),
    ],
    device=device,
    classes=classes,
)
# Model training for a specified number of epochs. `y` is None as it is already
# supplied in the dataset.
clf.fit(train_set, y=None, epochs=n_epochs)

  epoch    train_accuracy    train_loss    valid_acc    valid_accuracy    valid_loss      lr     dur
-------  ----------------  ------------  -----------  ----------------  ------------  ------  ------
      1            0.2500        1.5869       0.2500            0.2500        6.2989  0.0006  2.0041
      2            0.2500        1.2203       0.2500            0.2500        6.2202  0.0005  1.7538
      3            0.2569        1.1668       0.2431            0.2431        5.2575  0.0002  1.7702
      4            0.2569        1.1553       0.2465            0.2465        4.3377  0.0000  1.7950

<class 'braindecode.classifier.EEGClassifier'>[initialized](
  module_==================================================================================================================================================
  Layer (type (var_name):depth-idx)             Input Shape               Output Shape              Param #                   Kernel Shape
  =================================================================================================================================================
  ShallowFBCSPNet (ShallowFBCSPNet)             [1, 22, 1125]             [1, 4]                    --                        --
  ├─Ensure4d (ensuredims): 1-1                  [1, 22, 1125]             [1, 22, 1125, 1]          --                        --
  ├─Rearrange (dimshuffle): 1-2                 [1, 22, 1125, 1]          [1, 1, 1125, 22]          --                        --
  ├─CombinedConv (conv_time_spat): 1-3          [1, 1, 1125, 22]          [1, 40, 1101, 1]          36,240                    --
  ├─BatchNorm2d (bnorm): 1-4                    [1, 40, 1101, 1]          [1, 40, 1101, 1]          80                        --
  ├─Square (conv_nonlin_exp): 1-5               [1, 40, 1101, 1]          [1, 40, 1101, 1]          --                        --
  ├─AvgPool2d (pool): 1-6                       [1, 40, 1101, 1]          [1, 40, 69, 1]            --                        [75, 1]
  ├─SafeLog (pool_nonlin_exp): 1-7              [1, 40, 69, 1]            [1, 40, 69, 1]            --                        --
  ├─Dropout (drop): 1-8                         [1, 40, 69, 1]            [1, 40, 69, 1]            --                        --
  ├─Sequential (final_layer): 1-9               [1, 40, 69, 1]            [1, 4]                    --                        --
  │    └─Conv2d (conv_classifier): 2-1          [1, 40, 69, 1]            [1, 4, 1, 1]              11,044                    [69, 1]
  │    └─SqueezeFinalOutput (squeeze): 2-2      [1, 4, 1, 1]              [1, 4]                    --                        --
  │    │    └─Rearrange (squeeze): 3-1          [1, 4, 1, 1]              [1, 4, 1]                 --                        --
  =================================================================================================================================================
  Total params: 47,364
  Trainable params: 47,364
  Non-trainable params: 0
  Total mult-adds (Units.MEGABYTES): 0.01
  =================================================================================================================================================
  Input size (MB): 0.10
  Forward/backward pass size (MB): 0.35
  Params size (MB): 0.04
  Estimated Total Size (MB): 0.50
  =================================================================================================================================================,
)

In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.

EEGClassifier

iFitted

Parameters

	module	ShallowFBCSPN...') ) ) )
	callbacks	['accuracy', ('lr_scheduler', ...)]
	optimizer	<class 'torch....adamw.AdamW'>
	lr	0.01
	max_epochs	10
	batch_size	64
	iterator_train	<class 'brain...edDataLoader'>
	iterator_valid	<class 'torch...r.DataLoader'>
	dataset	<class 'skorc...aset.Dataset'>
	train_split	functools.par...: 72, 3: 72}))
	predict_nonlinearity	'auto'
	warm_start	False
	verbose	1
	device	'cpu'
	compile	False
	use_caching	'auto'
	torch_load_kwargs	None
	_params_to_validate	{'iterator_train__drop_last', 'iterator_train__shuffle', 'iterator_train__transforms', 'optimizer__lr', 'optimizer__weight_decay'}
	iterator_train__transforms	[FrequencyShift(), SignFlip()]
	optimizer__lr	0.000625
	optimizer__weight_decay	0
	classes	[0, 1, ...]
	callbacks__epoch_timer	<skorch.callb...x7fded0ad33e0>
	callbacks__train_loss	<skorch.callb...x7fde7f24d190>
	callbacks__train_loss__scoring	<function tra...x7fdeb34feb60>
	callbacks__train_loss__lower_is_better	True
	callbacks__train_loss__on_train	True
	callbacks__train_loss__name	'train_loss'
	callbacks__train_loss__target_extractor	<function noo...x7fdeb34fe980>
	callbacks__train_loss__use_caching	True
	callbacks__valid_loss	<skorch.callb...x7fded1304830>
	callbacks__valid_loss__scoring	<function val...x7fdeb34fec00>
	callbacks__valid_loss__lower_is_better	True
	callbacks__valid_loss__on_train	False
	callbacks__valid_loss__name	'valid_loss'
	callbacks__valid_loss__target_extractor	<function noo...x7fdeb34fe980>
	callbacks__valid_loss__use_caching	True
	callbacks__valid_acc	<skorch.callb...x7fded0af0050>
	callbacks__valid_acc__scoring	'accuracy'
	callbacks__valid_acc__lower_is_better	False
	callbacks__valid_acc__on_train	False
	callbacks__valid_acc__name	'valid_acc'
	callbacks__valid_acc__target_extractor	<function to_...x7fdeb34fdee0>
	callbacks__valid_acc__use_caching	True
	callbacks__train_accuracy	<braindecode....x7fded08fcda0>
	callbacks__train_accuracy__scoring	'accuracy'
	callbacks__train_accuracy__lower_is_better	False
	callbacks__train_accuracy__on_train	True
	callbacks__train_accuracy__name	'train_accuracy'
	callbacks__train_accuracy__target_extractor	<function to_...x7fdeb34fdee0>
	callbacks__train_accuracy__use_caching	False
	callbacks__valid_accuracy	<skorch.callb...x7fde7f48c860>
	callbacks__valid_accuracy__scoring	'accuracy'
	callbacks__valid_accuracy__lower_is_better	False
	callbacks__valid_accuracy__on_train	False
	callbacks__valid_accuracy__name	'valid_accuracy'
	callbacks__valid_accuracy__target_extractor	<function to_...x7fdeb34fdee0>
	callbacks__valid_accuracy__use_caching	True
	callbacks__lr_scheduler	<skorch.callb...x7fded08fc140>
	callbacks__lr_scheduler__policy	'CosineAnnealingLR'
	callbacks__lr_scheduler__monitor	'train_loss'
	callbacks__lr_scheduler__event_name	'event_lr'
	callbacks__lr_scheduler__step_every	'epoch'
	callbacks__lr_scheduler__T_max	3
	callbacks__print_log	<skorch.callb...x7fded08690d0>
	callbacks__print_log__keys_ignored	None
	callbacks__print_log__sink	<built-in function print>
	callbacks__print_log__tablefmt	'simple'
	callbacks__print_log__floatfmt	'.4f'
	callbacks__print_log__stralign	'right'
	criterion	<class 'torch...sEntropyLoss'>
	cropped	False
	iterator_train__shuffle	True
	iterator_train__drop_last	True
	aggregate_predictions	True

Fitted attributes

Name	Type	Value
callbacks_	list	[('ep...er', <skorch.callb...x7fde8e8d3170>), ('tr...ss', <skorch.callb...x7fded0b90fb0>), ('va...ss', <skorch.callb...x7fde745223c0>), ('va...cc', <skorch.callb...x7fde7f48ce00>), ...]
classes_	ndarray[int64](4,)	[0,1,2,3]
criterion_	CrossEntropyLoss	CrossEntropyLoss()
cuda_dependent_attributes_	list	['module_', 'cr...n_', 'op...r_']
history_	History	[{'batches': ...ent_lr': 0.0}]
init_context_	NoneType	None
initialized_	bool	True
module_	ShallowFBCSPNet	ShallowFBCSPN...') ) ) )
optimizer_	AdamW	AdamW ( Param...ght_decay: 0 )
prefixes_	list	['it...in', 'it...id', 'ca...ks', 'dataset', ...]
signal_args_set_	bool	True
virtual_params_	dict	{'lr': functools.par...e='optimizer'), 'op..._': functools.par...e='optimizer'), 'op..._': functools.par...e='optimizer')}

Manually composing Transforms #

It would be equivalent (although more verbose) to pass to EEGClassifier a composition of the same transforms:

from braindecode.augmentation import Compose

composed_transforms = Compose(transforms=transforms)

Setting the data augmentation at the Dataset level #

Also note that it is also possible for most of the transforms to pass them directly to the WindowsDataset object through the transform argument, as most commonly done in other libraries. However, it is advised to use the AugmentedDataLoader as above, as it is compatible with all transforms and can be more efficient.

train_set.transform = composed_transforms

Total running time of the script: (0 minutes 29.305 seconds)

Estimated memory usage: 581 MB

Gallery generated by Sphinx-Gallery