TemporalGradCam
Temporal interpretation of medical image data.
Dataset
The OASIS_2D dataset contains 100 brain MRI images from 53 unique patients.
- 50 images with disease (label 1) from 43 unique patients
- 50 healthy images (label 0) from 10 unique patients
- Images are 256 x 256, in color (3 channels).
- For the experiment we split the dataset 80:20 into train and test by unique patient, so the same patient never appears in both training and test. The split is stratified, so a balanced number of healthy and disease examples is present in both train and test (see the sketch after this list).
- No data augmentation is used at this point.
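A minimal sketch of such a split, using scikit-learn's `StratifiedGroupKFold` (an assumption; the report does not name the splitting utility). The `labels` and `patient_ids` arrays here are toy stand-ins for the real per-image metadata:

```python
import numpy as np
from sklearn.model_selection import StratifiedGroupKFold

# Toy per-image metadata: `labels` (0 = healthy, 1 = disease) and `patient_ids`
# (repeated for patients with several scans). Replace with the real metadata.
rng = np.random.default_rng(0)
patient_ids = rng.integers(0, 53, size=100)
labels = (patient_ids < 43).astype(int)

# 5 folds -> each test fold holds ~20% of the data; `groups` keeps every
# patient entirely on one side, and stratification balances the two classes.
splitter = StratifiedGroupKFold(n_splits=5, shuffle=True, random_state=0)
train_idx, test_idx = next(splitter.split(np.zeros((len(labels), 1)), labels,
                                          groups=patient_ids))
```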
Figure: Length distribution when each patient's images are converted to a time series. A patient can have multiple scans taken on different days; most patients have only one image.
Model
We currently have two models implemented: ResNet and ViT, both pretrained. For training we freeze all layers except the output Linear layer (see the sketch after the hyperparameter list below).
- Epochs: 25
- Learning rate: 1e-3
- Early stopping patience: 5
- Experiment iterations: 5; the whole experiment is repeated 5 times with a different random seed each time. The test results and best model checkpoints are saved.
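A minimal sketch of this setup, assuming the ResNet-18 variant (which matches the 512-dimensional penultimate layer quoted later) and torchvision's pretrained weights; the `model` built here is reused in the later sketches:

```python
import torch
import torch.nn as nn
from torchvision import models

# Load a pretrained backbone and freeze every layer.
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
for param in model.parameters():
    param.requires_grad = False

# Replace the output Linear layer; the new layer is trainable by default.
model.fc = nn.Linear(model.fc.in_features, 2)   # healthy (0) vs disease (1)

# Only the new head's parameters are passed to the optimizer.
optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
```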
Figure: Training history of one iteration of the ResNet model.
Temporal Model
To create the temporal version of the OASIS model we:
- Extracted features from the images using the pretrained models (ResNet or ViT). The extracted feature dimension equals the dimension of the layer just before the output layer:
- 512 for ResNet
- 768 for ViT
- For each sample:
  - find previous images of the same patient, up to the maximum sequence length;
  - create the time-series example of shape [seq_len, feature_dim];
  - shorter sequences are padded at the beginning to the max sequence length, and longer sequences are truncated from the beginning (the oldest images are dropped);
  - we currently use seq_len 3; around 70% of examples fall within this length, and the rest are padded.
- The model is batch-first; PyTorch does not easily support variable-length time sequences, so we use fixed-length padded sequences.
- Currently we use a simple DNN model on the temporal dataset (a sketch of the full temporal pipeline follows this list):
- max epochs 100
- learning rate 1e-3
- dropout=0.1
- hidden_size=64
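The following is a minimal sketch of the pipeline under the assumptions above: features come from the frozen ResNet `model` built earlier (the ViT case is analogous), padding uses zero vectors (the padding value is not stated in this report and is assumed here), and `TemporalDNN` is a hypothetical stand-in, since the exact architecture of the simple DNN is not specified:

```python
import torch
import torch.nn as nn

# Feature extraction: drop the classification head so the backbone returns the
# penultimate-layer activations (512-dim for ResNet-18, 768-dim for ViT).
backbone = nn.Sequential(*list(model.children())[:-1], nn.Flatten())
backbone.eval()

@torch.no_grad()
def extract_feature(image):                 # image: [1, 3, 256, 256]
    return backbone(image).squeeze(0)       # -> [feature_dim]

def build_sequence(feats, seq_len=3):
    """Turn a patient's chronological feature list (oldest first) into a
    fixed-length [seq_len, feature_dim] tensor: truncate from the beginning,
    zero-pad at the beginning (zero padding is an assumption)."""
    feats = feats[-seq_len:]                # drop the oldest images
    pad = seq_len - len(feats)
    if pad > 0:
        feats = [torch.zeros_like(feats[0])] * pad + feats
    return torch.stack(feats)

class TemporalDNN(nn.Module):
    """Hypothetical simple DNN over the flattened sequence features."""
    def __init__(self, feature_dim=512, seq_len=3, hidden_size=64, dropout=0.1):
        super().__init__()
        self.net = nn.Sequential(
            nn.Flatten(),                               # [B, seq_len * feature_dim]
            nn.Linear(seq_len * feature_dim, hidden_size),
            nn.ReLU(),
            nn.Dropout(dropout),
            nn.Linear(hidden_size, 2),                  # healthy vs disease logits
        )

    def forward(self, x):                               # x: [B, seq_len, feature_dim]
        return self.net(x)
```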
Results
The following table shows the average test results across all five iterations.
| Model | Loss | Accuracy (%) | F1-score (%) | AUC (%) |
|----------------|------|--------------|--------------|---------|
| ResNet | 1.32 | 83.87 | 81.32 | 91.68 |
| ResNet (Seq 3) | 1.45 | 79.87 | 81.24 | 82.24 |
| ViT | 1.22 | 85.77 | 86.50 | 92.08 |
| ViT (Seq 3) | 0.94 | 87.72 | 88.25 | 95.13 |
The temporal model (sequence length 3) with the Vision Transformer performs best so far.
Interpretation
Interpreting sample patient images for the ResNet model. Note that this is not for the temporal model; a Captum sketch follows the table below.
| No | Sample | Gradient Shap | GradCam | Guided GradCam | Guided Backprop |
|----|--------|---------------|---------|----------------|-----------------|
| 1  |        |               |         |                |                 |
| 2  |        |               |         |                |                 |
| 3  |        |               |         |                |                 |
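A minimal sketch of how these attributions can be computed with Captum, assuming the fine-tuned ResNet `model` from the training sketch; `image` is a hypothetical preprocessed input, and `target=1` asks for attributions toward the disease class:

```python
import torch
from captum.attr import GradientShap, LayerGradCam, GuidedGradCam, GuidedBackprop

model.eval()
image = torch.randn(1, 3, 256, 256, requires_grad=True)  # placeholder input

# GradCAM variants attribute against the last convolutional block (layer4).
gradcam   = LayerGradCam(model, model.layer4).attribute(image, target=1)
guided_gc = GuidedGradCam(model, model.layer4).attribute(image, target=1)
guided_bp = GuidedBackprop(model).attribute(image, target=1)

# GradientShap samples between the input and a black-image baseline.
grad_shap = GradientShap(model).attribute(image,
                                          baselines=torch.zeros_like(image),
                                          n_samples=5, target=1)
```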
Interpreting sample patient images for the ViT model. Note that this is not for the temporal model.
| No | Sample | Gradient Shap | GradCam | Guided GradCam | Guided Backprop |
|----|--------|---------------|---------|----------------|-----------------|
| 1  |        |               |         |                |                 |
| 2  |        |               |         |                |                 |
| 3  |        |               |         |                |                 |
Files
The following files are currently available; each applies a pre-trained vision model for transfer learning on the medical dataset.
- oasis_resnet: Run the OASIS dataset with the ResNet model.
- oasis_ViT: Run the OASIS dataset with the Vision Transformer.