Temporal-GradCam

TemporalGradCam

Temporal interpretation of medical image data.

Dataset

The OASIS_2D dataset contains brain X-ray images of 100 patients.

Figure: Length distribution when patients images are converted to a time series. Each patient can have multiple X-rays at different days. Most patients only have one image.

distribution

Model

Currently we have the following two models implemented

For training we freeze all layers except the output Linear layer.

Figure: Training vistory of one iteration from ResNet

gradient

Temporal Model

To create the temporal version of the OASIS model we,

Results

Following shows the average test result across all five iterations.

Model Loss Accuracy F1-score AUC
ResNet 1.32 83.87 81.32 91.68
ResNet (Seq 3) 1.45 79.87 81.24 82.24
ViT 1.22 85.77 86.50 92.08
ViT (Seq 3) 0.94 87.72 88.25 95.13

The temporal model (sequence length 3) with Vision Transformer is performing best so far.

Interpretation

Interpreting sample patient image for ResNet

Note that, this is not for the temporal model.

No Sample Gradient Shap GradCam Guided GradCam Guided Backprop
1
2
3

Interpreting sample patient image for ViT

Note that, this is not for the temporal model.

No Sample Gradient Shap GradCam Guided GradCam Guided Backprop
1
2
3

Tools

Files

The following files are available for now with pre-trained vision models for transfer learning on the medical dataset.

Literature