Temporal interpretation of medical image data.
The OASIS_2D dataset contains brain X-ray images of 100 patients.
label 1), 43 unique patientslabel 0), 10 unique patientsFigure: Length distribution when patients images are converted to a time series. Each patient can have multiple X-rays at different days. Most patients only have one image.

Currently we have the following two models implemented
For training we freeze all layers except the output Linear layer.
Figure: Training vistory of one iteration from ResNet

To create the temporal version of the OASIS model we,
seq_len 3, around 70% examples fall within this range. Rest are padded.DNN model on the the temporal dataset.
Following shows the average test result across all five iterations.
| Model | Loss | Accuracy | F1-score | AUC |
|---|---|---|---|---|
| ResNet | 1.32 | 83.87 | 81.32 | 91.68 |
| ResNet (Seq 3) | 1.45 | 79.87 | 81.24 | 82.24 |
| ViT | 1.22 | 85.77 | 86.50 | 92.08 |
| ViT (Seq 3) | 0.94 | 87.72 | 88.25 | 95.13 |
The temporal model (sequence length 3) with Vision Transformer is performing best so far.
Note that, this is not for the temporal model.
| No | Sample | Gradient Shap | GradCam | Guided GradCam | Guided Backprop |
|---|---|---|---|---|---|
| 1 | ![]() |
![]() |
![]() |
![]() |
![]() |
| 2 | ![]() |
![]() |
![]() |
![]() |
![]() |
| 3 | ![]() |
![]() |
![]() |
![]() |
![]() |
Note that, this is not for the temporal model.
| No | Sample | Gradient Shap | GradCam | Guided GradCam | Guided Backprop |
|---|---|---|---|---|---|
| 1 | ![]() |
![]() |
![]() |
![]() |
![]() |
| 2 | ![]() |
![]() |
![]() |
![]() |
![]() |
| 3 | ![]() |
![]() |
![]() |
![]() |
![]() |
The following files are available for now with pre-trained vision models for transfer learning on the medical dataset.