Temporal interpretation of medical image data.
The OASIS_2D
dataset contains brain X-ray images of 100 patients.
label 1
), 43 unique patientslabel 0
), 10 unique patientsFigure: Length distribution when patients images are converted to a time series. Each patient can have multiple X-rays at different days. Most patients only have one image.
Currently we have the following two models implemented
For training we freeze all layers except the output Linear layer
.
Figure: Training vistory of one iteration from ResNet
To create the temporal version of the OASIS model we,
seq_len 3
, around 70% examples fall within this range. Rest are padded.DNN
model on the the temporal dataset.
Following shows the average test result across all five iterations.
Model | Loss | Accuracy | F1-score | AUC |
---|---|---|---|---|
ResNet | 1.32 | 83.87 | 81.32 | 91.68 |
ResNet (Seq 3) | 1.45 | 79.87 | 81.24 | 82.24 |
ViT | 1.22 | 85.77 | 86.50 | 92.08 |
ViT (Seq 3) | 0.94 | 87.72 | 88.25 | 95.13 |
The temporal model (sequence length 3) with Vision Transformer is performing best so far.
Note that, this is not for the temporal model.
No | Sample | Gradient Shap | GradCam | Guided GradCam | Guided Backprop |
---|---|---|---|---|---|
1 | ![]() |
![]() |
![]() |
![]() |
![]() |
2 | ![]() |
![]() |
![]() |
![]() |
![]() |
3 | ![]() |
![]() |
![]() |
![]() |
![]() |
Note that, this is not for the temporal model.
No | Sample | Gradient Shap | GradCam | Guided GradCam | Guided Backprop |
---|---|---|---|---|---|
1 | ![]() |
![]() |
![]() |
![]() |
![]() |
2 | ![]() |
![]() |
![]() |
![]() |
![]() |
3 | ![]() |
![]() |
![]() |
![]() |
![]() |
The following files are available for now with pre-trained vision models for transfer learning on the medical dataset.