Pytorch Malaria classifier #43
Open
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Updated environment file, added training, data prep, test prep and pytorch model.
This binary classifier can be used to analyze benign and malignant microscopy images (binary so Infected vs Not Infected of Thin or Thick smears) from the National Library of Medicine dataset at https://lhncbc.nlm.nih.gov/LHC-research/LHC-projects/image-processing/malaria-datasheet.html. This model achieved a validation accuracy in the region of 90% on the augmented dataset (mainly geometric transformations such as rotations/horizontal and vertical flips of the original images) that was used to train it.
The RGB image needs to be reduced to size 50x50x3 before being analyzed by the model.
It is basically the same neural network architecture as proposed by https://www.kaggle.com/code/kushal1996/detecting-malaria-cnn/notebook adapted to Pytorch with some finetuning.