A deep learning system built to classify dog breeds using convolutional neural networks (CNNs). The project explores transfer learning, data augmentation, and model interpretability techniques.
This project tackles the real-world challenge of fine-grained visual classification with limited labeled data. Starting from a CNN trained from scratch, the system is improved through transfer learning from a multi-class breed classifier and a custom data augmentation pipeline. A variety of architectures and training optimizations were tested to maximize generalization performance.
Key components include:
- Custom CNN architecture with multiple convolutional and pooling layers
- Transfer learning from a multi-breed classifier to a binary classifier
- Automated data augmentation system (rotation, grayscale, etc.)
- Grad-CAM visualizations for model interpretability
- AUROC-based model evaluation and early stopping
- Training/inference automation with GPU offloading and batch size tuning
- CNN Design: Designed and trained convolutional neural networks from scratch, starting with a baseline 3-layer CNN (Conv -> Pool -> Conv -> Pool -> Conv -> FC) and iteratively improving performance through deeper and wider architectures. Explored the impact of increasing filter counts, adjusting layer configurations, and fine-tuning versus freezing layers in transfer learning. (A minimal architecture sketch appears after this list.)
- Transfer Learning: Leveraged knowledge from a source classifier trained on 8 other breeds to enhance binary classification on Collies vs. Golden Retrievers (see the freeze-and-replace sketch below).
- Data Augmentation: Implemented rotation, grayscale transformations, and custom augmentation combinations to improve generalization (see the transform pipeline sketch below).
- Model Evaluation: Early stopping based on validation loss, AUROC as the primary evaluation metric, and comprehensive training/test curve analysis (see the evaluation sketch below).
- Interpretability: Applied Grad-CAM to visualize which features the model focuses on during classification (see the Grad-CAM sketch below).
- Workflow Automation: Developed a fully modular and GPU-optimized training pipeline that streamlines challenge predictions and evaluation.
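Below is a minimal sketch of the baseline Conv -> Pool -> Conv -> Pool -> Conv -> FC architecture. The 64x64 input size, filter counts, and kernel sizes are illustrative assumptions, not the project's exact settings.

```python
import torch
import torch.nn as nn

class BaselineCNN(nn.Module):
    """Baseline Conv -> Pool -> Conv -> Pool -> Conv -> FC network.

    Filter counts and the 64x64 input size are illustrative assumptions,
    not the exact values used in the project.
    """
    def __init__(self, num_classes: int = 2):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(3, 16, kernel_size=5, padding=2),    # Conv 1
            nn.ReLU(),
            nn.MaxPool2d(2),                               # Pool 1
            nn.Conv2d(16, 32, kernel_size=5, padding=2),   # Conv 2
            nn.ReLU(),
            nn.MaxPool2d(2),                               # Pool 2
            nn.Conv2d(32, 64, kernel_size=5, padding=2),   # Conv 3
            nn.ReLU(),
        )
        self.classifier = nn.Linear(64 * 16 * 16, num_classes)  # FC head

    def forward(self, x):
        x = self.features(x)            # (N, 64, 16, 16) for 64x64 inputs
        x = torch.flatten(x, 1)
        return self.classifier(x)
```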
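The transfer-learning step can be sketched as loading the 8-breed source weights, freezing the convolutional backbone, and replacing the final FC layer with a 2-way head (the "FC layer only" variant in the results table). This builds on the `BaselineCNN` sketch above; the checkpoint path is a placeholder, not the project's actual file.

```python
import torch
import torch.nn as nn

def build_transfer_model(source_ckpt: str = "checkpoints/source_8breeds.pt") -> nn.Module:
    """Adapt the 8-breed source classifier to the Collie vs. Golden Retriever task.

    The checkpoint path is a placeholder; only the new FC layer is trained
    (the "FC layer only" variant from the results table).
    """
    model = BaselineCNN(num_classes=8)                 # same architecture as the source task
    model.load_state_dict(torch.load(source_ckpt, map_location="cpu"))

    for param in model.features.parameters():          # freeze the convolutional backbone
        param.requires_grad = False

    in_features = model.classifier.in_features
    model.classifier = nn.Linear(in_features, 2)       # fresh 2-way head, trained from scratch
    return model
```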
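An illustrative torchvision version of the augmentation pipeline is shown below; the rotation range, grayscale probability, and resize target are assumed values rather than the project's exact configuration.

```python
from torchvision import transforms

# Illustrative augmentation pipeline; the rotation range and grayscale
# probability are assumptions, not the project's exact settings.
train_transform = transforms.Compose([
    transforms.Resize((64, 64)),                 # match the assumed input size above
    transforms.RandomRotation(degrees=15),       # rotation augmentation
    transforms.RandomGrayscale(p=0.5),           # grayscale augmentation
    transforms.ToTensor(),
])

val_transform = transforms.Compose([             # no augmentation at evaluation time
    transforms.Resize((64, 64)),
    transforms.ToTensor(),
])
```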
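A sketch of the evaluation and early-stopping logic: AUROC is computed with scikit-learn and training halts when validation loss stops improving. The optimizer choice, learning rate, and patience value are assumptions.

```python
import numpy as np
import torch
import torch.nn.functional as F
from sklearn.metrics import roc_auc_score

def evaluate(model, loader, device="cpu"):
    """Return (mean cross-entropy loss, AUROC) over a data loader."""
    model.eval()
    losses, scores, labels = [], [], []
    with torch.no_grad():
        for x, y in loader:
            x, y = x.to(device), y.to(device)
            logits = model(x)
            losses.append(F.cross_entropy(logits, y).item())
            scores.append(torch.softmax(logits, dim=1)[:, 1].cpu())  # P(positive class)
            labels.append(y.cpu())
    auroc = roc_auc_score(torch.cat(labels).numpy(), torch.cat(scores).numpy())
    return float(np.mean(losses)), auroc

def fit(model, train_loader, val_loader, max_epochs=50, patience=5, device="cpu"):
    """Train with early stopping on validation loss; optimizer, lr, and patience are assumed values."""
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
    best_loss, stale_epochs = float("inf"), 0
    for epoch in range(max_epochs):
        model.train()
        for x, y in train_loader:
            x, y = x.to(device), y.to(device)
            optimizer.zero_grad()
            F.cross_entropy(model(x), y).backward()
            optimizer.step()
        val_loss, val_auroc = evaluate(model, val_loader, device)
        print(f"epoch {epoch}: val loss {val_loss:.4f}, val AUROC {val_auroc:.4f}")
        if val_loss < best_loss:
            best_loss, stale_epochs = val_loss, 0
            torch.save(model.state_dict(), "best.pt")    # keep the best checkpoint
        else:
            stale_epochs += 1
            if stale_epochs >= patience:
                break                                    # early stopping
```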
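A compact Grad-CAM sketch using forward and backward hooks follows; it is a generic re-implementation of the standard algorithm under assumed shapes, not the project's exact code.

```python
import torch
import torch.nn.functional as F

def grad_cam(model, image, target_class, conv_layer):
    """Minimal Grad-CAM sketch: heatmap of where `conv_layer` activations
    drive the score for `target_class`. `image` is a (1, C, H, W) tensor.
    """
    image = image.clone().requires_grad_(True)   # ensure gradients reach the conv layer
    activations, gradients = {}, {}

    def save_activation(module, inp, out):
        activations["value"] = out.detach()

    def save_gradient(module, grad_in, grad_out):
        gradients["value"] = grad_out[0].detach()

    fwd = conv_layer.register_forward_hook(save_activation)
    bwd = conv_layer.register_full_backward_hook(save_gradient)

    model.eval()
    logits = model(image)
    model.zero_grad()
    logits[0, target_class].backward()           # gradient of the class score
    fwd.remove()
    bwd.remove()

    weights = gradients["value"].mean(dim=(2, 3), keepdim=True)   # global-average-pool the gradients
    cam = F.relu((weights * activations["value"]).sum(dim=1))     # weighted sum of feature maps
    cam = cam / (cam.max() + 1e-8)                                # normalize to [0, 1]
    return F.interpolate(cam.unsqueeze(1), size=image.shape[-2:],
                         mode="bilinear", align_corners=False)[0, 0]
```

For the `BaselineCNN` sketch above, the last convolution is `model.features[6]`, so a call might look like `grad_cam(model, img, target_class=1, conv_layer=model.features[6])`.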
The final model achieved a substantial boost on held-out test data through a combination of transfer learning and carefully tuned augmentation: test AUROC rose from 0.6552 for the from-scratch CNN to 0.8776 for the best variant in the table below.
| Model Variant | Train AUROC | Val AUROC | Test AUROC |
|---|---|---|---|
| CNN (from scratch) | 0.9793 | 0.9308 | 0.6552 |
| Transfer Learning (FC layer only) | 0.8732 | 0.8782 | 0.8776 |
| Grayscale Augmentation Only | 0.8844 | 0.7929 | 0.7776 |
| Rotation + Grayscale Augmentation | 0.9764 | 0.9198 | 0.7260 |
Grad-CAM confirmed early hypotheses that background elements (like grass) were driving predictions. Augmentation helped the model shift its focus toward more meaningful features.
Dozens of configurations were tested and benchmarked. Highlights include:
- Model depth vs. width tradeoffs
- Filter scaling and receptive field tuning
- Batch size scaling for training stability
- Custom learning rate schedules (an illustrative scheduler sketch follows this list)
- Modular architecture with script files for flexible experimentation
- GPU memory usage benchmarking for model variants
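As one illustration of the learning-rate-schedule experiments, a step decay can be wired in with PyTorch's built-in scheduler; the step size and decay factor shown are assumptions, and the project's custom schedules may differ.

```python
import torch

model = BaselineCNN()  # from the architecture sketch above
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
# Step decay: multiply the learning rate by 0.1 every 10 epochs (assumed values).
scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.1)

for epoch in range(30):
    # ... one training epoch over the data loader goes here ...
    scheduler.step()
    print(f"epoch {epoch}: lr = {scheduler.get_last_lr()[0]:.1e}")
```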
- Transfer learning can dramatically improve performance even on a binary classification task.
- Backgrounds in training images can bias CNNs - visualizations and data augmentation are key to overcoming this.
- Grayscale augmentation, despite reducing color variance, forced the model to focus on shape and structure, improving test generalization.
- With a modular pipeline and thoughtful experimentation, significant gains are possible even with limited data.
The entire pipeline for model training, evaluation, and challenge prediction is wrapped into a single customizable script, `train_transfer_learning_custom.py`, enabling rapid iteration and experimentation.
Built with PyTorch, pandas, and scikit-learn.