SmartGallery

A sophisticated AI-powered image gallery application that enables intelligent searching through your photo collection using natural language queries. SmartGallery combines computer vision, embeddings, and a user-friendly desktop interface to organize and discover images effortlessly.

Features

AI-Powered Search: Search images using natural language descriptions powered by CLIP embeddings
Automatic Captions: Generate descriptive captions for images using Vision Encoder-Decoder models
Smart Tagging: Automatically extract relevant tags from image captions using KeyBERT
Album Organization: Organize images into albums based on folder structure
Infinite Scroll: Smooth loading of image thumbnails with infinite scroll functionality
Full-Screen Viewer: View images in a dedicated full-screen dialog
Real-Time Updates: Monitor directories for new/deleted images and update embeddings on the fly
GPU Support: Utilizes CUDA for faster processing when available

Demo & Screenshots

Demo.mp4

Main Gallery View

Shows album-based navigation with infinite scrolling thumbnails.

AI Search Results

Natural language search example using query "cat" filtered inside albums.

Full Image Viewer

Click any image to open it in a full-size dedicated viewer.

Architecture

SmartGallery uses a multi-process architecture for optimal performance:

Main Application (app.py): PyQt5-based GUI for browsing and searching
Search Server (search_server.py): Handles AI-powered image search queries
Encoder Server (encoder_server.py): Generates embeddings for new images in real-time
Pipeline (image_captioning_clip_pipeline.py): Preprocessing script to generate captions and embeddings
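
To illustrate the multi-process layout, here is a minimal sketch of a GUI-side process starting a helper process and exchanging one line of text with it. The child code is a stand-in echo loop, not the real search_server.py. The stdin/stdout transport and the names start_helper, ask, and stop_helper are assumptions, since this README does not describe how app.py talks to its servers.

```python
import subprocess
import sys

# Stand-in for a server script: reads one query per line and answers on stdout.
CHILD_CODE = r"""
import sys
for line in sys.stdin:
    query = line.strip()
    print(f"results for: {query}", flush=True)
"""

def start_helper():
    """Start the helper as a separate Python process with text pipes."""
    return subprocess.Popen(
        [sys.executable, "-u", "-c", CHILD_CODE],
        stdin=subprocess.PIPE,
        stdout=subprocess.PIPE,
        text=True,
    )

def ask(proc, query):
    """Send one query line and read back one reply line."""
    proc.stdin.write(query + "\n")
    proc.stdin.flush()
    return proc.stdout.readline().strip()

def stop_helper(proc):
    """Close stdin so the child's loop ends, then wait for it to exit."""
    proc.stdin.close()
    proc.wait(timeout=10)
    proc.stdout.close()
```

Keeping model loading in a separate process means the GUI event loop is never blocked by inference, which is presumably the motivation for the split described above.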

System Requirements

Dependencies

Python 3.8+
PyQt5: Desktop GUI framework
PyTorch: Deep learning framework
CLIP: Multi-modal embeddings
FAISS: Efficient similarity search
Transformers: Pre-trained models
KeyBERT: Keyword extraction
NumPy, Pandas, Pillow: Data processing

Hardware

GPU with CUDA support (optional but recommended)
Minimum 8GB RAM for processing large image collections
SSD storage for faster image loading

Installation

Step 1: Clone the Repository

git clone https://github.com/Uni-Creator/SmartGallery.git
cd SmartGallery

Step 2: Install Dependencies

pip install -r requirements.txt

Step 3: Prepare Your Data

Organize your images in a folder structure:

PHOTOS/
├── Category1/
│   ├── Folder1/
│   │   ├── image1.jpg
│   │   └── image2.jpg
│   └── Folder2/
│       └── image3.jpg
└── Category2/
    └── image4.jpg
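
As a rough sketch of how a folder tree like this could be turned into albums, the function below groups image files by their parent folder. The function name group_into_albums and the choice of one album per leaf folder are assumptions for illustration; app.py may group images differently. The extension set follows the Supported Image Formats list in this README.

```python
from collections import defaultdict
from pathlib import Path

# Extensions taken from the "Supported Image Formats" list in this README.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp", ".webp"}

def group_into_albums(base_folder):
    """Map each folder (relative to base_folder) to its sorted image paths."""
    base = Path(base_folder)
    albums = defaultdict(list)
    for path in base.rglob("*"):
        # Keep only files with a supported extension, case-insensitively.
        if path.is_file() and path.suffix.lower() in IMAGE_EXTENSIONS:
            album = path.parent.relative_to(base).as_posix()
            albums[album].append(path)
    return {name: sorted(paths) for name, paths in sorted(albums.items())}
```

With the tree above, this would yield albums named Category1/Folder1, Category1/Folder2, and Category2.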

Step 4: Generate Initial Embeddings

Before running the application, generate embeddings for your images:

python image_captioning_clip_pipeline.py

This will:

Generate captions for each image
Extract tags from captions
Create CLIP embeddings for both images and captions
Save outputs to the data/ and embeddings/ directories
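
The project keeps both raw and normalized image embeddings. The sketch below shows what L2 normalization does and why it helps: once every row has unit length, a plain dot product equals cosine similarity. Random vectors stand in for real CLIP output, and the 512-dimensional size and the helper name l2_normalize are assumptions, not details taken from the pipeline script.

```python
import numpy as np

def l2_normalize(embeddings, eps=1e-12):
    """Scale each row to unit length; eps guards against all-zero rows."""
    embeddings = np.asarray(embeddings, dtype=np.float32)
    norms = np.linalg.norm(embeddings, axis=1, keepdims=True)
    return embeddings / np.maximum(norms, eps)

# Random vectors stand in for real CLIP image embeddings (size is assumed).
rng = np.random.default_rng(0)
raw = rng.normal(size=(5, 512)).astype(np.float32)
normalized = l2_normalize(raw)

# After normalization, a dot product between rows is their cosine similarity.
cosine = normalized @ normalized.T
```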

Step 5: Run the Application

Before launching, update the BASE_FOLDER variable in app.py to point to your photo collection:

BASE_FOLDER = r"D:\Your\Photo\Path"  # Update this path

Then start the application:

python app.py

Usage

Basic Navigation

Browse Albums: Select albums from the left sidebar to view grouped images
View Thumbnails: Scroll through image thumbnails in a 4-column grid
Open Full Image: Click any thumbnail to view the full-resolution image

AI Search

Enter a natural language query in the search box (e.g., "sunset over water", "people at beach")
Press ENTER to search
Results will be filtered from the current album
Clear the search box and press ENTER to show all images again

Supported Image Formats

JPG/JPEG
PNG
BMP
WebP

Project Structure

SmartGallery/
├── app.py                                   # Main PyQt5 GUI application
├── search_server.py                         # Search query processing server
├── encoder_server.py                        # Real-time embedding encoder
├── image_captioning_clip_pipeline.py        # Preprocessing pipeline
├── text_search.py                           # Search engine logic
├── pre-processing.py                        # Data preprocessing utilities
├── requirements.txt                         # Python dependencies
├── data/
│   ├── final_cleaned_data.csv              # Original image metadata
│   └── images_with_captions_and_tags.csv   # Captions and tags
└── embeddings/
    ├── image_embeddings.npy                # Raw image embeddings
    ├── image_embeddings_normalized.npy     # Normalized image embeddings
    ├── caption_embeddings.npy              # Caption embeddings
    └── image_faiss_index.idx               # FAISS index for fast search

Key Components

ThumbnailWorker (app.py)

Asynchronous background worker for generating and caching image thumbnails to prevent UI freezing.
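
The general pattern behind such a worker can be sketched with the standard library alone: a queue feeds a background thread, and finished results land in a cache the UI can poll. This is not the actual ThumbnailWorker, which most likely uses Qt threading and image scaling. The class name BackgroundWorker and its methods are hypothetical.

```python
import queue
import threading

class BackgroundWorker:
    """Runs a job function on queued keys in a daemon thread and caches results."""

    def __init__(self, job):
        self._job = job
        self._tasks = queue.Queue()
        self._cache = {}
        self._lock = threading.Lock()
        self._thread = threading.Thread(target=self._run, daemon=True)
        self._thread.start()

    def submit(self, key):
        """Queue a key unless its result is already cached."""
        with self._lock:
            if key in self._cache:
                return
        self._tasks.put(key)

    def get(self, key):
        """Return the cached result, or None if it is not ready yet."""
        with self._lock:
            return self._cache.get(key)

    def wait(self):
        """Block until every queued key has been processed."""
        self._tasks.join()

    def _run(self):
        while True:
            key = self._tasks.get()
            try:
                result = self._job(key)
                with self._lock:
                    self._cache[key] = result
            except Exception:
                # A failed item is skipped so the worker thread keeps running.
                pass
            finally:
                self._tasks.task_done()
```

In a GUI, the job would load and downscale an image, and the main thread would call get() when painting, so slow disk reads never stall the interface.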

GalleryWindow (app.py)

Main PyQt5 window managing:

Album selection and navigation
Grid layout with infinite scroll
Search interface
Subprocess management (search & encoder servers)

CLIPSearchEngine (text_search.py)

Handles semantic search using CLIP embeddings with configurable similarity thresholds.
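
A similarity threshold in this kind of search can be sketched as follows: score every image against the query by cosine similarity, keep the best few, and drop anything below the cutoff. The toy 3-dimensional vectors stand in for CLIP embeddings, and the function name search and its threshold and top_k parameters are illustrative assumptions, not the CLIPSearchEngine API.

```python
import numpy as np

def search(query_vec, image_vecs, paths, threshold=0.2, top_k=5):
    """Return (path, score) pairs with cosine similarity >= threshold, best first."""
    q = query_vec / np.linalg.norm(query_vec)
    m = image_vecs / np.linalg.norm(image_vecs, axis=1, keepdims=True)
    scores = m @ q                       # cosine similarity per image
    order = np.argsort(-scores)[:top_k]  # indices of the highest scores
    return [(paths[i], float(scores[i])) for i in order if scores[i] >= threshold]

# Toy data: the first two images point in nearly the same direction as the query.
paths = ["cat.jpg", "kitten.jpg", "car.jpg"]
image_vecs = np.array([[1.0, 0.0, 0.0],
                       [0.9, 0.1, 0.0],
                       [0.0, 1.0, 0.0]])
query_vec = np.array([1.0, 0.0, 0.0])
results = search(query_vec, image_vecs, paths)
```

Raising the threshold returns fewer, more relevant matches. Lowering it returns more results at the cost of precision.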

ImageEncoderServer (encoder_server.py)

Real-time service that:

Listens for new image paths
Generates embeddings using CLIP
Updates FAISS index
Maintains metadata CSV files
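
One simple way to find the image paths such a service needs to process is to compare two directory snapshots. The sketch below uses only the standard library. The helper names snapshot and diff_snapshots are hypothetical, and the real encoder server may detect changes differently, for example through a file-system watcher.

```python
from pathlib import Path

# Extensions taken from the "Supported Image Formats" list in this README.
IMAGE_EXTENSIONS = {".jpg", ".jpeg", ".png", ".bmp", ".webp"}

def snapshot(folder):
    """Return the set of image paths currently under folder."""
    return {
        p for p in Path(folder).rglob("*")
        if p.is_file() and p.suffix.lower() in IMAGE_EXTENSIONS
    }

def diff_snapshots(before, after):
    """Return (added, removed) paths between two snapshots, each sorted."""
    return sorted(after - before), sorted(before - after)
```

Added paths would be sent for encoding, while removed paths identify index entries and CSV rows that are now stale.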

Configuration

Adjusting Search Sensitivity

In text_search.py, modify the alpha parameter in the search methods to adjust filtering sensitivity.

Thumbnail Size

Edit in app.py ThumbnailWorker.process_queue():

image = image.scaled(200, 200, ...)  # Adjust dimensions

Grid Layout

Modify in app.py load_batch():

if col >= 4:  # Change 4 to desired columns per row
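
The column check above wraps to a new row once a row is full. The same placement can be computed directly from a thumbnail's index, as in this small sketch. The function name grid_position is hypothetical and not part of app.py.

```python
def grid_position(index, columns=4):
    """Return the (row, col) cell for the index-th thumbnail, filling row by row."""
    return divmod(index, columns)
```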

Batch Size

In app.py GalleryWindow.__init__():

self.batch_size = 100  # Images loaded per scroll
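
Infinite scroll with a fixed batch size amounts to slicing the image list a chunk at a time. The sketch below assumes the gallery tracks how many images are already loaded. The function name next_batch is hypothetical.

```python
def next_batch(paths, loaded, batch_size=100):
    """Return the next slice of paths to load and the updated loaded count."""
    batch = paths[loaded:loaded + batch_size]
    return batch, loaded + len(batch)
```

Each time the scroll bar nears the bottom, the gallery would call this once. An empty batch signals that every image has been shown.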

Performance Tips

Use GPU: Ensure CUDA is properly installed for ~5-10x faster processing
Normalize Embeddings: Pre-normalized embeddings improve search speed
FAISS Optimization: For large collections (>100k images), consider GPU-accelerated FAISS
Caching: Thumbnails are cached in memory; reduce the batch size on slower or memory-constrained systems

Troubleshooting

Search Server Not Starting

Verify search_server.py can run independently: python search_server.py
Check that the embedding files exist in the embeddings/ directory
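
The file check can be scripted. This sketch lists which of the files named under Project Structure are absent. Whether the search server needs all four is an assumption, and the function name missing_embedding_files is hypothetical.

```python
from pathlib import Path

# File names taken from the Project Structure listing in this README.
EXPECTED_FILES = [
    "image_embeddings.npy",
    "image_embeddings_normalized.npy",
    "caption_embeddings.npy",
    "image_faiss_index.idx",
]

def missing_embedding_files(embeddings_dir="embeddings"):
    """Return the expected file names that are not present in embeddings_dir."""
    base = Path(embeddings_dir)
    return [name for name in EXPECTED_FILES if not (base / name).is_file()]
```

An empty list means every expected file is in place. Otherwise, rerunning the pipeline script should regenerate the missing ones.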

Out of Memory Errors

Reduce BATCH_SIZE in image_captioning_clip_pipeline.py
Process images in smaller chunks
Reduce thumbnail batch size

CUDA Not Available

Install PyTorch with CUDA support: pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118
Verify NVIDIA drivers are installed
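
To confirm what PyTorch can see, a quick check like the one below may help. It uses torch.cuda.is_available() and falls back to the CPU when PyTorch is missing or no GPU is visible. The function name pick_device is hypothetical.

```python
def pick_device():
    """Return "cuda" when PyTorch reports a usable GPU, otherwise "cpu"."""
    try:
        import torch
    except ImportError:
        # PyTorch is not installed, so only the CPU path is possible.
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"

print(pick_device())
```

If this prints cpu on a machine with an NVIDIA GPU, the installed PyTorch build or the driver is the likely culprit.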

Missing Embeddings

Run image_captioning_clip_pipeline.py to generate missing embeddings
Ensure CSV file paths are correct

Future Enhancements

[❌] Multi-threaded image processing pipeline
[❌] Web-based interface (Flask/React)
[✅] Batch image uploading with automatic processing
[❌] Advanced filtering and faceted search
[❌] Image clustering and recommendations
[❌] Database backend (PostgreSQL + pgvector)
[❌] REST API for programmatic access
[❌] Image deduplication detection
[❌] Automatically remove embeddings of deleted images

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit pull requests or open issues for bugs and feature requests.

Support

For issues, questions, or suggestions, please open an issue on the GitHub repository.

Acknowledgments

OpenAI CLIP for multi-modal embeddings
Facebook FAISS for efficient similarity search
PyQt5 for the GUI framework
Hugging Face Transformers for pre-trained models
