Multimodal search engine using CLIP embeddings for bidirectional image-text retrieval.
search-engine scikit-learn foss jupyter-notebook python3 memory-management image-to-text batch-processing text-to-image fp16 gpu-optimization multimodal-deep-learning flickr8k-dataset bidirectional-search local-first sentence-transformers poetry-python gradio-interface clip-vit-b-16
-
Updated
Sep 9, 2025 - Jupyter Notebook