#
becnhmarking
Here are 2 public repositories matching this topic...
CPU-optimized RAG pipeline reducing latency 2.7× (247ms → 92ms). Implements caching, filtering, quantization for production. Complete with FastAPI, Docker, benchmarks, investor materials. The engineering showcase that sells itself.
docker caching dockerfile sales-engineering sqlite showcase embeddings low-latency production-ready demonstration semantic-search faiss fastapi retrieval-augmented-generation cpu-only rag-optimization ai-ml-performance-tuning becnhmarking
-
Updated
Jan 24, 2026 - Python
Improve this page
Add a description, image, and links to the becnhmarking topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the becnhmarking topic, visit your repo's landing page and select "manage topics."