Create benchmarks for the RAG pipeline:
- Embedding generation speed
- Vector search latency (Qdrant)
- Chunking strategy comparison
- Reranker effectiveness metrics
- End-to-end retrieval accuracy
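
As a starting point, the items above could share two small building blocks: a latency timer for the speed measurements and a pair of ranking metrics for the accuracy measurements. The sketch below is a minimal, standard-library-only illustration, not a definitive harness. The names `measure_latency`, `recall_at_k`, and `reciprocal_rank` are hypothetical, and the code assumes each pipeline stage can be wrapped in a zero-argument callable and that a labeled set of relevant document ids exists per query.

```python
import math
import statistics
import time
from typing import Callable, Sequence, Set


def measure_latency(fn: Callable[[], object], runs: int = 20, warmup: int = 3) -> dict:
    """Call fn repeatedly and summarize wall-clock latency in milliseconds."""
    for _ in range(warmup):
        fn()  # untimed calls to warm caches and connections
    samples = []
    for _ in range(runs):
        start = time.perf_counter()
        fn()
        samples.append((time.perf_counter() - start) * 1000.0)
    samples.sort()
    # Nearest-rank 95th percentile over the sorted samples.
    p95_index = max(0, math.ceil(0.95 * len(samples)) - 1)
    return {
        "mean_ms": statistics.mean(samples),
        "p50_ms": statistics.median(samples),
        "p95_ms": samples[p95_index],
    }


def recall_at_k(retrieved: Sequence[str], relevant: Set[str], k: int) -> float:
    """Fraction of the relevant ids that appear among the top-k retrieved ids."""
    if not relevant:
        return 0.0
    return len(set(retrieved[:k]) & relevant) / len(relevant)


def reciprocal_rank(retrieved: Sequence[str], relevant: Set[str]) -> float:
    """1 / rank of the first relevant id, or 0.0 if none was retrieved."""
    for rank, doc_id in enumerate(retrieved, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0
```

Under these assumptions, `measure_latency` would wrap the embedding call or the Qdrant search call (for example `measure_latency(lambda: search(query))`, where `search` is whatever function the pipeline already exposes). Averaging `recall_at_k` over a labeled query set gives one number per chunking strategy and for end-to-end retrieval, and comparing the mean of `reciprocal_rank` before and after reranking gives one possible reranker effectiveness metric.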