Skip to content

Examples of integrating LangChain with vector stores for semantic search and RAG. Covers Chroma and FAISS. Ideal for learning how to store, index, and query embeddings with LLMs.

Notifications You must be signed in to change notification settings

tahirkorma/langchain-vector-stores

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

LangChain Vector Stores

This repository demonstrates how to integrate vector stores with LangChain for building applications that require semantic search, retrieval-augmented generation (RAG), and other LLM-powered workflows.

Currently, the code covers two vector stores:

  • Chroma – a lightweight, open-source embedding database.
  • FAISS – Facebook AI Similarity Search, a high-performance library for efficient similarity search and clustering of dense vectors.

πŸš€ Use Cases

Vector stores allow you to efficiently store and search embeddings. Some common applications include:

  • Question Answering over large documents.
  • Context-aware chatbots with memory.
  • Semantic Search beyond keyword matching.
  • Recommendation Systems using similarity search.
  • Retrieval-Augmented Generation (RAG) to enhance LLM outputs with external knowledge.

πŸ“š Vector Stores Covered in this Repository

1. Chroma

  • Open-source and lightweight.
  • Simple local setup, ideal for prototypes and small to medium workloads.
  • Integrates seamlessly with LangChain.

2. FAISS

  • Developed by Facebook AI Research.
  • Optimized for speed and efficiency.
  • Supports both CPU and GPU acceleration.
  • Great for large-scale similarity search.

πŸ› οΈ Other Popular Vector Databases

Even though this repository focuses on Chroma and FAISS, here are some other production-grade vector databases you may consider:

  • Weaviate

    • Cloud-native, open-source, and schema-based.
    • Supports hybrid search (combining keyword + vector search).
    • Has built-in modules for image and text embeddings.
  • Milvus

    • Open-source and highly scalable.
    • Designed for billion-scale vector datasets.
    • Strong community support and active development.
  • Pinecone

    • Fully managed vector database as a service.
    • Handles scaling, sharding, and replication automatically.
    • Great for enterprise use cases where infrastructure is managed.
  • Qdrant

    • Open-source, written in Rust.
    • Provides high performance with a modern API.
    • Supports filters and hybrid search.

About

Examples of integrating LangChain with vector stores for semantic search and RAG. Covers Chroma and FAISS. Ideal for learning how to store, index, and query embeddings with LLMs.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published