A clean, modular Python tool for extracting text from PDF documents, chunking it, and generating concise summaries using a large language model.
- Robust PDF text extraction with optional page cleaning
- Configurable text chunking
- LLM-powered summarization (currently using OpenAI's legacy completion endpoint)
- Paginated pretty-printing of the final summary
pip install -r requirements.txt