Skip to content

Comments

allow providing vlm captions with surrounding page text#1389

Draft
edknv wants to merge 5 commits intoNVIDIA:mainfrom
edknv:edwardk/vlm-caption-context-text
Draft

allow providing vlm captions with surrounding page text#1389
edknv wants to merge 5 commits intoNVIDIA:mainfrom
edknv:edwardk/vlm-caption-context-text

Conversation

@edknv
Copy link
Collaborator

@edknv edknv commented Feb 10, 2026

Description

This PR adds context_text_max_chars parameter to allow enriching VLM image captions with surrounding page text. When enabled, each image's caption prompt is prepended with nearby text, improving retrieval accuracy for documents where images and surrounding text are semantically linked.

Checklist

  • I am familiar with the Contributing Guidelines.
  • New or existing tests cover these changes.
  • The documentation is up to date with these changes.
  • If adjusting docker-compose.yaml environment variables have you ensured those are mimicked in the Helm values.yaml file.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant