Skip to content

Saturn Cloud Template: NLP & LLMs Section#405

Open
GeoSegun wants to merge 13 commits intomainfrom
vllm-7b
Open

Saturn Cloud Template: NLP & LLMs Section#405
GeoSegun wants to merge 13 commits intomainfrom
vllm-7b

Conversation

@GeoSegun
Copy link
Member

@GeoSegun GeoSegun commented Oct 23, 2025

Here are the code Templates completed so far:

  • nvidia-vllm-7b
  • nvidia-lora
  • nvidia-rag-mini
  • nvidia-embeddings-api
  • nvidia-langgraph
  • nvidia-vector-db
  • nvidia-nim-tgi
  • nvidia-rag-serve-api
  • nvidia-vllm-tp
  • nvidia-fsdp
  • nvidia-deepspeed
  • cpu-small-nlp

…tral, Llama, and Gemma) on Saturn Cloud GPUs, enabling fast text generation, interactive prompt input, and scalable multi-GPU deployment
@GeoSegun GeoSegun changed the title vLLM-based LLM inference notebook that runs 7B-class models (like Mitral, Llama, and Gemma) on Saturn Cloud GPUs, enabling fast text generation, interactive prompt input, and scalable multi-GPU deployment vLLM-based LLM inference notebook that runs 7B-class models Oct 23, 2025
@GeoSegun GeoSegun marked this pull request as draft October 23, 2025 08:55
@GeoSegun GeoSegun changed the title vLLM-based LLM inference notebook that runs 7B-class models Saturn Cloud Template: NLP & LLMs Section Oct 24, 2025
@GeoSegun GeoSegun marked this pull request as ready for review October 24, 2025 11:11
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant