Looking for a way to fine-tune small / vision language models without diving into complex code? Our no-code web platform simplifies the process of adapting LLM for task and domain specific use cases, allowing you to manage and run fine-tuning jobs effortlessly on Google Cloud Platform (GCP). From dataset preprocessing to model export, everything is customizable to your use case while being intuitive to use.
This is brought to you by Jet Chiang (@supreme-gg-gg) & Adarsh Dubey (@inclinedadarsh) as part of Google Summer of Code 2025 @ Google DeepMind.
A new demo is coming soon with more features as well as a beta release!
- Data preprocessing (for vision and text) from custom uploaded and huggingface datasets
- Data augmentation using NLP techniques and synthetic generation using LLM
- Fine-tuning using both Huggingface or Unsloth frameworks on text and multimodal datasets
- Fine-tuning with SFT for domain adaptation, DPO for preference alignment, and GRPO for reasoning tasks
- Fine-tuning with PEFT (LoRA, QLoRA) and full tuning, with quantization support (4-bit, 8-bit)
- Export to multiple formats (supports transformeres, vLLM, Ollama, llama.cpp, etc) to GCS bucket or huggingface hub
- Convert between different model formats to suite your deployment needs
- Cloud logging integration with Weights & Biases or TensorBoard
- Task-specific evaluation suites and batch inference ("vibe check") of fine tuned models
- IaC and deployment scripts so you can run this on your own GCP project!
- More data augmentation for vision and text datasets
- Audio modality support for Gemma 3n
- Direct deployment on GCP of fine tuned model using Ollama or vLLM (one-click and ready for inference)
-
Use it for free on our website platform once it's live (beta coming soon!)
-
Deploy the project to your own google cloud project!
We hope to support more flexible deployment options in the future.
-
Use this as a reference to build your own fine-tuning service since we're fully open source!
graph TD
subgraph Frontend
B["User"]
end
subgraph Authentication
B -- "API Request (JWT)" --> X["API Gateway"]
end
subgraph Backend
X -- "Preprocess Request" --> C["Preprocessing Service"]
C -- "Track Dataset" --> F1["Firestore (Datasets)"]
C -- "Store Data" --> H["GCS"]
X -- "Inference, Eval, Export" --> G["Inference Service"]
X -- "Train Request" --> D["Training Service"]
D -- "Trigger Job (gRPC) & Write Config to GCS" --> E["Training Job"]
E -- "Read Data, Export Model" --> H
E -- "Update Status" --> F2["Firestore (Jobs)"]
F2 -- "Read status" --> D
G -- "Load Model" --> H
end
classDef cpu fill:#e0f7fa,stroke:#333,stroke-width:1px;
classDef gpu fill:#fff3e0,stroke:#333,stroke-width:1px;
classDef storage fill:#f3e5f5,stroke:#333,stroke-width:1px;
classDef gateway fill:#ffe082,stroke:#333,stroke-width:1px;
class C,D cpu;
class G,E gpu;
class F1,F2,H storage;
class X gateway;
- Read the
README.mdin each subdirectory for more details. - Deployment current uses
cloudbuild.yaml, make sure you set the correct project ID withgcloud config set project <project-id> - Unless you have a CUDA GPU don't try to install all packages for certain services, it will fail locally (especially on Mac), use Colab for experiments instead
