Popular repositories
- Step1X-Edit Public
  A SOTA open-source image editing model that aims to match the performance of closed-source models such as GPT-4o and Gemini 2 Flash.
- gelab-zero Public
  STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.
- Step-Audio2 Public
  Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation.
Repositories
- Step3-VL-10B Public
  Step3-VL-10B: A compact yet frontier multimodal model achieving SOTA performance at the 10B scale, matching open-source models 10-20x its size.
- Step-Audio-R1 Public
- stepfun-ai.github.io Public
- gelab-zero Public
  STEP-GUI: The top GUI agent solution in the galaxy. Developed by the StepFun-GELab team and powered by StepFun’s cutting-edge research capabilities.
- Step1X-Edit Public
  A SOTA open-source image editing model that aims to match the performance of closed-source models such as GPT-4o and Gemini 2 Flash.
- NextStep-1 Public
  NextStep-1: SOTA Autoregressive Image Generation with Continuous Tokens. A research project developed by StepFun’s Multimodal Intelligence team.