Add AutoPipeline For Text2Video #12846
Conversation
Force-pushed from 2a75ff3 to 4f389e3
```python
def test_from_pretrained_text_to_video(self):
    repo = "hf-internal-testing/tiny-stable-diffusion-pipe"

    pipe = AutoPipelineForText2Video.from_pretrained(repo)
    assert pipe.__class__.__name__ == "TextToVideoSDPipeline"

    pipe = AutoPipelineForText2Video.from_pipe(pipe)
    assert pipe.__class__.__name__ == "TextToVideoSDPipeline"
```
Does this test pass with "hf-internal-testing/tiny-stable-diffusion-pipe"?
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread. Please note that issues that do not follow the contributing guidelines are likely to be ignored.
@yiyixuxu could you give this a look?
yiyixuxu
left a comment
Hey @naomili0924,
Thanks for the PR! Could you explain the motivation for this PR? These pipeline groups do not support multiple tasks.
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available for 30 days after the last update.
@yiyixuxu the idea is to have a single entry point for the text2video models / task.
The purpose of AutoPipeline is to enable users to switch between tasks using the same checkpoint. For video models, though, different tasks typically require different checkpoints (see Wan 2.1, for example: their checkpoints all have the task encoded in their names, "T2V" for text-to-video, "I2V" for image-to-video, plus other tasks like audio-to-video, first-last-frame-to-video, etc.). We're also seeing some model makers (e.g. LTX) release checkpoints that handle multiple tasks, which we support with a single pipeline (LTXConditionPipeline). The only exception where checkpoint sharing happens is text-to-video vs video-to-video, but IMO it's not worth adding and maintaining two auto pipelines for such a limited use case. I'd like to understand the motivation a bit more: is there a specific project you're working on that requires this? If the goal is just to have a single entry point to load different video pipelines without knowing the class name, `DiffusionPipeline.from_pretrained()` should already handle that.
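For context on that last point: `DiffusionPipeline.from_pretrained()` can load a checkpoint without the caller naming a class because the checkpoint's `model_index.json` records the pipeline class in its `_class_name` field. A minimal pure-Python sketch of that dispatch idea (the registry and placeholder class below are illustrative, not the library's internals):

```python
import json

# Illustrative stand-in for a real diffusers pipeline class; in the
# library, classes are resolved by name from the diffusers module.
class TextToVideoSDPipeline:
    @classmethod
    def from_config(cls, config):
        return cls()

# Hypothetical registry mapping _class_name values to pipeline classes.
PIPELINE_REGISTRY = {"TextToVideoSDPipeline": TextToVideoSDPipeline}

def load_pipeline(model_index_json: str):
    """Resolve and instantiate the pipeline class named in model_index.json."""
    config = json.loads(model_index_json)
    cls = PIPELINE_REGISTRY[config["_class_name"]]
    return cls.from_config(config)

pipe = load_pipeline('{"_class_name": "TextToVideoSDPipeline"}')
print(type(pipe).__name__)  # TextToVideoSDPipeline
```

This is why a generic entry point needs no task-specific auto class: the checkpoint itself declares which pipeline should load it.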
Thanks @yiyixuxu, I think I misunderstood the rationale behind AutoPipelines. The reason we prefer a task-specific AutoPipeline is that it makes task-specific testing in optimum-onnx and optimum-intel easy.
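To illustrate the testing convenience being described (a sketch only; the task names and lookup function below are hypothetical, not optimum's actual code): with a task-keyed registry of auto classes, a test suite can be parametrized over task names instead of hard-coding a pipeline class per checkpoint. The three auto class names used are the ones diffusers currently provides.

```python
# Hypothetical task -> auto-class-name registry; a real suite would map
# each task to the actual auto pipeline class imported from diffusers.
TASK_TO_AUTO_CLASS = {
    "text-to-image": "AutoPipelineForText2Image",
    "image-to-image": "AutoPipelineForImage2Image",
    "inpainting": "AutoPipelineForInpainting",
}

def auto_class_for_task(task: str) -> str:
    """Look up the auto pipeline entry point registered for a task."""
    if task not in TASK_TO_AUTO_CLASS:
        raise KeyError(f"no auto pipeline registered for task {task!r}")
    return TASK_TO_AUTO_CLASS[task]

print(auto_class_for_task("inpainting"))  # AutoPipelineForInpainting
```

A "text-to-video" entry would only fit this pattern if a corresponding auto class existed, which is what this PR proposes.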
What does this PR do?
Fixes # (issue)
Before submitting
- Read the documentation guidelines, and the tips on formatting docstrings.
Who can review?
Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.