If I understand correctly, ComfyUI's base CLIP loader loads only the text encoder part of the LLM to generate the embeddings used as text conditioning for the image/video models. Your node loads the full model (including the LM head) so it can also generate text. But that's a superset that already contains the text encoder.
Could we save some memory swapping here by loading the full model only once and using it for both LLM tasks and conditioning encoding? Basically, a new loader node that provides a CLIP output as well as a second output that can be fed into the LLM-inference-only variant of "Qwen_TE_LLM". Something like the sketch below.
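A rough sketch of what I mean, using the standard ComfyUI custom-node layout; `load_full_qwen_model` and `wrap_text_encoder_as_clip` are hypothetical placeholders for whatever your pack actually uses internally, and the `LLM_MODEL` socket type is likewise assumed:

```python
class QwenCombinedLoader:
    """Loads the full Qwen model once and exposes it on two sockets."""

    @classmethod
    def INPUT_TYPES(cls):
        return {"required": {"model_name": ("STRING", {"default": "Qwen2.5-VL-7B"})}}

    RETURN_TYPES = ("CLIP", "LLM_MODEL")
    RETURN_NAMES = ("clip", "llm_model")
    FUNCTION = "load"
    CATEGORY = "loaders"

    def load(self, model_name):
        # Load the full model once: transformer trunk + LM head.
        # (hypothetical helper, standing in for the pack's real loader)
        full_model = load_full_qwen_model(model_name)
        # Wrap the shared trunk as a CLIP-compatible text encoder so the
        # conditioning nodes reuse the weights already in memory.
        # (hypothetical helper)
        clip = wrap_text_encoder_as_clip(full_model)
        # Hand the same object to the LLM-inference-only node, so no
        # second copy of the weights is loaded or swapped in and out.
        return (clip, full_model)
```

The point being that both outputs reference the same underlying weights, so the model is loaded (and kept resident) exactly once.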