DEFT (Paper)
Emergent Properties of Efficient Fine-Tuning in Text-to-Image Models
In this work, we introduce DEFT (Decompositional Efficient Fine-Tuning), an efficient fine-tuning framework that adapts a pre-trained weight matrix by decomposing its update into two components with two trainable low-rank matrices: (1) a projection onto the complement of a low-rank subspace spanned by one matrix, and (2) a low-rank update. One trainable matrix defines the subspace, while the other enables flexible parameter adaptation within that subspace. We conducted extensive experiments on the DreamBooth and DreamBench++ datasets for personalization, the InsDet dataset for object and scene adaptation, and the VisualCloze dataset for universal image generation through visual in-context learning, using both Stable Diffusion and a unified model. Our results demonstrate competitive performance and highlight the emergent properties of efficient fine-tuning.
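The decomposition above can be sketched in a few lines of PyTorch. This is a minimal illustration under our own assumptions (the function name `deft_update`, the matrix names, and the QR-based projection are ours for exposition); the paper's exact formulation may differ:

```python
import torch

def deft_update(W, A, B):
    """Hypothetical sketch of a DEFT-style weight update.

    W: (d, k) pre-trained weight matrix (frozen)
    A: (d, r) trainable matrix whose columns span the low-rank subspace
    B: (r, k) trainable matrix for flexible adaptation within span(A)
    """
    # Orthonormal basis of span(A), e.g. via a QR decomposition
    Q, _ = torch.linalg.qr(A)
    # (1) Project W onto the orthogonal complement of span(A)
    P_perp = torch.eye(W.shape[0], dtype=W.dtype) - Q @ Q.T
    # (2) Add a low-rank update that lies within the subspace
    return P_perp @ W + Q @ B
```

With `B = 0` the update simply removes the component of `W` lying in `span(A)`; the second term then re-learns behavior inside that subspace.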
To create the environment, please follow ./docs/enviroment.md#enviroment-details
```python
from OmniGen import OmniGenPipeline
from deft.deft import (
    KnowledgeInjectionConfig,
    make_knowledge_injection_adapter,
    add_knowledge_injection_methods,
)

pipe = OmniGenPipeline.from_pretrained("Shitao/OmniGen-v1")

config = KnowledgeInjectionConfig(
    r=16,
    injection_method="residual_projection",
    target_modules=None,
    use_gating=True,
)
adapted_model = make_knowledge_injection_adapter(pipe.model, config)
adapted_model = add_knowledge_injection_methods(adapted_model)

adapted_model.save_pretrained("./knowledge_injection_adapter")
print("Adapted model saved successfully!")
```

To fine-tune, run:

```shell
accelerate launch --num_processes=1 finetune.py --model_name_or_path Shitao/OmniGen-v1 \
  --batch_size_per_device 4 --condition_dropout_prob 0.01 --lr 1e-3 --use_injection --lora_rank 8 \
  --json_file ./toy_data/toy_subject_data.jsonl --image_path ./toy_data/images --max_input_length_limit 18000 \
  --keep_raw_resolution --max_image_size 1024 --gradient_accumulation_steps 1 --ckpt_every 100 --epochs 2000 --log_every 1 \
  --decomposition_method qr --results_dir ~/PP
```

Please refer to docs/fine-tuning.md for more details (e.g., full fine-tuning).
Additional results and visualizations can be found at the following links:
- DreamBench++ Comparison: https://anonymousdreambranchplus.netlify.app
- OmniGen-VisualCloze: https://anonymouscloze.netlify.app/
- InsDet Objects Qualitative Results: https://anonymousinstobjets.netlify.app
We would like to acknowledge and thank the following repositories and works that made this research possible:
- OmniGen - For the foundational OmniGen framework
- DreamBench++ - For the comprehensive personalized image generation benchmark
- PaRa: Personalizing Text-to-Image Diffusion via Parameter Rank Reduction - For insights on parameter-efficient personalization methods
- PEFT (Parameter-Efficient Fine-Tuning) - For efficient fine-tuning implementations
We appreciate the open-source community for making these valuable resources available.
This repo is licensed under the Apache License 2.0.
If you find this repository useful, please consider giving it a star ⭐ and a citation:
```bibtex
@inproceedings{
kumar2025emergent,
title={Emergent Properties of Efficient Fine-Tuning in Text-to-Image Models},
author={Komal Kumar and Rao Muhammad Anwer and Fahad Shahbaz Khan and Salman Khan and Ivan Laptev and Hisham Cholakkal},
booktitle={The Thirty-ninth Annual Conference on Neural Information Processing Systems},
year={2025},
url={https://openreview.net/forum?id=R9xJSk5SQ2}
}
```



