Hello,
I observe that the training of this stage is much slower than the previous one. Is that expected?
For example:
- stage1
MoT_vae_stage1_t2m.yaml the training succeeds in 2-4 batches per second
- stage2
MoT_vae_stage2_all.yaml the training succeeds in 1.5-3 batches per second
- stage3
MoT_vae_stage2_instruct.yaml the training succeeds in only 0.4 batches per second, together with the higher amount of batches per epoch, makes the training very slow.
Did you observe the same?