Hi, First of all, I want to express my appreciation for the incredible work you have done on the SlimPajama project. I have a query regarding the interleaved component of the pipeline. Currently, it appears that equal weightage is given to all sources. Is there a recommended approach or best practice to assign different weights to each source while combining data from various sources?
Thanks in advance.
Cheers,