Tool to generate high quality synthetic data for llm fintuning, rlhf and (maybe) pretraning
- step-1 : start with evolve instruct (GPT-3.5/4)
- add another backend (llama, mistral, claude)
- Add self opmizer https://arxiv.org/abs/2309.03409
- optimze root prompt for diffent llm
@article{xu2023wizardlm,
title={Wizardlm: Empowering large language models to follow complex instructions},
author={Xu, Can and Sun, Qingfeng and Zheng, Kai and Geng, Xiubo and Zhao, Pu and Feng, Jiazhan and Tao, Chongyang and Jiang, Daxin},
journal={arXiv preprint arXiv:2304.12244},
year={2023}
}