seanzhang-zhichen / Qwen-WisdomVast

Qwen-WisdomVast is a large model built on Qwen1.5-7B as the base and trained with DoRA and LoRA+ on 1 million high-quality Chinese multi-turn SFT samples, 200,000 English multi-turn SFT samples, and 2,000 single-turn self-cognition samples. Compared to Qwen1.5-7B-Chat, it improves mathematical ability by 5.16%, 12.8% on the Hu…
18 · Updated 7 months ago

Related projects

Alternatives and complementary repositories for Qwen-WisdomVast