character-ai / pipelining-sft

Simple and efficient DeepSeek V3 SFT using pipeline parallelism and expert parallelism, with both FP8 and BF16 training
☆81 Updated last month

Alternatives and similar repositories for pipelining-sft

Users who are interested in pipelining-sft are comparing it to the libraries listed below.
