changyeyu / LLM-RL-VisualizedLinks

LLM, RL, DPO, SFT, Distillation, Alignment. 由《大模型算法》作者发起(By the author of the book📘 "Large Model Algorithms")
44Updated 2 weeks ago

Alternatives and similar repositories for LLM-RL-Visualized

Users that are interested in LLM-RL-Visualized are comparing it to the libraries listed below

Sorting: