ChenmienTan / RL2Links
☆1,089Updated last week
Alternatives and similar repositories for RL2
Users that are interested in RL2 are comparing it to the libraries listed below
Sorting:
- Unified KV Cache Compression Methods for Auto-Regressive Models☆1,298Updated last year
- adds Sequence Parallelism into LLaMA-Factory☆602Updated 3 months ago
- ☆332Updated 5 months ago
- Codebase for Iterative DPO Using Rule-based Rewards☆267Updated 9 months ago
- Train your Agent model via our easy and efficient framework