ch33nchan / RLlama
☆15 Updated this week
Alternatives and similar repositories for RLlama
Users who are interested in RLlama are comparing it to the libraries listed below.
- Compiling useful links, papers, benchmarks, ideas, etc. ☆46 Updated 3 months ago
- rl from zero pretrain, can it be done? we'll see. ☆56 Updated this week
- Simple repository for training small reasoning models ☆33 Updated 4 months ago
- Train your own SOTA deductive reasoning model ☆94 Updated 3 months ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations ☆60 Updated last month
- ☆39 Updated last month
- ☆27 Updated 11 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆68 Updated 3 months ago
- ☆63 Updated last month
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)