nph4rd / grpo
simple grpo
☆12Updated 8 months ago
Alternatives and similar repositories for grpo
Users who are interested in grpo are comparing it to the libraries listed below
- Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training☆132Updated last year
- A puzzle to learn about prompting☆135Updated 2 years ago
- ☆31Updated last year
- Highly commented implementations of Transformers in PyTorch☆138Updated 2 years ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**.☆65Updated 8 months ago
- ☆92Updated last year
- git extension for {collaborative, communal, continual} model development☆217Updated last year
- Open Character Training☆66Updated 2 months ago
- Tools to make language models a bit easier to use☆64Updated last week
- ☆10Updated last year
- Large scale 4D parallelism pre-training for 🤗 transformers in Mixture of Experts *(still work in progress)*☆86Updated 2 years ago
- ☆40Updated last year
- ☆76Updated last year
- Train vision models using JAX and 🤗 transformers☆100Updated last month
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.☆160Updated last year
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆129Updated 2 months ago
- PageRank for LLMs☆52Updated 4 months ago
- Compiling useful links, papers, benchmarks, ideas, etc.☆46Updated 10 months ago
- ☆116Updated last week
- Use Actions to acquire those precious lambda GPUs☆19Updated 2 years ago
- Various handy scripts to quickly set up new Linux and Windows sandboxes, containers, and WSL.☆40Updated last week
- Comprehensive analysis of the performance differences between QLoRA, LoRA, and full fine-tunes.☆83Updated 2 years ago
- HomebrewNLP in JAX flavour for maintainable TPU training☆51Updated 2 years ago
- ☆94Updated 2 years ago
- MoE training for Me and You and maybe other people☆331Updated 3 weeks ago
- ☆56Updated last year
- ☆53Updated 11 months ago
- ML/DL Math and Method notes☆66Updated 2 years ago
- Losslessly encode text natively with arithmetic coding and HuggingFace Transformers☆77Updated 3 months ago