huggingface / trl-tutoLinks
☆51Updated 6 months ago
Alternatives and similar repositories for trl-tuto
Users that are interested in trl-tuto are comparing it to the libraries listed below
Sorting:
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆195Updated 6 months ago
- Simple repository for training small reasoning models☆47Updated 10 months ago
- ☆213Updated last week
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆105Updated 2 months ago
- Low memory full parameter finetuning of LLMs☆53Updated 5 months ago
- LLM training in simple, raw C/CUDA☆15Updated last year
- ☆52Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation☆112Updated last year
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆121Updated 2 months ago
- This repository contain the simple llama3 implementation in pure jax.☆70Updated 10 months ago
- ☆38Updated last year
- NanoGPT-speedrunning for the poor T4 enjoyers☆73Updated 7 months ago
- Slide decks, coding exercises, and quick references for learning the JAX AI Stack☆195Updated last month
- A practical guide to diffusion models, implemented from scratch.☆196Updated last week
- Training framework with a goal to explore the frontier of sample efficiency of small language models☆82Updated last week
- ☆46Updated 8 months ago
- ☆128Updated last week
- Exploring Applications of GRPO☆250Updated 3 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines