huggingface / trl-tuto
☆52Updated 8 months ago
Alternatives and similar repositories for trl-tuto
Users that are interested in trl-tuto are comparing it to the libraries listed below
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- ☆214Updated 2 weeks ago
- A set of scripts and notebooks on LLM fine-tuning and dataset creation☆116Updated last year
- This repository contains a simple Llama 3 implementation in pure JAX.☆71Updated 11 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆198Updated 8 months ago
- ☆31Updated last year
- ☆53Updated last year
- Seamless interface for using PyTorch distributed with Jupyter notebooks☆57Updated 4 months ago
- A practical guide to diffusion models, implemented from scratch.☆245Updated last month
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆115Updated last month
- Simple repository for training small reasoning models☆49Updated last year
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- ☆59Updated 2 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆315Updated this week
- Slide decks, coding exercises, and quick references for learning the JAX AI Stack☆244Updated 3 weeks ago
- Gradient Boosting Reinforcement Learning (GBRL)☆136Updated last week
- ☆39Updated 9 months ago
- A collection of lightweight interpretability scripts to understand how LLMs think☆89Updated this week
- FlexAttention-based, minimal vLLM-style inference engine for fast Gemma 2 inference.☆334Updated 3 months ago
- ☆67Updated 7 months ago
- ☆56Updated last year
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆150Updated 4 months ago
- ☆46Updated 10 months ago
- ☆239Updated 2 months ago
- ☆37Updated last year
- Code for training and evaluating Contextual Document Embedding models☆202Updated 8 months ago
- Simple GRPO scripts and configurations.☆59Updated last year
- Low memory full parameter finetuning of LLMs☆53Updated 6 months ago
- ☆46Updated 8 months ago
- [ICLR 2026] Official PyTorch Implementation of RLP: Reinforcement as a Pretraining Objective☆232Updated 2 weeks ago