huggingface / trl-tutoLinks
☆48Updated 4 months ago
Alternatives and similar repositories for trl-tuto
Users that are interested in trl-tuto are comparing it to the libraries listed below
Sorting:
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆274Updated last week
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆194Updated 4 months ago
- Simple repository for training small reasoning models☆40Updated 8 months ago
- ☆52Updated 11 months ago
- ☆113Updated 3 weeks ago
- ☆211Updated last week
- A set of scripts and notebooks on LLM finetunning and dataset creation☆110Updated last year
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆103Updated 3 weeks ago
- An extension of the nanoGPT repository for training small MOE models.☆197Updated 7 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆101Updated last week
- Simple GRPO scripts and configurations.☆59Updated 8 months ago
- Official codebase for "Quantile Reward Policy Optimization: Alignment with Pointwise Regression and Exact Partition Functions" (Matrenok …☆27Updated 3 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"☆187Updated last week
- Code for☆27Updated 10 months ago
- LLM training in simple, raw C/CUDA☆15Updated 10 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆96Updated last week
- ☆46Updated 6 months ago
- ☆31Updated 11 months ago
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆296Updated 2 months ago
- Training framework with a goal to explore the frontier of sample efficiency of small language models☆63Updated this week
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated last year
- Gradient Boosting Reinforcement Learning (GBRL)☆120Updated 2 months ago
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆50Updated last month
- ☆230Updated this week
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆114Updated 2 months ago
- ☆57Updated 2 weeks ago
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their…☆155Updated last week
- ☆67Updated last year
- Exploring Applications of GRPO☆248Updated last month