huggingface / trl-tutoLinks
☆51Updated 6 months ago
Alternatives and similar repositories for trl-tuto
Users that are interested in trl-tuto are comparing it to the libraries listed below
Sorting:
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- ☆122Updated last week
- Simple repository for training small reasoning models☆46Updated 9 months ago
- A set of scripts and notebooks on LLM finetunning and dataset creation☆111Updated last year
- Simple GRPO scripts and configurations.☆59Updated 9 months ago
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆105Updated 2 months ago
- ☆31Updated last year
- ☆52Updated last year
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆196Updated 5 months ago
- ☆86Updated last year
- Low memory full parameter finetuning of LLMs☆53Updated 4 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆112Updated last month
- ☆38Updated last year
- Seemless interface of using PyTOrch distributed with Jupyter notebooks☆56Updated 2 months ago
- ☆212Updated last week
- This repository contain the simple llama3 implementation in pure jax.☆70Updated 9 months ago
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆196Updated last year
- experiments with inference on llama☆103Updated last year
- Jax like function transformation engine but micro, microjax☆33Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆76Updated last year
- code for training & evaluating Contextual Document Embedding models☆200Updated 6 months ago
- LLM training in simple, raw C/CUDA☆15Updated 11 months ago
- A reading list of relevant papers and projects on foundation model annotation☆28Updated 9 months ago
- A comprehensive deep dive into the world of tokens☆227Updated last year
- Training framework with a goal to explore the frontier of sample efficiency of small language models☆79Updated 2 weeks ago
- ☆68Updated last year
- ML/DL Math and Method notes☆64Updated last year
- A place to store reusable transformer components of my own creation or found on the interwebs☆62Updated last week
- A collection of lightweight interpretability scripts to understand how LLMs think☆68Updated this week
- ☆94Updated 2 years ago