huggingface / trl-tuto
☆49Updated 5 months ago
Alternatives and similar repositories for trl-tuto
Users that are interested in trl-tuto are comparing it to the libraries listed below
- ☆211Updated last week
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆195Updated 5 months ago
- This repository contains a simple llama3 implementation in pure JAX.☆70Updated 8 months ago
- ☆52Updated last year
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆91Updated 2 years ago
- ☆31Updated 11 months ago
- Gradient Boosting Reinforcement Learning (GBRL)☆122Updated this week
- ☆57Updated last month
- ☆67Updated last year
- FlexAttention-based, minimal vLLM-style inference engine for fast Gemma 2 inference.☆301Updated last week
- A set of scripts and notebooks on LLM fine-tuning and dataset creation☆110Updated last year
- Training framework with a goal to explore the frontier of sample efficiency of small language models☆78Updated last week
- Highly commented implementations of Transformers in PyTorch☆136Updated 2 years ago
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆104Updated last month
- Simple repository for training small reasoning models☆44Updated 9 months ago
- Seamless interface for using PyTorch distributed with Jupyter notebooks☆53Updated last month
- ☆118Updated last week
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆111Updated last month
- Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines☆195Updated last year
- An extension of the nanoGPT repository for training small MoE models.☆207Updated 8 months ago
- LLM training in simple, raw C/CUDA☆15Updated 11 months ago
- ☆103Updated 3 months ago
- Code for training & evaluating Contextual Document Embedding models☆199Updated 5 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆473Updated 2 months ago
- Our solution for the ARC challenge 2024☆183Updated 4 months ago
- ☆225Updated 2 weeks ago
- ML/DL Math and Method notes☆64Updated last year
- ☆230Updated last week
- Tutorials for Triton, a language for writing GPU kernels☆56Updated 2 years ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆147Updated last month