huggingface / trl-tuto
☆48Updated last month
Alternatives and similar repositories for trl-tuto
Users who are interested in trl-tuto are comparing it to the libraries listed below.
- ☆56Updated last month
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆186Updated 3 weeks ago
- ☆47Updated 7 months ago
- ☆40Updated last month
- NanoGPT-speedrunning for the poor T4 enjoyers☆66Updated 2 months ago
- An extension of the nanoGPT repository for training small MOE models.☆155Updated 3 months ago
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated last year
- Simple GRPO scripts and configurations.☆59Updated 4 months ago
- An introduction to LLM Sampling☆78Updated 6 months ago
- Source code for the collaborative reasoner research project at Meta FAIR.☆93Updated 2 months ago
- ☆46Updated 3 months ago
- ☆194Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆71Updated this week
- ☆30Updated 7 months ago
- code for training & evaluating Contextual Document Embedding models☆195Updated last month
- Collection of autoregressive model implementations☆85Updated 2 months ago
- Simple repository for training small reasoning models☆33Updated 4 months ago
- ☆124Updated 2 months ago
- Prune transformer layers☆69Updated last year
- ☆55Updated 7 months ago
- A set of scripts and notebooks on LLM finetuning and dataset creation☆110Updated 9 months ago
- LLM training in simple, raw C/CUDA☆14Updated 6 months ago
- Set of scripts to finetune LLMs☆37Updated last year
- ☆128Updated 3 months ago
- This repository contains a simple llama3 implementation in pure JAX.☆67Updated 4 months ago
- Learn online intrinsic rewards from LLM feedback☆41Updated 6 months ago
- Building the cognitive-core to solve ARC-AGI-2☆21Updated 4 months ago
- Train your own SOTA deductive reasoning model☆96Updated 3 months ago
- ☆41Updated 6 months ago
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated 10 months ago