huggingface / trl-tuto
☆49Updated 3 months ago
Alternatives and similar repositories for trl-tuto
Users that are interested in trl-tuto are comparing it to the libraries listed below
- ☆209Updated last week
- This code repository contains the code used for my "Optimizing Memory Usage for Training LLMs and Vision Transformers in PyTorch" blog po…☆92Updated 2 years ago
- Simple repository for training small reasoning models☆40Updated 7 months ago
- ☆229Updated last month
- ☆111Updated 2 weeks ago
- ☆240Updated 6 months ago
- ☆44Updated 4 months ago
- A repository to unravel the language of GPUs, making their kernel conversations easy to understand☆194Updated 3 months ago
- A set of scripts and notebooks on LLM finetuning and dataset creation☆110Updated last year
- ☆31Updated 10 months ago
- Seamless interface for using PyTorch distributed with Jupyter notebooks☆50Updated last week
- ☆67Updated 11 months ago
- Library for text-to-text regression, applicable to any input string representation and allows pretraining and fine-tuning over multiple r…☆225Updated last week
- Create an AI capable of solving reasoning tasks it has never seen before☆94Updated 9 months ago
- This repository contains a simple llama3 implementation in pure JAX.☆68Updated 7 months ago
- Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated last year
- Examining how large language models (LLMs) perform across various synthetic regression tasks when given (input, output) examples in their…☆155Updated last year
- A zero-to-one guide on scaling modern transformers with n-dimensional parallelism.☆91Updated 3 weeks ago
- ☆56Updated 10 months ago
- ☆101Updated last week
- FlexAttention based, minimal vllm-style inference engine for fast Gemma 2 inference.☆276Updated last month
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…☆99Updated last month
- ☆58Updated 4 months ago
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models☆451Updated last month
- ML/DL Math and Method notes☆63Updated last year
- RAGs: Simple implementations of Retrieval Augmented Generation (RAG) Systems☆130Updated 8 months ago
- Gradient Boosting Reinforcement Learning (GBRL)☆118Updated last month
- An extension of the nanoGPT repository for training small MOE models.☆194Updated 6 months ago
- $100K or 100 Days: Trade-offs when Pre-Training with Academic Resources☆146Updated 4 months ago
- ☆157Updated 9 months ago