Train transformer language models with reinforcement learning.
☆19Feb 25, 2025Updated last year
Alternatives and similar repositories for trl
Users that are interested in trl are comparing it to the libraries listed below.
- An NLP framework to find hate speech comments in a comment corpus.☆11Dec 8, 2022Updated 3 years ago
- ☆18Nov 5, 2025Updated 5 months ago
- Various experiments for scaling inference-time compute with small reasoning models☆17Jan 16, 2025Updated last year
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆42Feb 7, 2026Updated 2 months ago
- ☆31Nov 16, 2025Updated 5 months ago
- A First Look at Conventional Commits Classification☆13Nov 18, 2024Updated last year
- Basic deep learning models in PyTorch.☆10May 18, 2020Updated 5 years ago
- ☆20May 24, 2025Updated 11 months ago
- ☆53Apr 17, 2026Updated last week
- English and Chinese LaTeX template for reports/projects/proposal at Beijing Institute of Technology☆10Nov 19, 2020Updated 5 years ago
- Lego for GRPO☆30May 27, 2025Updated 11 months ago
- maxas Scott Grey's maxas assembler sgemm explaining the (for me) missing parts https://github.com/NervanaSystems/maxas☆17Dec 22, 2018Updated 7 years ago
- Instant Neural Graphics Primitives from scratch, zero dependencies. Learning by doing.☆10Aug 18, 2023Updated 2 years ago
- ☆40Jan 25, 2026Updated 3 months ago
- ☆12Apr 19, 2024Updated 2 years ago
- Open & executable reproductions of figures and other results from papers in earth science & engineering.☆11Oct 9, 2023Updated 2 years ago
- ☆25Aug 19, 2025Updated 8 months ago
- ☆11Mar 15, 2024Updated 2 years ago
- Geology and Python conference☆10Jun 18, 2020Updated 5 years ago
- ☆15Jan 15, 2021Updated 5 years ago
- An implementation of the Augmented Random Search algorithm☆14Jan 29, 2022Updated 4 years ago
- A library for pairing based cryptography☆32Apr 14, 2026Updated 2 weeks ago
- Clustered Compositional Embeddings☆12Oct 25, 2023Updated 2 years ago
- Seismic event picking using Matplotlib and machine learning☆14Oct 22, 2017Updated 8 years ago
- The Swift Coding Style guide☆16Apr 30, 2018Updated 8 years ago
- ScribePal is an Open Source intelligent browser extension that leverages AI to empower your web experience by providing contextual insigh…☆22Apr 6, 2026Updated 3 weeks ago
- Mind-wandering detector using EEG and ML☆10Aug 19, 2023Updated 2 years ago
- Automatically annotates YOLO dataset using Moondream visual model☆19Aug 24, 2025Updated 8 months ago
- Agentic Virtual Lab☆19Nov 30, 2025Updated 5 months ago
- ☆116Jan 21, 2025Updated last year
- A Frida-based utility for dynamically extracting native (.so) libraries from Android applications.☆57Feb 6, 2026Updated 2 months ago
- A Python framework to connect to email/contacts/agendas servers and write automated rules for efficient workflows.☆13Jan 26, 2026Updated 3 months ago
- ☆14May 17, 2025Updated 11 months ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queries☆33Mar 2, 2025Updated last year
- pre-trained topographic models☆13Aug 22, 2025Updated 8 months ago
- ☆15Apr 1, 2024Updated 2 years ago
- ☆18Apr 20, 2026Updated last week
- Fork of Flame repo for training of some new stuff in development☆19Updated this week