Train transformer language models with reinforcement learning.
☆19Feb 25, 2025Updated last year
Alternatives and similar repositories for trl
Users that are interested in trl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Oct 12, 2025Updated 7 months ago
- Score Entropy Discrete Diffusion language model - https://arxiv.org/abs/2310.16834☆18Jul 7, 2025Updated 10 months ago
- Repository for the paper: "TiC-LM: A Web-Scale Benchmark for Time-Continual LLM Pretraining" ACL Oral 2025☆24Apr 19, 2026Updated last month
- Generates video game music using neural networks.☆10Jun 9, 2022Updated 3 years ago
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Generates video game music using neural networks.☆12Jun 9, 2022Updated 3 years ago
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆42Feb 7, 2026Updated 3 months ago
- ☆20May 24, 2025Updated 11 months ago
- ☆64Apr 17, 2026Updated last month
- Lego for GRPO☆30May 27, 2025Updated 11 months ago
- Instant Neural Graphics Primitives from scratch, zero dependencies. Learning by doing.☆10Aug 18, 2023Updated 2 years ago
- ☆43May 3, 2026Updated 2 weeks ago
- ☆12Apr 19, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Mar 15, 2024Updated 2 years ago
- ☆19Dec 12, 2023Updated 2 years ago
- An implementation of the Augmented Random Search algorithm☆14Jan 29, 2022Updated 4 years ago
- A peer-to-peer communication system. BIT 小学期软件开发实训。☆11Sep 7, 2018Updated 7 years ago
- Clustered Compositional Embeddings☆13Oct 25, 2023Updated 2 years ago
- ☆176Feb 13, 2026Updated 3 months ago
- Prompt-driven automation platform - Transform natural language into executable workflows☆34Jul 13, 2025Updated 10 months ago
- Mind-wandering detector using EEG and ML☆10Aug 19, 2023Updated 2 years ago
- An unofficial API for royalroad.com☆22Jul 27, 2025Updated 9 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Automatically annotates YOLO dataset using Moondream visual model☆21Aug 24, 2025Updated 8 months ago
- Agentic Virtual Lab☆19Nov 30, 2025Updated 5 months ago
- ☆116Jan 21, 2025Updated last year
- A course on Hugging Face land☆37Apr 17, 2026Updated last month
- ☆14Apr 16, 2025Updated last year
- This project automates promotional posts across multiple social media platforms.☆34Mar 30, 2026Updated last month
- ☆14May 17, 2025Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Apr 24, 2026Updated 3 weeks ago
- Render HTML to a specific portion of a word document using Python and PyWin32☆17Apr 24, 2017Updated 9 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A tool for determining if a pickleball is in or out of bounds☆17Feb 3, 2025Updated last year
- ☆12Jun 2, 2023Updated 2 years ago
- Project investigating human physical construction behavior☆12Oct 6, 2023Updated 2 years ago
- 纯QML实现的多标签页窗口demo,可自动搜索并加载本地.qml页面文件。☆12May 14, 2023Updated 3 years ago
- ☆20Apr 24, 2025Updated last year
- DUNL - Neuron 2025☆25Jan 18, 2026Updated 4 months ago
- Easily slice your video files into smaller segments.☆34Nov 9, 2025Updated 6 months ago