Train transformer language models with reinforcement learning.
☆19Feb 25, 2025Updated last year
Alternatives and similar repositories for trl
Users that are interested in trl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evolutionary Search for expert-level performance on any task with environmental feedback☆14Oct 12, 2025Updated 7 months ago
- This project demonstrates how you can enhance standard CRUD operations in your application using Semantic Search mechanism.☆12Oct 23, 2024Updated last year
- various experiments for scaling inference time compute with small reasoning models☆17Jan 16, 2025Updated last year
- Sparse matrix and vector classes, solvers. This is a mirror repository - development happens on https://gitlab.dune-project.org/☆11Jun 1, 2026Updated last week
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization☆42Feb 7, 2026Updated 4 months ago
- ☆36Nov 16, 2025Updated 6 months ago
- ☆20May 24, 2025Updated last year
- ☆76Apr 17, 2026Updated last month
- ❇️ The best modules for Markov Logic Networks condensed in one framework.☆13Dec 20, 2017Updated 8 years ago
- English and Chinese LaTeX template for reports/projects/proposal at Beijing Institute of Technology☆10Nov 19, 2020Updated 5 years ago
- Lego for GRPO☆30May 27, 2025Updated last year
- PDELab library (function spaces, operators, solvers…). This is a mirror repository - development happens on https://gitlab.dune-project.o…☆12Apr 15, 2026Updated last month
- Instant Neural Graphics Primitives from scratch, zero dependencies. Learning by doing.☆10Aug 18, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆46May 3, 2026Updated last month
- ☆12Apr 19, 2024Updated 2 years ago
- Open & executable reproductions of figures and other results from papers in earth science & engineering.☆11Oct 9, 2023Updated 2 years ago
- ☆11May 13, 2020Updated 6 years ago
- ☆12Mar 15, 2024Updated 2 years ago
- Geology and Python conference☆10Jun 18, 2020Updated 5 years ago
- Simple notebook to train (technically, fine-tune) llama 3 8B on your own text data!☆24May 5, 2024Updated 2 years ago
- ☆19Dec 12, 2023Updated 2 years ago
- An implementation of the Augmented Random Search algorithm☆14Jan 29, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Clustered Compositional Embeddings☆13Oct 25, 2023Updated 2 years ago
- Plex Media Server on AWS☆22Dec 13, 2020Updated 5 years ago
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆23Jul 28, 2025Updated 10 months ago
- A conversational UI for chatbots using the llama.cpp server☆14May 26, 2025Updated last year
- Seismic event picking using Matplotlib and machine learning☆14Oct 22, 2017Updated 8 years ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal…☆12Sep 16, 2024Updated last year
- Prompt-driven automation platform - Transform natural language into executable workflows☆34Jul 13, 2025Updated 10 months ago
- An unofficial API for royalroad.com☆22Jul 27, 2025Updated 10 months ago
- Agentic Virtual Lab☆19Nov 30, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A course on Hugging Face land☆38May 26, 2026Updated 2 weeks ago
- A snake clone made in rust using leptos☆14Mar 31, 2023Updated 3 years ago
- Notas de la clase de estadística computacional, maestría en Ciencia de Datos, ITAM☆23Oct 30, 2019Updated 6 years ago
- A Frida-based utility for dynamically extracting native (.so) libraries from Android applications.☆60Feb 6, 2026Updated 4 months ago
- ☆14Apr 16, 2025Updated last year
- Code to optimize borehole directional/deviation survey data - Github pages:☆17Feb 17, 2024Updated 2 years ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆41Feb 15, 2024Updated 2 years ago