Train transformer language models with reinforcement learning.
☆19 · Feb 25, 2025 · Updated last year
Alternatives and similar repositories for trl
Users who are interested in trl are comparing it to the libraries listed below.
- ☆28 · Nov 16, 2025 · Updated 4 months ago
- Various experiments for scaling inference time compute with small reasoning models ☆17 · Jan 16, 2025 · Updated last year
- Anonymize sensitive information in text prompts before sending them to LLM applications ☆20 · Mar 24, 2024 · Updated last year
- Mixture of Cognitive Reasoners: Modular Reasoning with Brain-Like Specialization ☆40 · Feb 7, 2026 · Updated last month
- ☆10 · Mar 4, 2016 · Updated 10 years ago
- A First Look at Conventional Commits Classification ☆13 · Nov 18, 2024 · Updated last year
- ☆35 · Jan 25, 2026 · Updated last month
- Lego for GRPO ☆30 · May 27, 2025 · Updated 9 months ago
- English and Chinese LaTeX template for reports/projects/proposal at Beijing Institute of Technology ☆10 · Nov 19, 2020 · Updated 5 years ago
- Instant Neural Graphics Primitives from scratch, zero dependencies. Learning by doing. ☆10 · Aug 18, 2023 · Updated 2 years ago
- iMessage RAG MCP Server from Anthropic MCP Hackathon (NYC) ☆14 · Mar 10, 2025 · Updated last year
- ☆24 · Aug 19, 2025 · Updated 7 months ago
- An implementation of the Augmented Random Search algorithm ☆14 · Jan 29, 2022 · Updated 4 years ago
- Alternative zk-SNARK proof verifier written in Rust for Zcash Sprout. ☆20 · Aug 12, 2017 · Updated 8 years ago
- A peer-to-peer communication system. BIT short-semester software development practicum. ☆11 · Sep 7, 2018 · Updated 7 years ago
- Clustered Compositional Embeddings ☆11 · Oct 25, 2023 · Updated 2 years ago
- A compute framework for building Search, RAG, Recommendations and Analytics over complex (structured+unstructured) data, with ultra-modal… ☆12 · Sep 16, 2024 · Updated last year
- A conversational UI for chatbots using the llama.cpp server ☆14 · May 26, 2025 · Updated 9 months ago
- Mind-wandering detector using EEG and ML ☆10 · Aug 19, 2023 · Updated 2 years ago
- Automatically annotates YOLO dataset using Moondream visual model ☆20 · Aug 24, 2025 · Updated 6 months ago
- Agentic Virtual Lab ☆19 · Nov 30, 2025 · Updated 3 months ago
- ☆116 · Jan 21, 2025 · Updated last year
- ☆14 · Apr 16, 2025 · Updated 11 months ago
- ☆20 · Jan 21, 2022 · Updated 4 years ago
- ☆14 · May 17, 2025 · Updated 10 months ago
- WebUI for using SmolDocling-256M-preview ☆13 · Mar 21, 2025 · Updated last year
- OWL To SPARQL Query Rewriter ☆20 · Nov 16, 2020 · Updated 5 years ago
- UQ: Assessing Language Models on Unsolved Questions ☆30 · Aug 26, 2025 · Updated 6 months ago
- ☆15 · Apr 1, 2024 · Updated last year
- Project investigating human physical construction behavior ☆12 · Oct 6, 2023 · Updated 2 years ago
- A tool for determining if a pickleball is in or out of bounds ☆16 · Feb 3, 2025 · Updated last year
- DUNL - Neuron 2025 ☆24 · Jan 18, 2026 · Updated 2 months ago
- Easily slice your video files into smaller segments. ☆34 · Nov 9, 2025 · Updated 4 months ago
- Code to go along with my AI agents YouTube video ☆17 · Apr 5, 2024 · Updated last year
- ☆19 · Dec 4, 2025 · Updated 3 months ago
- An open source UI for Meta Llama Stack Apps / Agents ☆41 · Sep 10, 2024 · Updated last year
- Spec-driven thinking, nano-sized docs. Lightweight task specification for AI-assisted development. ☆37 · Jan 25, 2026 · Updated last month
- Deep Learning ☆14 · Aug 28, 2020 · Updated 5 years ago
- II-Thought-RL is our initial attempt at developing a large-scale, multi-domain Reinforcement Learning (RL) dataset ☆31 · Apr 8, 2025 · Updated 11 months ago