alexzhang13 / rlm
General plug-and-play inference library for Recursive Language Models (RLMs), supporting various sandboxes.
☆414 Updated this week
Alternatives and similar repositories for rlm
Users who are interested in rlm are comparing it to the libraries listed below.
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆293 Updated 2 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling ☆509 Updated 3 weeks ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse … ☆782 Updated this week
- A framework for optimizing DSPy programs with RL ☆303 Updated this week
- Curated collection of community environments ☆196 Updated 2 weeks ago
- Together Open Deep Research ☆356 Updated 8 months ago
- Real-Time Detection of Hallucinated Entities in Long-Form Generation ☆273 Updated last month
- rl from zero pretrain, can it be done? yes. ☆286 Updated 3 months ago
- The State Of The Art, intelligence ☆157 Updated 4 months ago
- Tutorial for building LLM router ☆241 Updated last year
- Collection of scripts and notebooks for OpenAI's latest GPT OSS models ☆494 Updated 4 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond) ☆454 Updated last year
- Harbor is a framework for running agent evaluations and creating and using RL environments. ☆306 Updated this week
- Train your own SOTA deductive reasoning model ☆107 Updated 10 months ago
- Verifiers for LLM Reinforcement Learning ☆80 Updated 3 months ago
- Claude Deep Research config for Claude Code. ☆225 Updated 9 months ago
- Code to accompany the Universal Deep Research paper (https://arxiv.org/abs/2509.00244) ☆456 Updated 4 months ago
- Open-source release accompanying Gao et al. 2025 ☆478 Updated 3 weeks ago
- An interface library for RL post training with environments. ☆973 Updated this week
- Evolve your language agent with Agentic Context Engineering (ACE) ☆480 Updated last month
- CodeScientist: An automated scientific discovery system for code-based experiments ☆306 Updated last month
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast… ☆133 Updated this week
- Simple UI for debugging correlations of text embeddings ☆306 Updated 7 months ago
- Inference-time scaling for LLMs-as-a-judge. ☆320 Updated 2 months ago
- Simple & Scalable Pretraining for Neural Architecture Research ☆306 Updated last month
- OpenTinker is an RL-as-a-Service infrastructure for foundation models ☆499 Updated last week
- ☆136 Updated 9 months ago
- ⚖️ Awesome LLM Judges ⚖️ ☆148 Updated 8 months ago
- ☆801 Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆250 Updated this week