bageldotcom / bagel-RLLinks
☆93Updated 7 months ago
Alternatives and similar repositories for bagel-RL
Users that are interested in bagel-RL are comparing it to the libraries listed below
Sorting:
- Real-Time Detection of Hallucinated Entities in Long-Form Generation☆278Updated 2 months ago
- ☆94Updated last year
- Official CLI and Python SDK for Prime Intellect - access GPU compute, remote sandboxes, RL environments, and distributed training infrast…☆151Updated this week
- ☆67Updated 8 months ago
- ☆129Updated 4 months ago
- Inference-time scaling for LLMs-as-a-judge.☆328Updated 3 months ago
- The State Of The Art, intelligence☆157Updated 5 months ago
- An agent orchestration framework for economic agents☆112Updated 5 months ago
- GraphRAG database - hybrid graph / vector db☆134Updated last year
- Simple examples using Argilla tools to build AI☆57Updated last year
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*?☆127Updated 4 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆77Updated 11 months ago
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models☆76Updated 10 months ago
- Routing on Random Forest (RoRF)☆239Updated last year
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application.☆124Updated 11 months ago
- Simple UI for debugging correlations of text embeddings☆305Updated 8 months ago
- ☆80Updated 4 months ago
- A framework for orchestrating AI agents using a mermaid graph☆76Updated last year
- Analysis code for Neurips 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆56Updated 6 months ago
- Train your own SOTA deductive reasoning model☆107Updated 11 months ago
- Claude Deep Research config for Claude Code.☆226Updated 10 months ago
- Context Engineering Course with DSPy☆214Updated 6 months ago
- ☆37Updated last year
- Letting Claude Code develop his own MCP tools :)☆123Updated 11 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆261Updated last week
- Verifiers for LLM Reinforcement Learning☆81Updated 4 months ago
- oda-r is a professional-grade compiler for Declarative Self-improving Python (DSPy), featuring comprehensive error handling, logging, and…☆39Updated last year
- An automated tool for discovering insights from research papaer corpora☆137Updated last year
- Super basic implementation (gist-like) of RLMs with REPL environments.☆592Updated last month
- Codebase from our first release.☆43Updated last month