bageldotcom / bagel-RL
☆91 Updated 5 months ago
Alternatives and similar repositories for bagel-RL
Users who are interested in bagel-RL are comparing it to the libraries listed below.
- Real-Time Detection of Hallucinated Entities in Long-Form Generation ☆271 Updated last month
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆280 Updated 2 months ago
- ☆62 Updated last year
- ☆90 Updated 11 months ago
- GraphRAG database - hybrid graph / vector db ☆134 Updated last year
- An agent orchestration framework for economic agents ☆109 Updated 4 months ago
- oda-r is a professional-grade compiler for Declarative Self-improving Python (DSPy), featuring comprehensive error handling, logging, and… ☆38 Updated 10 months ago
- Inference-time scaling for LLMs-as-a-judge. ☆316 Updated last month
- The official code implementation for "Cache-to-Cache: Direct Semantic Communication Between Large Language Models" ☆293 Updated last week
- Routing on Random Forest (RoRF) ☆233 Updated last year
- The State Of The Art, intelligence ☆157 Updated 4 months ago
- ☆36 Updated 10 months ago
- ☆173 Updated 9 months ago
- Deep research agents using MiniMax-M2 interleaved thinking ☆143 Updated 3 weeks ago
- Letting Claude Code develop his own MCP tools :) ☆122 Updated 9 months ago
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate… ☆43 Updated last year
- Verbosity control for AI agents ☆64 Updated last year
- Simple examples using Argilla tools to build AI ☆56 Updated last year
- Analysis code for NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks" ☆55 Updated 4 months ago
- Verifiers for LLM Reinforcement Learning ☆79 Updated 3 months ago
- Simple UI for debugging correlations of text embeddings ☆305 Updated 6 months ago
- One click templates for inferencing Language Models ☆222 Updated 3 weeks ago
- Tutorial for building LLM router ☆239 Updated last year
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated 10 months ago
- DevQualityEval: An evaluation benchmark 📈 and framework to compare and evolve the quality of code generation of LLMs. ☆182 Updated 7 months ago
- A tool that facilitates easy, efficient and high-quality fine-tuning of Cohere's models ☆76 Updated 9 months ago
- ☆125 Updated 3 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆241 Updated last week
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖 ☆76 Updated last year
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application. ☆123 Updated 9 months ago