bageldotcom / bagel-RL
☆92 Updated 6 months ago
Alternatives and similar repositories for bagel-RL
Users that are interested in bagel-RL are comparing it to the libraries listed below
- Real-Time Detection of Hallucinated Entities in Long-Form Generation ☆275 Updated 2 months ago
- An MCP Server that's also an MCP Client. Useful for letting Claude develop and test MCPs without needing to reset the application. ☆124 Updated 10 months ago
- Inference-time scaling for LLMs-as-a-judge. ☆324 Updated 2 months ago
- ☆68 Updated 7 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments. ☆435 Updated last week
- An agent orchestration framework for economic agents ☆111 Updated 5 months ago
- ☆61 Updated last year
- The State Of The Art, intelligence ☆157 Updated 5 months ago
- Letting Claude Code develop his own MCP tools :) ☆122 Updated 10 months ago
- ☆128 Updated 4 months ago
- ☆176 Updated 10 months ago
- Simple examples using Argilla tools to build AI ☆57 Updated last year
- ☆90 Updated 11 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation) ☆61 Updated last year
- ☆92 Updated last year
- Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM. ☆141 Updated 4 months ago
- Public repository containing METR's DVC pipeline for eval data analysis ☆178 Updated 9 months ago
- ☆79 Updated 3 months ago
- oda-r is a professional-grade compiler for Declarative Self-improving Python (DSPy), featuring comprehensive error handling, logging, and… ☆38 Updated 11 months ago
- Deep research agents using MiniMax M2.1 interleaved thinking ☆192 Updated 3 weeks ago
- Context Engineering Course with DSPy ☆211 Updated 5 months ago
- GraphRAG database - hybrid graph / vector db ☆134 Updated last year
- Codebase from our first release. ☆39 Updated last week
- ☆36 Updated 11 months ago
- Verifiers for LLM Reinforcement Learning ☆80 Updated 4 months ago
- A framework for generative software. ☆115 Updated 6 months ago
- Finetune Llama-3-8b on the MathInstruct dataset ☆116 Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆90 Updated last month
- ☆87 Updated last year
- they've simulated websites, worlds, and imaginary CLIs... but what if they simulated *you*? ☆127 Updated 3 months ago