OpenPipe / Summary-RLLinks
Train an agent to generate high quality summaries
β39Updated 5 months ago
Alternatives and similar repositories for Summary-RL
Users that are interested in Summary-RL are comparing it to the libraries listed below
Sorting:
- LLM reads a paper and produce a working prototypeβ60Updated 8 months ago
- π€ Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectuβ¦β26Updated 4 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.β99Updated 5 months ago
- β68Updated 7 months ago
- Train your own SOTA deductive reasoning modelβ107Updated 9 months ago
- Challenges for general-purpose web-browsing AI agentsβ67Updated 6 months ago
- β92Updated last month
- A user interface for DSPyβ202Updated 2 months ago
- β67Updated 5 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive argumentsβ94Updated 2 months ago
- Super basic implementation (gist-like) of RLMs with REPL environments.β286Updated 2 months ago
- β136Updated 9 months ago
- Claude Deep Research config for Claude Code.β224Updated 9 months ago
- βοΈ Awesome LLM Judges βοΈβ146Updated 7 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data generaβ¦β246Updated this week
- Clue inspired puzzles for testing LLM deduction abilitiesβ45Updated 9 months ago
- β17Updated 10 months ago
- β59Updated 10 months ago
- Public repository containing METR's DVC pipeline for eval data analysisβ164Updated 8 months ago
- A framework for pitting LLMs against each other in an evolving library of games ββ34Updated 8 months ago
- A framework for orchestrating AI agents using a mermaid graphβ77Updated last year
- A mcp server that uses the Osmosis-Apply-1.7B model to apply code mergesβ53Updated 5 months ago
- An automated tool for discovering insights from research papaer corporaβ137Updated last year
- β79Updated 2 months ago
- Code for ScribeAgent paperβ63Updated 9 months ago
- β90Updated 11 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optunaβ59Updated 2 months ago
- Leveraging DSPy for AI-driven task understanding and solution generation, the Self-Discover Framework automates problem-solving through rβ¦β72Updated last month
- β57Updated 10 months ago
- Training setup for Langchain's Open Deep Researchβ73Updated 3 months ago