OpenPipe / Summary-RLLinks
Train an agent to generate high quality summaries
☆35Updated 2 months ago
Alternatives and similar repositories for Summary-RL
Users that are interested in Summary-RL are comparing it to the libraries listed below
Sorting:
- Challenges for general-purpose web-browsing AI agents☆64Updated 3 months ago
- j1-micro (1.7B) & j1-nano (600M) are absurdly tiny but mighty reward models.☆96Updated last month
- LLM reads a paper and produce a working prototype☆57Updated 4 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025)☆92Updated 7 months ago
- An automated tool for discovering insights from research papaer corpora☆138Updated last year
- Train your own SOTA deductive reasoning model☆105Updated 6 months ago
- OSS RL environment + evals toolkit☆159Updated this week
- 🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectu…☆23Updated last month
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆55Updated 7 months ago
- ☆133Updated 5 months ago
- ⚖️ Awesome LLM Judges ⚖️☆128Updated 4 months ago
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments☆88Updated 11 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?