HyperPotatoNeo / RSALinks
☆73Updated last month
Alternatives and similar repositories for RSA
Users that are interested in RSA are comparing it to the libraries listed below
Sorting:
- ☆77Updated last week
- accompanying material for sleep-time compute paper☆117Updated 6 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆56Updated 4 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆34Updated 6 months ago
- Verifiers for LLM Reinforcement Learning☆79Updated 7 months ago
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆58Updated 3 weeks ago
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆94Updated 5 months ago
- Leveraging Base Language Models for Few-Shot Synthetic Data Generation☆37Updated 3 weeks ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆63Updated 11 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆72Updated this week
- QAlign is a new test-time alignment approach that improves language model performance by using Markov chain Monte Carlo methods.☆24Updated 2 weeks ago
- ☆60Updated 4 months ago
- ScreenSuite - The most comprehensive benchmarking suite for GUI Agents!☆131Updated last month
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval☆51Updated last year
- ☆25Updated 5 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆100Updated this week
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 8 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆72Updated 6 months ago
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning?☆82Updated 7 months ago
- ☆55Updated last year
- Train your own SOTA deductive reasoning model☆108Updated 8 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning"☆101Updated 2 months ago
- Source code of "How to Correctly do Semantic Backpropagation on Language-based Agentic Systems" 🤖☆76Updated 11 months ago
- ☆81Updated this week
- Code for RATIONALYST: Pre-training Process-Supervision for Improving Reasoning https://arxiv.org/pdf/2410.01044☆35Updated last year
- Source code for the collaborative reasoner research project at Meta FAIR.☆105Updated 6 months ago
- NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer☆44Updated last year
- LLM reads a paper and produce a working prototype☆57Updated 7 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆109Updated 5 months ago