vsubramaniam851 / multiagent-ft
☆171Updated last week
Alternatives and similar repositories for multiagent-ft:
Users that are interested in multiagent-ft are comparing it to the libraries listed below
- ☆98Updated last month
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆167Updated last month
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆84Updated 4 months ago
- 🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.☆210Updated last week
- ☆109Updated last week
- AWM: Agent Workflow Memory☆245Updated last month
- official implementation of paper "Process Reward Model with Q-value Rankings"☆49Updated 3 weeks ago
- ☆95Updated 8 months ago
- Code for NeurIPS'24 paper 'Grokked Transformers are Implicit Reasoners: A Mechanistic Journey to the Edge of Generalization'☆183Updated 3 months ago
- ☆54Updated 2 months ago
- Code for the paper 🌳 Tree Search for Language Model Agents☆184Updated 7 months ago
- Code and example data for the paper: Rule Based Rewards for Language Model Safety☆179Updated 7 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆138Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆163Updated 2 weeks ago
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).☆204Updated last week
- ☆152Updated 3 weeks ago
- What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆63Updated this week
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆74Updated last month
- Repo of paper "Free Process Rewards without Process Labels"☆128Updated last month
- A simple unified framework for evaluating LLMs☆199Updated last month
- Self-playing Adversarial Language Game Enhances LLM Reasoning, NeurIPS 2024☆120Updated last week
- Code for Paper: Training Software Engineering Agents and Verifiers with SWE-Gym☆369Updated this week
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆114Updated 2 months ago
- Code for Paper: Autonomous Evaluation and Refinement of Digital Agents [COLM 2024]☆125Updated 3 months ago