qiancheng0 / Open-SMARTAgent
The official repo for the code and data of paper SMART
☆22Updated last month
Alternatives and similar repositories for Open-SMARTAgent:
Users that are interested in Open-SMARTAgent are comparing it to the libraries listed below
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆33Updated last year
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms"☆86Updated 5 months ago
- ☆34Updated 3 months ago
- ☆82Updated last month
- Systematic evaluation framework that automatically rates overthinking behavior in large language models.☆80Updated last month
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆83Updated last week
- LongHeads: Multi-Head Attention is Secretly a Long Context Processor☆29Updated 11 months ago
- ☆56Updated 6 months ago
- Code for Paper: Teaching Language Models to Critique via Reinforcement Learning☆84Updated last month
- Implementation of "SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models"☆26Updated last month
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆45Updated last month
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆52Updated 9 months ago
- Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models☆35Updated 5 months ago
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆75Updated 2 months ago
- ☆24Updated 6 months ago
- This repository contains popular code generation frameworks such as MapCoder, CodeSIM.☆34Updated last month
- [Preprint] An inference-time decoding strategy with adaptive foresight sampling☆79Updated this week
- Benchmarking Chat Assistants on Long-Term Interactive Memory (ICLR 2025)☆57Updated last month
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆78Updated 3 weeks ago
- Code for Paper: Harnessing Webpage Uis For Text Rich Visual Understanding☆50Updated 3 months ago
- [NAACL'25] "Revealing the Barriers of Language Agents in Planning"☆12Updated 5 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆68Updated last week
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning☆48Updated last month
- [NeurIPS 2024] OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI☆96Updated 3 weeks ago
- ☆42Updated last month
- Code for EMNLP 2024 paper "Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning"☆53Updated 5 months ago
- Official Repository of Are Your LLMs Capable of Stable Reasoning?☆22Updated last week
- (ICLR 2025) The Official Code Repository for GUI-World.☆53Updated 3 months ago
- Research Code for preprint "Optimizing Test-Time Compute via Meta Reinforcement Finetuning".☆74Updated 2 weeks ago
- Source code for our paper: "Put Your Money Where Your Mouth Is: Evaluating Strategic Planning and Execution of LLM Agents in an Auction A…☆44Updated last year