heimy2000 / CMATLinks
☆20Updated last year
Alternatives and similar repositories for CMAT
Users that are interested in CMAT are comparing it to the libraries listed below
Sorting:
- Official code implementation for the ACL 2025 paper: 'CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis'☆27Updated 2 months ago
- ☆57Updated 3 weeks ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆67Updated 2 months ago
- MPO: Boosting LLM Agents with Meta Plan Optimization☆63Updated 4 months ago
- A Comprehensive Library for Memory of LLM-based Agents.☆52Updated 2 months ago
- ☆47Updated last month
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆142Updated 7 months ago
- MARFT stands for Multi-Agent Reinforcement Fine-Tuning. This repository implements an LLM-based multi-agent reinforcement fine-tuning fra…☆50Updated this week
- DSBench: How Far are Data Science Agents from Becoming Data Science Experts?☆58Updated 5 months ago
- ☆136Updated last month
- ☆102Updated 7 months ago
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆28Updated 3 weeks ago
- This is the code of MMOA-RAG.☆60Updated 2 months ago
- [ACL 2024] AutoAct: Automatic Agent Learning from Scratch for QA via Self-Planning☆229Updated 6 months ago
- Code implementation of synthetic continued pretraining☆118Updated 6 months ago
- ☆64Updated last month
- [NeurIPS 2024 D&B Track] GTA: A Benchmark for General Tool Agents☆112Updated 3 months ago
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performance☆77Updated 7 months ago
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆57Updated last year
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆103Updated last month
- This is the code repo for our paper "Autonomously Knowledge Assimilation and Accommodation through Retrieval-Augmented Agents".☆107Updated 8 months ago
- Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement (EMNLP 2024 Main Conference)☆60Updated 9 months ago
- The GitHub repository for the paper "Self-prompted Chain-of-Thought on Large Language Models for Open-domain Multi-hop Reasoning" accepte…☆19Updated last year
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆106Updated 5 months ago
- Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning☆81Updated last year
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆26Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆96Updated last month
- Automatic prompt optimization framework for multi-step agent tasks.☆31Updated 8 months ago
- AutoCoA (Automatic generation of Chain-of-Action) is an agent model framework that enhances the multi-turn tool usage capability of reaso…☆121Updated 4 months ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆201Updated last week