SecureAIAutonomyLab / MA-ToTLinks
☆9Updated 7 months ago
Alternatives and similar repositories for MA-ToT
Users that are interested in MA-ToT are comparing it to the libraries listed below
Sorting:
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performance☆75Updated 6 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆105Updated 4 months ago
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024)☆67Updated last month
- ☆89Updated last week
- The official code of paper “Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning”☆117Updated this week
- ☆104Updated last month
- ☆17Updated 7 months ago
- Maximizing the Performance of a Simple RAG using RL☆61Updated 2 months ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆32Updated last year
- ☆41Updated 7 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate"☆155Updated this week
- CORAL: Benchmarking Multi-turn Conversational Retrieval-Augmentation Generation☆52Updated 2 weeks ago
- This is the code of MMOA-RAG.☆53Updated 3 weeks ago
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆52Updated 7 months ago
- ☆12Updated 4 months ago
- Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering☆48Updated 3 months ago
- [NeurIPS 2024] Agent Planning with World Knowledge Model☆139Updated 5 months ago
- ☆102Updated 6 months ago
- [ICLR 2025] Benchmarking Agentic Workflow Generation☆94Updated 3 months ago
- MACM: Utilizing a Multi-Agent System for Condition Mining in Solving Complex Mathematical Problems☆86Updated 10 months ago
- Source code of DRAGIN, ACL 2024 main conference Long Paper (Oral)☆147Updated 3 months ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems☆91Updated 3 months ago
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆54Updated 2 weeks ago
- ☆60Updated 2 weeks ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆59Updated 4 months ago
- Benchmarking Multi-Agent Debate between Language Models for Truthfulness in Q&A.☆34Updated last year
- ☆47Updated 3 months ago
- This the implementation of LeCo☆31Updated 4 months ago
- ☆114Updated 4 months ago
- ☆83Updated 3 weeks ago