amazon-science / comm-prompt
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving (NAACL 2024 Findings)
☆16Updated last year
Alternatives and similar repositories for comm-prompt
Users who are interested in comm-prompt are comparing it to the libraries listed below.
- ☆13Updated 8 months ago
- Self-Reflection in LLM Agents: Effects on Problem-Solving Performance☆86Updated 10 months ago
- ☆25Updated 6 months ago
- Contrastive Chain-of-Thought Prompting☆68Updated last year
- ☆104Updated 10 months ago
- [ICLR 2024] MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use☆99Updated last year
- Code for paper "Factual Confidence of LLMs: on Reliability and Robustness of Current Estimators"☆15Updated 10 months ago
- ☆46Updated 11 months ago
- This is the implementation for the paper "LARGE LANGUAGE MODEL CASCADES WITH MIXTURE OF THOUGHT REPRESENTATIONS FOR COST-EFFICIENT REA…☆27Updated last year
- Reasoning Agentic Retrieval-Augmented Generation for Industry Challenges☆21Updated 5 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆37Updated 2 months ago
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆127Updated 8 months ago
- ☆38Updated 2 months ago
- a survey on deep research☆34Updated last month
- Source code of paper: Process vs. Outcome Reward: Which is Better for Agentic RAG Reinforcement Learning☆37Updated 3 months ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆44Updated last year
- [ACL'25] We propose a novel fine-tuning method, Separate Memory and Reasoning, which combines prompt tuning with LoRA.☆76Updated last month
- ☆50Updated 4 months ago
- Using Explanations as a Tool for Advanced LLMs☆67Updated last year
- ☆33Updated 11 months ago
- [NAACL 2025] The official implementation of paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language M…☆29Updated last year
- The implementation for CIKM 2024: Towards Completeness-Oriented Tool Retrieval for Large Language Models.☆23Updated 11 months ago
- ☆23Updated last year
- ☆19Updated last year
- [NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$☆49Updated 11 months ago
- CoNLI: a plug-and-play framework for ungrounded hallucination detection and reduction☆31Updated last year
- ☆18Updated 2 months ago
- ☆47Updated 4 months ago
- DataSciBench: An LLM Agent Benchmark for Data Science☆35Updated last month
- Automatic prompt optimization framework for multi-step agent tasks.☆34Updated 11 months ago