amazon-science / comm-prompt
CoMM: Collaborative Multi-Agent, Multi-Reasoning-Path Prompting for Complex Problem Solving (NAACL 2024 Findings)
☆11Updated 4 months ago
Related projects:
- [NAACL 2024 Findings] Evaluation suite for the systematic evaluation of instruction selection methods.☆23Updated last year
- DSBench: How Far are Data Science Agents Becoming Data Science Experts?☆20Updated this week
- Source code for the paper "Automatic Prompt Augmentation and Selection with Chain-of-Thought from Labeled Data"☆20Updated 6 months ago
- Automatic prompt optimization framework for multi-step agent tasks.☆15Updated 2 months ago
- This is the official repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆14Updated 6 months ago
- The official GitHub page for "Unleashing the Potential of Large Language Models as Prompt Optimizers: An Analogical Analysis with …☆13Updated 2 months ago
- Code implementation of synthetic continued pretraining☆13Updated this week
- Official repo of the AAAI 2024 paper "Mitigating the Impact of False Negatives in Dense Retrieval with Contrastive Confidence Regularization"☆12Updated 8 months ago
- Code repo for MathAgent☆13Updated 9 months ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆11Updated 3 weeks ago
- The open source implementation of "Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers"☆18Updated 6 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆26Updated 11 months ago
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆29Updated 7 months ago
- The official implementation of the paper "Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agen…☆20Updated 6 months ago
- RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning☆12Updated 5 months ago
- InstructRAG: Instructing Retrieval-Augmented Generation with Explicit Denoising☆32Updated 2 months ago
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023☆10Updated 9 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆28Updated 4 months ago
- Accompanying code for "Boosted Prompt Ensembles for Large Language Models"☆28Updated last year
- This repository includes a benchmark and code for the paper "Evaluating LLMs at Detecting Errors in LLM Responses".☆22Updated last month
- Official repository for the paper "GTA: A Benchmark for General Tool Agents"☆28Updated 2 months ago
- Tree prompting: easy-to-use scikit-learn interface for improved prompting.☆27Updated 10 months ago