dinobby / MAGDiLinks
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models. Paper: https://arxiv.org/abs/2402.01620
☆34Updated last year
Alternatives and similar repositories for MAGDi
Users that are interested in MAGDi are comparing it to the libraries listed below
Sorting:
- ☆65Updated 2 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 5 months ago
- Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments (EMNLP'2024)☆37Updated 5 months ago
- ☆27Updated 2 weeks ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆48Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆17Updated last week
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- ☆24Updated 9 months ago
- A testbed for agents and environments that can automatically improve models through data generation.☆24Updated 3 months ago
- Codebase for Instruction Following without Instruction Tuning☆34Updated 8 months ago
- Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data☆38Updated 4 months ago
- [ACL 2025] Are Your LLMs Capable of Stable Reasoning?☆25Updated 3 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- ☆25Updated last year
- Scalable Meta-Evaluation of LLMs as Evaluators☆42Updated last year
- This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers☆30Updated last year
- Evaluation on Logical Reasoning and Abstract Reasoning Challenges☆27Updated 2 months ago
- ☆20Updated last month
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆42Updated 8 months ago
- ☆32Updated 5 months ago
- Official Repo for InSTA: Towards Internet-Scale Training For Agents☆42Updated this week
- ☆53Updated this week
- Evaluate the Quality of Critique☆35Updated last year
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Syntax Error-Free and Generalizable Tool Use for LLMs via Finite-State Decoding☆27Updated last year
- ☆36Updated last week
- Repository for NPHardEval, a quantified-dynamic benchmark of LLMs☆54Updated last year
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆42Updated 7 months ago