du-nlp-lab / MLR-Copilot
☆62 · Updated 3 weeks ago
Alternatives and similar repositories for MLR-Copilot:
Users who are interested in MLR-Copilot are comparing it to the repositories listed below:
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆34 · Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆55 · Updated 7 months ago
- ☆24 · Updated 7 months ago
- Code for the paper: CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models ☆18 · Updated 3 weeks ago
- Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆84 · Updated last month
- ReBase: Training Task Experts through Retrieval Based Distillation