sileix / chain-of-draftLinks
Code and data for the Chain-of-Draft (CoD) paper
☆313Updated 4 months ago
Alternatives and similar repositories for chain-of-draft
Users that are interested in chain-of-draft are comparing it to the libraries listed below
Sorting:
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling.☆324Updated last month
- ☆212Updated 5 months ago
- Tina: Tiny Reasoning Models via LoRA☆274Updated 2 months ago
- Official code repository for Sketch-of-Thought (SoT)☆125Updated 3 months ago
- Atom of Thoughts for Markov LLM Test-Time Scaling☆580Updated last month
- A MemAgent framework that can be extrapolated to 3.5M, along with a training framework for RL training of any agent workflow.☆588Updated last week
- xLAM: A Family of Large Action Models to Empower AI Agent Systems☆513Updated last week
- AWM: Agent Workflow Memory☆300Updated 6 months ago
- Official Code Repository for the paper "Distilling LLM Agent into Small Models with Retrieval and Code Tools"☆130Updated this week
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents☆214Updated last month
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training"☆154Updated last month
- This is InfiniRetri, a tool enhance Transformer-based LLMs(Large Language Model) ablity to hangle Long-Context.☆115Updated 4 months ago
- Code for the paper: "Learning to Reason without External Rewards"☆344Updated 3 weeks ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't"☆248Updated 2 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs".☆235Updated 11 months ago
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction☆537Updated 3 months ago
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling".☆268Updated 5 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents☆363Updated 3 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀☆131Updated 5 months ago
- Complex Function Calling Benchmark.☆123Updated 6 months ago
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks☆233Updated 3 months ago
- Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and …☆345Updated last year
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038)☆184Updated 4 months ago
- ☆288Updated 2 months ago
- This is the official repository for Auto-RAG.☆218Updated 3 weeks ago
- ☆87Updated 2 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆101Updated last week
- ⚖️ The First Coding Agent-as-a-Judge☆594Updated 2 months ago
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents☆245Updated this week
- ☆194Updated 11 months ago