sileix / chain-of-draft
Code and data for the Chain-of-Draft (CoD) paper
☆336 · Updated 9 months ago
Alternatives and similar repositories for chain-of-draft
Users who are interested in chain-of-draft are comparing it to the repositories listed below.
- Official code repository for Sketch-of-Thought (SoT) ☆129 · Updated 7 months ago
- ☆226 · Updated 10 months ago
- Training teachers with reinforcement learning able to make LLMs learn how to reason for test time scaling. ☆353 · Updated 6 months ago
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆165 · Updated 2 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents ☆297 · Updated 2 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory ☆230 · Updated 7 months ago
- Official repository for DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research ☆476 · Updated this week
- Beating the GAIA benchmark with Transformers Agents. 🚀 ☆142 · Updated 10 months ago
- Accompanying material for sleep-time compute paper ☆118 · Updated 7 months ago
- ☆73 · Updated 2 months ago
- This is the official repository for Auto-RAG. ☆231 · Updated 5 months ago
- Tina: Tiny Reasoning Models via LoRA ☆310 · Updated 3 months ago
- AWM: Agent Workflow Memory ☆375 · Updated last week
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents ☆422 · Updated 8 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents ☆220 · Updated 6 months ago
- Code for the paper: "Learning to Reason without External Rewards" ☆383 · Updated 5 months ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't" ☆270 · Updated 2 months ago
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling". ☆277 · Updated 10 months ago
- Official Repository for "Glyph: Scaling Context Windows via Visual-Text Compression" ☆539 · Updated last month
- [ICML 2025 Oral] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction ☆565 · Updated 7 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038) ☆189 · Updated 4 months ago
- xLAM: A Family of Large Action Models to Empower AI Agent Systems ☆595 · Updated 4 months ago
- ☆227 · Updated last month
- Code and implementations for the paper "AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcemen… ☆537 · Updated 3 months ago
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling ☆620 · Updated last month
- Gödel Agent: A Self-Referential Agent Framework for Recursive Self-Improvement ☆145 · Updated 3 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆139 · Updated last year
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ☆86 · Updated 3 weeks ago
- 👩‍⚖️ Agent-as-a-Judge: The Magic for Open-Endedness ☆693 · Updated 7 months ago
- Benchmark and research code for the paper "SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks" ☆254 · Updated 7 months ago