sileix / chain-of-draft
Code and data for the Chain-of-Draft (CoD) paper
☆274 Updated 2 months ago
Alternatives and similar repositories for chain-of-draft
Users who are interested in chain-of-draft are comparing it to the repositories listed below
- Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks ☆188 Updated last week
- AWM: Agent Workflow Memory ☆269 Updated 3 months ago
- Official code repository for Sketch-of-Thought (SoT) ☆112 Updated this week
- ☆199 Updated 2 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀 ☆114 Updated 2 months ago
- OS-ATLAS: A Foundation Action Model For Generalist GUI Agents ☆332 Updated 3 weeks ago
- Official repo for paper: "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't" ☆222 Updated last month
- Official codebase for "Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling". ☆254 Updated 2 months ago
- Code for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" ☆144 Updated 3 weeks ago
- ReCall: Learning to Reason with Tool Call for LLMs via Reinforcement Learning ☆835 Updated last week
- Atom of Thoughts for Markov LLM Test-Time Scaling ☆562 Updated last week
- Tina: Tiny Reasoning Models via LoRA ☆192 Updated 2 weeks ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆216 Updated 6 months ago
- ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates ☆382 Updated this week
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents ☆201 Updated 3 weeks ago
- ☆176 Updated 2 weeks ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments. ☆345 Updated last month
- AN O1 REPLICATION FOR CODING ☆333 Updated 5 months ago
- Building Open LLM Web Agents with Self-Evolving Online Curriculum RL ☆373 Updated last week
- Official codebase for "SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution" ☆518 Updated last month
- This is the official repository for Auto-RAG. ☆208 Updated 3 weeks ago
- [ICML2025] Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction ☆289 Updated 2 months ago
- ☆524 Updated 3 weeks ago
- [ICML 2025 Spotlight] CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction ☆520 Updated last week
- Search-o1: Agentic Search-Enhanced Large Reasoning Models ☆851 Updated last week
- [ICLR'25 Oral] UGround: Universal GUI Visual Grounding for GUI Agents ☆219 Updated last week
- A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning ☆180 Updated last week
- Resources for our paper: "Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training" ☆130 Updated last month
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆230 Updated 8 months ago
- Official implementation of paper "On the Diagram of Thought" (https://arxiv.org/abs/2409.10038) ☆179 Updated last month