octotools / octotools
OctoTools: An agentic framework with extensible tools for complex reasoning
☆1,401 · Updated 3 months ago
Alternatives and similar repositories for octotools
Users who are interested in octotools are comparing it to the libraries listed below.
- Official Code of Memento: Fine-tuning LLM Agents without Fine-tuning LLMs ☆2,147 · Updated 3 months ago
- A-MEM: Agentic Memory for LLM Agents ☆767 · Updated last month
- 👩‍⚖️ Agent-as-a-Judge: The Magic for Open-Endedness ☆707 · Updated 8 months ago
- Official Repo for ICML 2024 paper "Executable Code Actions Elicit Better LLM Agents" by Xingyao Wang, Yangyi Chen, Lifan Yuan, Yizhe Zhan… ☆1,547 · Updated last year
- Windows Agent Arena (WAA) 🪟 is a scalable OS platform for testing and benchmarking of multi-modal AI agents. ☆808 · Updated 8 months ago
- [ICLR 2025] Automated Design of Agentic Systems ☆1,487 · Updated 11 months ago
- A framework for comprehensive diagnosis and optimization of agents using simulated, realistic synthetic interactions ☆1,158 · Updated 2 weeks ago
- Darwin Gödel Machine: Open-Ended Evolution of Self-Improving Agents ☆1,789 · Updated 5 months ago
- End-to-end Generative Optimization for AI Agents ☆704 · Updated last month
- Integrating Tool Use into LLM Reasoning ☆704 · Updated 10 months ago
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025] ☆1,144 · Updated 2 months ago
- AIDE: AI-Driven Exploration in the Space of Code. The machine learning engineering agent that automates AI R&D. ☆1,113 · Updated 2 months ago
- An agent benchmark with tasks in a simulated software company. ☆622 · Updated 2 months ago
- [NeurIPS 2025] 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability ☆1,387 · Updated last month
- Synthetic data curation for post-training and structured data extraction ☆1,594 · Updated last week
- [NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling ☆631 · Updated last month
- ☆1,242 · Updated 2 months ago
- The code for NeurIPS 2025 paper "A-Mem: Agentic Memory for LLM Agents" ☆747 · Updated 3 weeks ago
- Cache-Augmented Generation: A Simple, Efficient Alternative to RAG ☆1,463 · Updated 7 months ago
- ZeroSearch: Incentivize the Search Capability of LLMs without Searching ☆1,229 · Updated 5 months ago
- MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering ☆1,276 · Updated this week
- xLAM: A Family of Large Action Models to Empower AI Agent Systems ☆598 · Updated 4 months ago
- Official Implementation of "KBLaM: Knowledge Base augmented Language Model" ☆1,434 · Updated 3 months ago
- An open-source framework for collaborative AI agents, enabling diverse, distributed agents to team up and tackle complex tasks through in… ☆791 · Updated 3 months ago
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A… ☆961 · Updated 7 months ago
- A Self-adaptation Framework 🐙 that adapts LLMs for unseen tasks in real-time! ☆1,179 · Updated 11 months ago
- An Open Large Reasoning Model for Real-World Solutions ☆1,534 · Updated 7 months ago
- E2B Desktop Sandbox for LLMs. E2B Sandbox with desktop graphical environment that you can connect to any LLM for secure computer use. ☆1,219 · Updated last week
- This is a collection of resources for computer-use GUI agents, including videos, blogs, papers, and projects. ☆476 · Updated 2 months ago
- Agentless 🐱: an agentless approach to automatically solve software development problems ☆1,998 · Updated last year