colonylabs / ScribeAgent
Code for ScribeAgent paper
☆63 Updated 10 months ago
Alternatives and similar repositories for ScribeAgent
Users who are interested in ScribeAgent are comparing it to the libraries listed below
- ☆86 Updated last year
- Challenges for general-purpose web-browsing AI agents ☆67 Updated 7 months ago
- Computer Agent Arena: Test & compare AI agents in real desktop apps & web environments. Code/data coming soon! ☆51 Updated 9 months ago
- SiriuS: Self-improving Multi-agent Systems via Bootstrapped Reasoning ☆87 Updated last month
- LLM reads a paper and produces a working prototype ☆60 Updated 8 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀 ☆143 Updated 10 months ago
- ☆63 Updated 6 months ago
- ☆74 Updated 3 months ago
- [ICLR 2025] A trinity of environments, tools, and benchmarks for general virtual agents ☆219 Updated 6 months ago
- Official homepage for "Self-Harmonized Chain of Thought" (NAACL 2025) ☆91 Updated 11 months ago
- [EMNLP 2025] The official implementation for paper "Agentic-R1: Distilled Dual-Strategy Reasoning" ☆101 Updated 4 months ago
- ☆173 Updated 10 months ago
- ☆125 Updated last year
- Official code repository for Sketch-of-Thought (SoT) ☆130 Updated 8 months ago
- Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory ☆231 Updated 7 months ago
- WebLINX is a benchmark for building web navigation agents with conversational capabilities ☆156 Updated 10 months ago
- Resources for our paper: "EvoAgent: Towards Automatic Multi-Agent Generation via Evolutionary Algorithms" ☆139 Updated last year
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT) ☆126 Updated 10 months ago
- DeepDive: Advancing Deep Search Agents with Knowledge Graphs and Multi-Turn RL ☆224 Updated 3 months ago
- The Library for LLM-based multi-agent applications ☆100 Updated 5 months ago
- ☆105 Updated last year
- [NeurIPS 2024] Spider2-V: How Far Are Multimodal Agents From Automating Data Science and Engineering Workflows? ☆136 Updated last year
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆119 Updated 6 months ago
- EcoAssistant: using LLM assistant more affordably and accurately ☆133 Updated last year
- Agent computer interface for AI software engineer. ☆113 Updated last month
- ☆93 Updated 2 months ago
- Analysis code for NeurIPS 2025 paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks" ☆55 Updated 5 months ago
- Designing Multi-Agent Systems with Zero Supervision ☆107 Updated 6 months ago
- Source code for our paper: "SelfGoal: Your Language Agents Already Know How to Achieve High-level Goals". ☆69 Updated last year
- ReDel is a toolkit for researchers and developers to build, iterate on, and analyze recursive multi-agent systems. (EMNLP 2024 Demo) ☆89 Updated 3 weeks ago