[ACL2026 Main] AgencyBench: Benchmarking the Frontiers of Autonomous Agents in 1M-Token Real-World Contexts
☆87Jan 23, 2026Updated 4 months ago
Alternatives and similar repositories for AgencyBench
Users that are interested in AgencyBench are comparing it to the libraries listed below.
- [ICML 2026 Oral] Agent-native Mid-training for Software Engineering☆59Jun 7, 2026Updated last week
- The official implementation of Bi-Mamba☆17Oct 22, 2025Updated 7 months ago
- [ICLR 2026] The official repository for the paper "AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning".☆81Feb 27, 2026Updated 3 months ago
- Official PyTorch code for ICLR 2025 paper "Gnothi Seauton: Empowering Faithful Self-Interpretability in Black-Box Models"☆23Mar 4, 2025Updated last year
- 🎨 Single-file distributable React posters — one .tsx file, every format you'll ever need. Works as a CLI and as a library.☆68May 16, 2026Updated 3 weeks ago
- Official repo for `Thinking with Images through Self-Calling`☆26Dec 28, 2025Updated 5 months ago
- [WWW '24] UnifiedSSR: A Unified Framework of Sequential Search and Recommendation☆12Feb 16, 2024Updated 2 years ago
- Deep Learning 2021 in School of Data Science, USTC☆12May 17, 2023Updated 3 years ago
- [AAAI 2025 (Oral)] SAIL: Sample-Centric In-Context Learning for Document Information Extraction☆20Dec 24, 2024Updated last year
- Your friendly terminal-based AI pair programmer☆41Jun 5, 2023Updated 3 years ago
- Code generation from natural language with less prior and more monolingual data☆12Aug 24, 2021Updated 4 years ago
- A scalable benchmark for state representation learning in visual reinforcement learning.☆17Jun 23, 2025Updated 11 months ago
- Templates and examples for ACL and EMNLP conference posters.☆14Oct 5, 2024Updated last year
- A library of works related to hallucination in Large Language Model (LLM)-based agents☆59Oct 30, 2025Updated 7 months ago
- Vim plugin to copy text to Windows clipboard on WSL☆12Jan 8, 2023Updated 3 years ago
- Table logger using Rich☆13Aug 13, 2025Updated 10 months ago
- ☆20May 14, 2024Updated 2 years ago
- walterra's collections of helpers for agentic coding☆34Mar 23, 2026Updated 2 months ago
- ☆12Sep 23, 2024Updated last year
- JAX implementation of the Mistral 7b v0.1 model☆13Mar 27, 2024Updated 2 years ago
- ☆13Jul 14, 2024Updated last year
- (🔥ICML2026) Reward Auditor: Inference on Reward Modeling Suitability in Real-World Perturbed Scenarios☆36Jan 24, 2026Updated 4 months ago
- ☆14Jul 12, 2022Updated 3 years ago
- This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).☆20Feb 17, 2025Updated last year
- [ACL 2023] To Copy Rather Than Memorize: A Vertical Learning Paradigm for Knowledge Graph Completion☆12Feb 3, 2023Updated 3 years ago
- Reproduction resources for the linear alignment paper (work in progress)☆17May 19, 2024Updated 2 years ago
- Official implementation for "ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation"☆21Jan 31, 2026Updated 4 months ago
- ☆43Nov 8, 2025Updated 7 months ago
- ☆12Oct 9, 2020Updated 5 years ago
- ☆42Jun 2, 2026Updated last week
- ☆34Jun 1, 2026Updated 2 weeks ago
- [CVPR' 25] Official repo for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Cal…☆22Jun 6, 2025Updated last year