Reached #1 on Stanford's Terminal Bench leaderboard. New SOTA on agentic coding. Sharing some insights on how it is built and some ablation studies on different techniques
☆66Nov 3, 2025Updated 5 months ago
Alternatives and similar repositories for Apex2-Terminal-Bench-Agent
Users that are interested in Apex2-Terminal-Bench-Agent are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆40Apr 20, 2026Updated last week
- ☆38Jan 10, 2026Updated 3 months ago
- A protocol adapter that lets the Codex Desktop GUI work with alternative AI backends.☆73Apr 8, 2026Updated 3 weeks ago
- Agent skill for managing Omarchy Linux systems with natural language☆21Jan 5, 2026Updated 3 months ago
- ☆28Apr 2, 2025Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Pi skill for multi-agent workflow orchestration with file-based handoff☆33Jan 18, 2026Updated 3 months ago
- An AI agent that stays running, remembers across sessions, and checks in on its own. macOS, Linux, Android. Built on Pi.☆354Apr 22, 2026Updated last week
- Parse SVG files and render them as PNG, PDF, SVG, or raw memory buffer images.☆16May 7, 2019Updated 6 years ago
- Most of my *nix-y configuration files.☆12Mar 25, 2021Updated 5 years ago
- Refresh Sparse items Plugin for Jellyfin☆12Feb 4, 2026Updated 2 months ago
- Uses mininet and vizceral to visualize some interesting topologies with interactivity☆12Aug 9, 2018Updated 7 years ago
- an AI rock tumbler (orchestrator)☆40Feb 24, 2026Updated 2 months ago
- 🦀 rtoon is the official Rust implementation of the Token-Oriented Object Notation (TOON) — a compact, human-readable, token-efficient f…☆19Nov 3, 2025Updated 5 months ago
- Bit database☆17Nov 28, 2019Updated 6 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Benchmarking Goal-Oriented Software Engineering☆144Jan 7, 2026Updated 3 months ago
- Platform for creating audio-first AI assistants that can work offline using a flexible plugin architecture☆13Jun 29, 2025Updated 10 months ago
- Discover nearby FlyWeb services in Chrome / Chrome OS☆10Nov 7, 2016Updated 9 years ago
- Web Extension that allows webpages to access Secure Scuttlebutt☆14Jan 20, 2021Updated 5 years ago
- Enables browser and plugin spell checking ability within static website content. Simplifies QA workflows when no other tools or text sour…☆11Jun 11, 2023Updated 2 years ago
- A drop-in replacement of ssb-keys, implemented in Rust and delivered as a native module in Node.js☆13Dec 15, 2023Updated 2 years ago
- 🔒 Isomorphic crypto package for node and the browser.☆12Aug 22, 2017Updated 8 years ago
- Minimal Ethereum RPC Client in Rust☆11Nov 18, 2023Updated 2 years ago
- Code for "Zero-Shot Out-of-Distribution Detection with Feature Correlations"☆13Jan 19, 2020Updated 6 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆21Dec 23, 2018Updated 7 years ago
- peermaps roadmap and discussion repo☆21Jan 11, 2019Updated 7 years ago
- ☆14Oct 11, 2021Updated 4 years ago
- a flume-like persisted append-only log implementation☆19Mar 8, 2026Updated last month
- ☆18Dec 2, 2025Updated 5 months ago
- ☆18Sep 15, 2025Updated 7 months ago
- A Uniswap Fork enabling ERC20 to ERC20 trades☆12Jan 24, 2023Updated 3 years ago
- flashbots builder docker compose☆12Jun 28, 2023Updated 2 years ago
- git remote for hypergit☆14Jun 17, 2018Updated 7 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆21May 25, 2017Updated 8 years ago
- 📊 LLM Context Benchmarks - A comprehensive benchmarking tool for testing LLMs with varying context sizes using Ollama. Features dual b…☆60Updated this week
- ☆26Aug 21, 2018Updated 7 years ago
- Local economic development software...alas the homepage has been taken over by somebody who sprays poisons..☆17Mar 26, 2017Updated 9 years ago
- social coding web UI on secure-scuttlebutt☆10Jun 11, 2018Updated 7 years ago
- An ssb client for image sharing.☆11Apr 16, 2019Updated 7 years ago
- ☆10Dec 18, 2020Updated 5 years ago