Evaluation harness for OpenHands V1.
☆62Mar 26, 2026Updated this week
Alternatives and similar repositories for benchmarks
Users that are interested in benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Elastic computing platform☆30Updated this week
- Public Evaluation Result Archieve for BFCL☆28Dec 17, 2025Updated 3 months ago
- TopoTrans: Optimal Transport meets Topological Data Analysis☆14Apr 20, 2023Updated 2 years ago
- Code and results accompanying our paper titled Leveraging Unlabeled Data to Predict Out-of-Distribution Performance at ICLR 2022☆10Dec 8, 2022Updated 3 years ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆13May 14, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [ICLR 2026] The official repository for the paper "AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning".☆80Feb 27, 2026Updated last month
- A universal workflow system for exactly-once DAGs☆23Jun 1, 2023Updated 2 years ago
- Game engine for website version avalon card-board game☆12Aug 2, 2025Updated 7 months ago
- KGym - A platform to run hundreds to thousands of ML4Linux kernel experiments at scale☆14Nov 8, 2025Updated 4 months ago
- ☆31Feb 4, 2026Updated last month
- Voxel-based Editor☆13Jul 11, 2018Updated 7 years ago
- ATP: Directed Graph Embedding with Asymmetric Transitivity Preservation☆10Apr 18, 2019Updated 6 years ago
- This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur…☆11Aug 24, 2022Updated 3 years ago
- ☆12May 17, 2021Updated 4 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Jul 22, 2021Updated 4 years ago
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering"☆15Oct 2, 2025Updated 5 months ago
- A Gateway for connecting application services in different domains, networks, and cloud infrastructures☆23Feb 1, 2026Updated last month
- Contains examples and assignments for my CS 254 course at Vanderbilt University, which can be accessed via http://www.dre.vanderbilt.edu/…☆14Apr 25, 2022Updated 3 years ago
- A port of Grafx2 to DOS☆12Apr 3, 2022Updated 3 years ago
- A Natural Language Generation System☆14Feb 17, 2024Updated 2 years ago
- Color palette and swatches for macOS's color picker.☆20Jun 9, 2020Updated 5 years ago
- The TacTok automated Coq proof script synthesis tool☆17Jan 9, 2024Updated 2 years ago
- PMP: Cost-Effective Forced Execution with Probabilistic Memory Pre-Planning☆13Sep 8, 2020Updated 5 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Never lose context again with a persistent, queryable memory system for AI agents and development teams.☆23Jan 29, 2026Updated last month
- hq2x scaling algorithm updated to support RGBA☆17Jan 14, 2016Updated 10 years ago
- ☆22Dec 25, 2025Updated 3 months ago
- AI-powered Python CLI tool that automates the entire software development lifecycle using the Claude Code SDK. From specification to depl…☆20Nov 17, 2025Updated 4 months ago
- ☆12Nov 19, 2024Updated last year
- Reflection library for Coq☆12Sep 26, 2019Updated 6 years ago
- MCPfy your scripts and tasks, half agent, half mcp server, fully at your command☆26Nov 4, 2025Updated 4 months ago
- ☆14May 17, 2021Updated 4 years ago
- ☆20Jul 8, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks☆20Sep 18, 2025Updated 6 months ago
- ☆35Mar 4, 2026Updated 3 weeks ago
- WizardsToolkit is a secure C library offering cross-platform cryptography, hashing, authentication, and data integrity tools. It supports…☆16Feb 16, 2026Updated last month
- Siren: Byzantine-robust Federated Learning via Proactive Alarming (SoCC '21)☆11Mar 28, 2024Updated last year
- Feature-level domain adaptation☆11Sep 6, 2019Updated 6 years ago
- Official repo for FSE'24 paper "CodeArt: Better Code Models by Attention Regularization When Symbols Are Lacking"☆18Mar 10, 2025Updated last year
- SDL2 emscripten port, non-upstreamed changes☆34May 7, 2021Updated 4 years ago