Evaluation harness for OpenHands V1.
☆75Apr 30, 2026Updated this week
Alternatives and similar repositories for benchmarks
Users that are interested in benchmarks are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Elastic computing platform☆31Updated this week
- Continual Memorization of Factoids in Large Language Models☆12Nov 20, 2024Updated last year
- Public Evaluation Result Archieve for BFCL☆29Dec 17, 2025Updated 4 months ago
- Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]☆18Apr 14, 2025Updated last year
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport☆13Nov 19, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆14May 14, 2024Updated last year
- ☆32Jan 31, 2026Updated 3 months ago
- Meta RL codebase for Unstable Baselines☆22Dec 6, 2022Updated 3 years ago
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆33Apr 14, 2024Updated 2 years ago
- KGym - A platform to run hundreds to thousands of ML4Linux kernel experiments at scale☆16Nov 8, 2025Updated 5 months ago
- ☆34Feb 4, 2026Updated 3 months ago
- The CompCert formally-verified C compiler☆11Updated this week
- This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur…☆11Aug 24, 2022Updated 3 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Jul 22, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- open IPython notebooks for the book of Scientific Computing with Python☆11Jul 16, 2015Updated 10 years ago
- A port of Grafx2 to DOS☆13Apr 3, 2022Updated 4 years ago
- ☆12Nov 26, 2024Updated last year
- A Natural Language Generation System☆14Apr 21, 2026Updated 2 weeks ago
- Predicting Out-of-Distribution Error with the Projection Norm☆19Jul 27, 2022Updated 3 years ago
- Color palette and swatches for macOS's color picker.☆20Jun 9, 2020Updated 5 years ago
- The TacTok automated Coq proof script synthesis tool☆17Jan 9, 2024Updated 2 years ago
- A Model Context Protocol (MCP) server implementation that provides file backup and restoration capabilities☆12Aug 8, 2025Updated 8 months ago
- PMP: Cost-Effective Forced Execution with Probabilistic Memory Pre-Planning☆13Sep 8, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆27Apr 7, 2026Updated 3 weeks ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data…☆27Oct 20, 2022Updated 3 years ago
- ☆14May 17, 2021Updated 4 years ago
- Convex Formulation of Multiple Instance Learning from Positive and Unlabeled Bags☆10Apr 28, 2018Updated 8 years ago
- Navigator Helpers☆11Nov 7, 2024Updated last year
- A naive interpreter for IR of NJU compiler principle lab3, to accelerate interpretation, the ir will be compiled to machine-friendly bina…☆16Jun 17, 2020Updated 5 years ago
- 面试经验记录☆14Sep 11, 2019Updated 6 years ago
- Efficient MCP tool calling in code mode for Claude Code☆22Dec 12, 2025Updated 4 months ago
- MCP server that allows Claude Code to interact with OpenAI Codex CLI☆21Aug 19, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [CVPR'24] LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning☆15Apr 17, 2026Updated 2 weeks ago
- This is a prototype of my Super Monkey Ball clone, Ultra Caveman Spheres, developed for my Youtube channel.☆12Jun 18, 2020Updated 5 years ago
- Specification language for generating Generalized Linear Models (with or without mixed effects) from conceptual models☆23Apr 26, 2022Updated 4 years ago
- GSOC 2017 - Apache Organization - # Implementation of Factorization Machines on Spark using parallel stochastic gradient descent (python…☆14Mar 26, 2017Updated 9 years ago
- An open source MCP proxy.☆17Jan 3, 2025Updated last year
- Vanderbilt Course Notes☆19Dec 11, 2020Updated 5 years ago
- open-source assistant with computer use agents☆25Mar 6, 2026Updated 2 months ago