Evaluation harness for OpenHands V1.
☆63 · Apr 9, 2026 · Updated this week
Alternatives and similar repositories for benchmarks
Users that are interested in benchmarks are comparing it to the libraries listed below.
- Elastic computing platform ☆31 · Updated this week
- [NeurIPS 2025] GuideFlow3D: Optimization-Guided Rectified Flow For Appearance Transfer ☆25 · Mar 20, 2026 · Updated 3 weeks ago
- QLoRA: Efficient Finetuning of Quantized LLMs ☆11 · Jun 1, 2023 · Updated 2 years ago
- [NeurIPS 2025] Official codebase for T2DA: Offline Meta-RL from Natural Language Supervision ☆15 · Jun 1, 2025 · Updated 10 months ago
- Continual Memorization of Factoids in Large Language Models ☆12 · Nov 20, 2024 · Updated last year
- Public Evaluation Result Archive for BFCL ☆29 · Dec 17, 2025 · Updated 3 months ago
- Small repository for my video on LoRA ☆16 · May 14, 2023 · Updated 2 years ago
- Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767] ☆18 · Apr 14, 2025 · Updated last year
- [NeurIPS 23] Characterizing OOD Error via Optimal Transport ☆13 · Nov 19, 2023 · Updated 2 years ago
- ☆29 · Jan 31, 2026 · Updated 2 months ago
- TopoTrans: Optimal Transport meets Topological Data Analysis ☆14 · Apr 20, 2023 · Updated 2 years ago
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy… ☆14 · May 14, 2024 · Updated last year
- Code and results accompanying our paper titled Leveraging Unlabeled Data to Predict Out-of-Distribution Performance at ICLR 2022 ☆10 · Dec 8, 2022 · Updated 3 years ago
- A universal workflow system for exactly-once DAGs ☆23 · Jun 1, 2023 · Updated 2 years ago
- [ICLR 2026] The official repository for the paper "AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning". ☆78 · Feb 27, 2026 · Updated last month
- KGym - A platform to run hundreds to thousands of ML4Linux kernel experiments at scale ☆16 · Nov 8, 2025 · Updated 5 months ago
- The CompCert formally-verified C compiler ☆11 · Apr 4, 2026 · Updated last week
- A thin MCP proxy ☆11 · Jul 28, 2025 · Updated 8 months ago
- MHC-peptide class II interaction prediction, binding, presentation ☆22 · Mar 16, 2022 · Updated 4 years ago
- ATP: Directed Graph Embedding with Asymmetric Transitivity Preservation ☆10 · Apr 18, 2019 · Updated 6 years ago
- This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur… ☆11 · Aug 24, 2022 · Updated 3 years ago
- ☆12 · May 17, 2021 · Updated 4 years ago
- A Gateway for connecting application services in different domains, networks, and cloud infrastructures ☆23 · Feb 1, 2026 · Updated 2 months ago
- Codes for DATA: Differentiable ArchiTecture Approximation. ☆11 · Jul 22, 2021 · Updated 4 years ago
- Code for paper "W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering" ☆15 · Oct 2, 2025 · Updated 6 months ago
- Streamlit-based Web App for Ai Text Generation based on GPT-2 Models from HuggingFace Model Hub using Python library aitextgen ☆27 · Nov 26, 2020 · Updated 5 years ago
- idaflirt-detector is Python scripts and IDA FLIRT signatures to detect statically linked libraries from stripped ELF file. ☆12 · May 19, 2022 · Updated 3 years ago
- Contains examples and assignments for my CS 254 course at Vanderbilt University, which can be accessed via http://www.dre.vanderbilt.edu/… ☆14 · Apr 25, 2022 · Updated 3 years ago
- open IPython notebooks for the book of Scientific Computing with Python ☆11 · Jul 16, 2015 · Updated 10 years ago
- A port of Grafx2 to DOS ☆13 · Apr 3, 2022 · Updated 4 years ago
- Predicting Out-of-Distribution Error with the Projection Norm ☆19 · Jul 27, 2022 · Updated 3 years ago
- The TacTok automated Coq proof script synthesis tool ☆17 · Jan 9, 2024 · Updated 2 years ago
- A Model Context Protocol (MCP) server implementation that provides file backup and restoration capabilities ☆12 · Aug 8, 2025 · Updated 8 months ago
- ☆28 · Jul 11, 2024 · Updated last year
- PMP: Cost-Effective Forced Execution with Probabilistic Memory Pre-Planning ☆13 · Sep 8, 2020 · Updated 5 years ago
- ☆25 · Apr 7, 2026 · Updated last week
- AI-powered Python CLI tool that automates the entire software development lifecycle using the Claude Code SDK. From specification to depl… ☆21 · Nov 17, 2025 · Updated 4 months ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data… ☆27 · Oct 20, 2022 · Updated 3 years ago
- Agentless Lite: RAG-based SWE-Bench software engineering scaffold ☆45 · Apr 15, 2025 · Updated last year