OpenHands / benchmarks
Evaluation harness for OpenHands V1.
☆35 Feb 6, 2026 Updated last week
Alternatives and similar repositories for benchmarks
Users that are interested in benchmarks are comparing it to the libraries listed below
- A zero-cost, open-source desktop AI assistant that understands your screen and responds in real time (with cua) ☆20 Jan 26, 2026 Updated 2 weeks ago
- Game engine for a web version of the Avalon card-board game ☆12 Aug 2, 2025 Updated 6 months ago
- ATP: Directed Graph Embedding with Asymmetric Transitivity Preservation ☆10 Apr 18, 2019 Updated 6 years ago
- Efficient MCP tool calling in code mode for Claude Code ☆21 Dec 12, 2025 Updated 2 months ago
- Aline: Agentic Git for Vibe Coders ☆27 Nov 26, 2025 Updated 2 months ago
- MCP server for NotebookLM - Let your AI agents (Claude Code, Codex) research documentation directly with grounded, citation-backed answer… ☆20 Jan 31, 2026 Updated last week
- Siren: Byzantine-robust Federated Learning via Proactive Alarming (SoCC '21) ☆11 Mar 28, 2024 Updated last year
- hq2x scaling algorithm updated to support RGBA ☆17 Jan 14, 2016 Updated 10 years ago
- KGym - A platform to run hundreds to thousands of ML4Linux kernel experiments at scale ☆14 Nov 8, 2025 Updated 3 months ago
- MCPfy your scripts and tasks, half agent, half mcp server, fully at your command ☆25 Nov 4, 2025 Updated 3 months ago
- Let your AI assistant sing to you through Suno! ☆21 May 8, 2025 Updated 9 months ago
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks ☆19 Sep 18, 2025 Updated 4 months ago
- Continual Memorization of Factoids in Large Language Models ☆12 Nov 20, 2024 Updated last year
- This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur… ☆11 Aug 24, 2022 Updated 3 years ago
- Claude Code Review Skill ☆26 Jan 9, 2026 Updated last month
- Voxel-based Editor ☆13 Jul 11, 2018 Updated 7 years ago
- Thoughtbox is a Git-inspired workspace for Agent Teams. ☆38 Updated this week
- Implementation of the Implicit Knowledge Extraction Attack. ☆18 May 28, 2025 Updated 8 months ago
- PMP: Cost-Effective Forced Execution with Probabilistic Memory Pre-Planning ☆13 Sep 8, 2020 Updated 5 years ago
- ☆14 May 17, 2021 Updated 4 years ago
- Replacement for the old Unix crypt ☆16 Sep 17, 2019 Updated 6 years ago
- SDL2 emscripten port, non-upstreamed changes ☆34 May 7, 2021 Updated 4 years ago
- Audio transcription using mlx whisper and vad silence processing ☆17 Oct 14, 2024 Updated last year
- [CVPR'24] LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning ☆15 Jan 15, 2025 Updated last year
- A collection of MCP tools used to speed up local development for Kilo Code ☆25 Jun 12, 2025 Updated 8 months ago
- Interview experience notes ☆14 Sep 11, 2019 Updated 6 years ago
- ☆18 Nov 30, 2025 Updated 2 months ago
- A naive interpreter for the IR of NJU compiler principles lab 3; to accelerate interpretation, the IR will be compiled to machine-friendly bina… ☆16 Jun 17, 2020 Updated 5 years ago
- Simple MOD player library written in C ☆19 Jan 30, 2026 Updated 2 weeks ago
- A Model Context Protocol (MCP) server that lets your AI interact with Yahoo Finance to get comprehensive stock market data, news, financi… ☆35 Jul 22, 2025 Updated 6 months ago
- Official repo for FSE'24 paper "CodeArt: Better Code Models by Attention Regularization When Symbols Are Lacking" ☆18 Mar 10, 2025 Updated 11 months ago
- Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767] ☆18 Apr 14, 2025 Updated 10 months ago
- ☆24 Sep 9, 2025 Updated 5 months ago
- ☆23 Aug 19, 2025 Updated 5 months ago
- ☆23 Jan 13, 2026 Updated last month
- Predictive memory layer for AI agents. MongoDB + Qdrant + Neo4j with multi-tier caching, custom schema support & GraphQL. 91% Stanford ST… ☆35 Updated this week
- ☆18 Aug 15, 2022 Updated 3 years ago
- Template for OpenCode Plugins ☆37 Jan 11, 2026 Updated last month
- cytoscape in vue ☆16 Jan 6, 2023 Updated 3 years ago