AutoHarness: Automated Harness Engineering for AI Agents
☆309Apr 2, 2026Updated 2 months ago
Alternatives and similar repositories for AutoHarness
Users that are interested in AutoHarness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆36Nov 3, 2024Updated last year
- Multi-Agent Collaboration Design Patterns Built on LangGraph with 10+ battle-tested patterns, each with complete code, architectu…☆51Apr 9, 2026Updated last month
- HSML Dynamic version for ICML 2019☆12Jul 11, 2019Updated 6 years ago
- KGML for EMNLP 2021☆10Feb 2, 2022Updated 4 years ago
- When Visualizing is the First Step to Reasoning: MIRA, a Benchmark for Visual Chain-of-Thought☆32Feb 14, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PyTorch implementation for "ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback" (https://arxiv.org/abs/2107.14035).☆16Sep 9, 2022Updated 3 years ago
- Claude skill for finding ML research papers.☆213Apr 14, 2026Updated last month
- ☆73Apr 1, 2026Updated 2 months ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆343Aug 8, 2025Updated 10 months ago
- A proactive tutor mode for Clicky☆96Apr 8, 2026Updated 2 months ago
- A collection of interesting links, articles, research papers and projects related to knowledge graphs, GenAI and LLMs (large language mod…☆28Jul 5, 2024Updated last year
- Pytorch implementation of EpiFoundation☆27Feb 25, 2025Updated last year
- ☆32Dec 1, 2025Updated 6 months ago
- 功能上来说就是Claude Code webUI和frp的结合体,简化配置和部署☆63Nov 7, 2025Updated 7 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Visibility Deferred Rendering☆14Sep 11, 2024Updated last year
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆29Feb 17, 2025Updated last year
- Source code for Regional Homogeneity: Towards Learning Transferable Universal Adversarial Perturbations Against Defenses (ECCV 2020)☆42Apr 2, 2019Updated 7 years ago
- A tool that can be used to measure the sequential performance of any OpenAI-compatible LLM API☆24Aug 1, 2024Updated last year
- Code for LDLForests☆20Oct 4, 2018Updated 7 years ago
- [ECCV 2022] This repository includes the official implementation our paper "In Defense of Image Pre-Training for Spatiotemporal Recogniti…☆19Dec 22, 2022Updated 3 years ago
- Neo4j example movie search application with GraphQL backend☆14Apr 3, 2018Updated 8 years ago
- ☆17Feb 14, 2024Updated 2 years ago
- gpu-ray-tracing-in-unity implements by three-eyed-games☆20Jan 9, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The Vulkan Tutorial adapted to SDL2, VMA, Slang, Volk, Imgui and pure functions.☆13Apr 21, 2025Updated last year
- [NeurIPS'24] CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models☆81Dec 4, 2024Updated last year
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated last year
- official repository for the NeurIPS 2022 paper "Adversarial Attack on Attackers: Post-Process to Mitigate Black-Box Score-Based Query Att…☆20Oct 28, 2022Updated 3 years ago
- Example for agent orchestration☆19Mar 31, 2025Updated last year
- ☆27Jan 23, 2024Updated 2 years ago
- Collections of Actions for Custom GPTs (some created by Captain Action)☆11Jan 7, 2024Updated 2 years ago
- ☆27May 30, 2026Updated last week
- ☆29Feb 27, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An implementation of a Vulkan RayQuery ray tracing integration project in Unity./移动端 Vulkan 光追实现☆17Feb 23, 2025Updated last year
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆87Oct 26, 2025Updated 7 months ago
- [TMLR 2025] Official implementation of AttnGCG: Enhancing Jailbreaking Attacks on LLMs with Attention Manipulation☆26Jun 17, 2025Updated 11 months ago
- Unity sample project using instancing in HDRP path tracing.☆17May 1, 2025Updated last year
- [CVPR 2024] This repository includes the official implementation our paper "Revisiting Adversarial Training at Scale"☆20Apr 21, 2024Updated 2 years ago
- Code for PII detection and redaction in code datasets☆15Jan 24, 2023Updated 3 years ago
- [TMLR 2024] On the Adversarial Robustness of Camera-based 3D Object Detection☆31Apr 23, 2024Updated 2 years ago