AutoHarness: Automated Harness Engineering for AI Agents
☆333Apr 2, 2026Updated 2 months ago
Alternatives and similar repositories for AutoHarness
Users that are interested in AutoHarness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLR'25 Oral] MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models☆36Nov 3, 2024Updated last year
- [ICLR'26] EduVisAgent: A Benchmark and Multi-Agent Framework for Pedagogical Visualization☆30Aug 5, 2025Updated 10 months ago
- Agents' Last Exam☆743Updated this week
- Multi-Agent Collaboration Design Patterns Built on LangGraph with 10+ battle-tested patterns, each with complete code, architectu…☆52Apr 9, 2026Updated 2 months ago
- HSML Dynamic version for ICML 2019☆12Jul 11, 2019Updated 6 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Support plugins for simulating RMF scenarios☆20Jun 17, 2026Updated last week
- PyTorch implementation for "ProtoTransformer: A Meta-Learning Approach to Providing Student Feedback" (https://arxiv.org/abs/2107.14035).☆16Sep 9, 2022Updated 3 years ago
- Claude skill for finding ML research papers.☆228Apr 14, 2026Updated 2 months ago
- Dynamic Mixture of Progressive Parameter-Efficient Expert Library for Lifelong Robot Learning☆29Jul 4, 2025Updated 11 months ago
- Reward Evolution with Large Language Models using Human Feedback☆20Nov 14, 2025Updated 7 months ago
- ☆12Feb 18, 2025Updated last year
- High throughput streaming of Protobuf data from Kafka into DuckDB☆13Mar 4, 2026Updated 3 months ago
- PyTorch implementation for MRL☆23Feb 22, 2024Updated 2 years ago
- [ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"☆88Nov 28, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ATS for NeurIPS 2021☆24Nov 4, 2021Updated 4 years ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆349Aug 8, 2025Updated 10 months ago
- A collection of interesting links, articles, research papers and projects related to knowledge graphs, GenAI and LLMs (large language mod…☆28Jul 5, 2024Updated last year
- This is the official implementation of WiseAD.☆26Apr 22, 2025Updated last year
- ☆19Jun 21, 2026Updated last week
- ☆32Dec 1, 2025Updated 6 months ago
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆29Feb 17, 2025Updated last year
- unofficial implementation of the CoT-decoding method for extract cot paths in an unsupervised way☆20Jan 11, 2026Updated 5 months ago
- Code for LDLForests☆20Oct 4, 2018Updated 7 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More☆25Feb 25, 2025Updated last year
- [NeurIPS'25] ReAgent-V: A Reward-Driven Multi-Agent Framework for Video Understanding☆53Sep 21, 2025Updated 9 months ago
- A Python package to dynamically load functions for OpenAI Assistant☆55Dec 6, 2023Updated 2 years ago
- An OptiX/CUDA code sample showing how to quickly build ray-tracing acceleration structures for dynamic subdivision surfaces☆27Feb 12, 2026Updated 4 months ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆29Apr 22, 2025Updated last year
- official repository for the NeurIPS 2022 paper "Adversarial Attack on Attackers: Post-Process to Mitigate Black-Box Score-Based Query Att…☆20Oct 28, 2022Updated 3 years ago
- ☆27Jan 23, 2024Updated 2 years ago
- Collections of Actions for Custom GPTs (some created by Captain Action)☆11Jan 7, 2024Updated 2 years ago
- ☆29Feb 27, 2025Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆87Oct 26, 2025Updated 8 months ago
- ☆19Jan 2, 2023Updated 3 years ago
- Unity sample project using instancing in HDRP path tracing.☆17May 1, 2025Updated last year
- [TMLR 2024] On the Adversarial Robustness of Camera-based 3D Object Detection☆31Apr 23, 2024Updated 2 years ago
- MLTI for ICLR 2022☆30May 6, 2022Updated 4 years ago
- ☆14Sep 4, 2024Updated last year
- [Arxiv] Aligning Modalities in Vision Large Language Models via Preference Fine-tuning☆93Apr 30, 2024Updated 2 years ago