Open-sourced evaluation suite from the Monitoring Monitorability paper
☆69Apr 22, 2026Updated last week
Alternatives and similar repositories for monitorability-evals
Users that are interested in monitorability-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a metaprogramming language that compiles from types☆10Jun 26, 2024Updated last year
- [arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning☆69Dec 17, 2025Updated 4 months ago
- Auditing agents for fine-tuning safety☆21Oct 21, 2025Updated 6 months ago
- EstrousNet is a deep learning network that provides unbiased classification of estrous stage.☆20Aug 28, 2025Updated 8 months ago
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- C# SDK for The Eye Tribe Tracker☆24Nov 23, 2016Updated 9 years ago
- ☆25Sep 3, 2025Updated 8 months ago
- Minimal coding, computer-use and deep research agents using the OpenAI Agents SDK☆35Mar 9, 2026Updated last month
- ☆13Oct 5, 2025Updated 6 months ago
- Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis"☆14Jun 24, 2023Updated 2 years ago
- Code for evaluating AI systems on the MASK honesty benchmark.☆20Mar 6, 2025Updated last year
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated last year
- ☆15Jun 7, 2024Updated last year
- SVIP: Towards Verifiable Inference of Open-Source Large Language Models☆15Jun 3, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Bulk operations extension for Entity Framework(EF6 and EFCore).☆12Aug 22, 2017Updated 8 years ago
- 주식 시장 관련 지표들을 모아서 보여주고, AI를 통해 시장의 향방을 예측해주는 웹 페이지 입니다.☆44Mar 8, 2026Updated last month
- [ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation☆17Aug 12, 2022Updated 3 years ago
- TRAIL: Simulating the Impact of Human Locomotion on Natural Landscapes - 2024 - Computer Graphics International (CGI)☆13Apr 7, 2025Updated last year
- ☆10Dec 17, 2020Updated 5 years ago
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆17Mar 31, 2025Updated last year
- Reasoning-based Evaluation and Ranking of Translations.☆20Jul 18, 2025Updated 9 months ago
- Animal Harm Assessment public repository☆12Updated this week
- Fast optimisation of tuning curves by iterative refitting and decoding☆14Apr 18, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for the API, workload execution, and agents underlying the LLMail-Inject Adpative Prompt Injection Challenge☆23Apr 9, 2026Updated 3 weeks ago
- helpful code for solving statistical problems☆15Oct 13, 2025Updated 6 months ago
- Course for ISP: craft of data visualization☆22Sep 1, 2022Updated 3 years ago
- ☆18Apr 15, 2024Updated 2 years ago
- Example agents for the Dreadnode platform☆33Dec 19, 2025Updated 4 months ago
- A framework for evaluating Machine Translation models.☆12Apr 21, 2026Updated last week
- pointcloud data format binary version web viewer using Three.js☆18Oct 11, 2020Updated 5 years ago
- ☆12Aug 2, 2016Updated 9 years ago
- Facial Recognition Software for Macaque Monkeys☆13Jul 18, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆43May 9, 2025Updated 11 months ago
- ☆18Mar 30, 2025Updated last year
- [COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…☆12Jul 26, 2024Updated last year
- ☆94Updated this week
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated last year
- ☆12Sep 11, 2022Updated 3 years ago
- Basic Shooting Game in C++ and OpenCV☆18Dec 28, 2018Updated 7 years ago