Open-sourced evaluation suite from the Monitoring Monitorability paper
☆84Jun 11, 2026Updated 3 weeks ago
Alternatives and similar repositories for monitorability-evals
Users that are interested in monitorability-evals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a metaprogramming language that compiles from types☆10Jun 26, 2024Updated 2 years ago
- ADAG: Transluce's MLP neuron-level circuit tracing library☆29Apr 10, 2026Updated 2 months ago
- [arXiv 2025] SAGE: Training Smart Any-Horizon Agents for Long Video Reasoning with Reinforcement Learning☆71Dec 17, 2025Updated 6 months ago
- Auditing agents for fine-tuning safety☆21Oct 21, 2025Updated 8 months ago
- EstrousNet is a deep learning network that provides unbiased classification of estrous stage.☆21Aug 28, 2025Updated 10 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs☆13Jun 20, 2025Updated last year
- C# SDK for The Eye Tribe Tracker☆24Nov 23, 2016Updated 9 years ago
- ☆26Sep 3, 2025Updated 10 months ago
- Minimal coding, computer-use and deep research agents using the OpenAI Agents SDK☆36May 19, 2026Updated last month
- Code for ICML 2023 paper "When and How Does Known Class Help Discover Unknown Ones? Provable Understandings Through Spectral Analysis"☆14Jun 24, 2023Updated 3 years ago
- ☆15Oct 5, 2025Updated 8 months ago
- Code for evaluating AI systems on the MASK honesty benchmark.☆22Mar 6, 2025Updated last year
- An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)☆10May 31, 2024Updated 2 years ago
- ☆15Jun 7, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- SVIP: Towards Verifiable Inference of Open-Source Large Language Models☆15Jun 3, 2025Updated last year
- Bulk operations extension for Entity Framework(EF6 and EFCore).☆12Aug 22, 2017Updated 8 years ago
- 주식 시장 관련 지표들을 모아서 보여주고, AI를 통해 시장의 향방을 예측해주는 웹 페이지 입니다.☆49May 12, 2026Updated last month
- [ECCV2022] Motion Sensitive Contrastive Learning for Self-supervised Video Representation☆17Aug 12, 2022Updated 3 years ago
- TRAIL: Simulating the Impact of Human Locomotion on Natural Landscapes - 2024 - Computer Graphics International (CGI)☆13Apr 7, 2025Updated last year
- Röttger et al. (2025): "MSTS: A Multimodal Safety Test Suite for Vision-Language Models"☆20Mar 31, 2025Updated last year
- ☆10Dec 17, 2020Updated 5 years ago
- Reasoning-based Evaluation and Ranking of Translations.☆19Jun 2, 2026Updated last month
- Animal Harm Assessment public repository☆12May 3, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Fast optimisation of tuning curves by iterative refitting and decoding☆15Jun 6, 2026Updated 3 weeks ago
- Code for the API, workload execution, and agents underlying the LLMail-Inject Adpative Prompt Injection Challenge☆25Apr 9, 2026Updated 2 months ago
- helpful code for solving statistical problems☆15Oct 13, 2025Updated 8 months ago
- Course for ISP: craft of data visualization☆22Sep 1, 2022Updated 3 years ago
- ☆18Apr 15, 2024Updated 2 years ago
- Example agents for the Dreadnode platform☆33Dec 19, 2025Updated 6 months ago
- A framework for evaluating Machine Translation models.☆12Apr 21, 2026Updated 2 months ago
- pointcloud data format binary version web viewer using Three.js☆18Oct 11, 2020Updated 5 years ago
- ☆12Aug 2, 2016Updated 9 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Facial Recognition Software for Macaque Monkeys☆13Jul 18, 2017Updated 8 years ago
- ☆18Mar 30, 2025Updated last year
- [COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…☆12Jul 26, 2024Updated last year
- A new multi-task learning framework using Vision Transformers☆11Jun 19, 2024Updated 2 years ago
- ☆111Jun 25, 2026Updated last week
- ☆12Sep 11, 2022Updated 3 years ago
- Basic Shooting Game in C++ and OpenCV☆18Dec 28, 2018Updated 7 years ago