Prompts used in the Automated Auditing Blog Post
☆148Jul 24, 2025Updated 8 months ago
Alternatives and similar repositories for automated-auditing
Users that are interested in automated-auditing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Open Source Replication of Anthropic's Alignment Faking Paper☆56Apr 4, 2025Updated last year
- ☆588Jun 19, 2025Updated 9 months ago
- 🤖 Complete reproduction of 'AlphaGo Moment for Model Architecture Discovery' using MLX-LM instead of GPT-4. Autonomous neural architectu…☆29Jul 27, 2025Updated 8 months ago
- Open-source audio embedding models, submitted to the HEAR 2021 challenge☆11Feb 15, 2026Updated last month
- Residual Quantization Autoencoder, used for interpreting LLMs☆14Jan 1, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆20Apr 10, 2025Updated last year
- [TMLR 25] An automated method for explaining complex neuron behaviors in deep vision models using large language models☆10Feb 20, 2025Updated last year
- ☆80Feb 18, 2026Updated last month
- Stochastic Parameter Decomposition☆70Updated this week
- Measuring and Controlling Persona Drift in Language Model Dialogs☆23Feb 26, 2024Updated 2 years ago
- ☆24Jun 22, 2025Updated 9 months ago
- ☆12Sep 25, 2022Updated 3 years ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆14May 17, 2024Updated last year
- ☆279Oct 1, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆48Mar 19, 2026Updated 3 weeks ago
- ☆35Feb 20, 2025Updated last year
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments.☆177Updated this week
- A basic ls replacement, written in rust, using cursor ai and Geoffrey Huntley's techniques☆31Mar 3, 2025Updated last year
- ☆88Apr 1, 2026Updated last week
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models …☆248Updated this week
- ppx_system is a syntax extension to known operating system at compile time☆12May 9, 2023Updated 2 years ago
- Official repository for the paper "Gradient-based Jailbreak Images for Multimodal Fusion Models" (https//arxiv.org/abs/2410.03489)☆19Oct 22, 2024Updated last year
- ☆15Apr 26, 2025Updated 11 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- This was designed for interp researchers who want to do research on or with interp agents to give quality of life improvements and fix …☆137Feb 8, 2026Updated 2 months ago
- ☆134Oct 16, 2025Updated 5 months ago
- ☆20Jan 21, 2023Updated 3 years ago
- ☆25Feb 23, 2026Updated last month
- The nnsight package enables interpreting and manipulating the internals of deep learned models.☆885Apr 5, 2026Updated last week
- Auditing agents for fine-tuning safety☆20Oct 21, 2025Updated 5 months ago
- Langton's Ant implemented in Javascript☆27Sep 8, 2020Updated 5 years ago
- A Model Context Protocol server for Flux image generation, providing tools for image generation, manipulation, and control☆25Mar 25, 2026Updated 2 weeks ago
- Semantic search over every Emergent Ventures winner.☆30Feb 26, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆18Dec 10, 2025Updated 4 months ago
- ☆13Jun 30, 2020Updated 5 years ago
- Runtime library and schema compiler for the Avro serialization format☆21Dec 13, 2021Updated 4 years ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆52Jan 12, 2026Updated 3 months ago
- A powerful keybind library and daemon for Linux.☆11Jul 24, 2022Updated 3 years ago
- Scala Native 3 bindings for SFML library☆15Jul 9, 2023Updated 2 years ago
- Sparse Autoencoder Training Library☆55May 1, 2025Updated 11 months ago