safety-research / petriView external linksLinks
An alignment auditing agent capable of quickly exploring alignment hypothesis
☆887Updated this week
Alternatives and similar repositories for petri
Users that are interested in petri are comparing it to the libraries listed below
Sorting:
- Prompts used in the Automated Auditing Blog Post☆138Jul 24, 2025Updated 6 months ago
- Inference API for many LLMs and other useful tools for empirical research☆104Updated this week
- Repository for "Training Language Models To Explain Their Own Computations"☆21Dec 22, 2025Updated last month
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆134Feb 8, 2026Updated last week
- Open Source Replication of Anthropic's Alignment Faking Paper☆54Apr 4, 2025Updated 10 months ago
- A python sdk for LLM finetuning and inference on runpod infrastructure☆17Updated this week
- Public repository containing METR's DVC pipeline for eval data analysis☆206Updated this week
- ☆140Sep 29, 2025Updated 4 months ago
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively.☆57Updated this week
- 💬 Simple TUI for ChatGPT.☆20Feb 9, 2024Updated 2 years ago
- ☆19Jan 21, 2023Updated 3 years ago
- ☆25Nov 11, 2025Updated 3 months ago
- A library for training crosscoders☆15May 28, 2025Updated 8 months ago
- Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration"☆10May 22, 2020Updated 5 years ago
- A guide on how to provide external configuration to microservices using MicroProfile Config: https://openliberty.io/guides/microprofile-c…☆13Feb 1, 2026Updated 2 weeks ago
- Deploy, update, and manage multiple stateful AI agents from YAML configuration files with simple commands like lettactl apply -f agents.y…☆33Updated this week
- A Python-based security assessment tool for continuous automated security scanning and monitoring of domains.☆13Apr 4, 2025Updated 10 months ago
- llama4_trip_planning_agent☆12Apr 5, 2025Updated 10 months ago
- QEMU support for a custom board based on a Microchip ATSAMD21G18A microcontroller (MCU)☆14Jun 10, 2024Updated last year
- Python tools for text to speech (TTS), speech to text (STT), and speech to speech (STS) powered by MLX☆29Jan 31, 2026Updated 2 weeks ago
- ☆36Nov 14, 2025Updated 3 months ago
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper.☆16Nov 21, 2025Updated 2 months ago
- ☆14Apr 8, 2021Updated 4 years ago
- ☆12Jul 12, 2024Updated last year
- Generate an MCP for any web app☆55Aug 26, 2025Updated 5 months ago
- Bluesky MCP server☆29Aug 10, 2025Updated 6 months ago
- An initiative to create concise and widely shareable educational resources, infographics, and animated explainers on the latest contribut…☆18Jul 9, 2023Updated 2 years ago
- Unofficial Experiments with AlgebraNets☆17Jun 17, 2020Updated 5 years ago
- ☆11Jun 2, 2021Updated 4 years ago
- Investigate the speed of adaptation of structural causal models☆15Feb 11, 2021Updated 5 years ago
- Implementation of various deep neural networks on fashion-mnist with PyTorch☆14Aug 30, 2017Updated 8 years ago
- ☆16Jul 9, 2025Updated 7 months ago
- Tools for optimizing steering vectors in LLMs.☆19Apr 10, 2025Updated 10 months ago
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments.☆153Updated this week
- ☆18Jul 30, 2024Updated last year
- ☆12Oct 11, 2022Updated 3 years ago
- a web logging proxy for MCP client-server communication☆27Aug 17, 2025Updated 5 months ago
- ☆14Aug 29, 2023Updated 2 years ago
- A security scanner for your LLM agentic workflows☆910Nov 27, 2025Updated 2 months ago