An alignment auditing agent capable of quickly exploring alignment hypotheses
☆934 Feb 28, 2026 Updated last week
Alternatives and similar repositories for petri
Users that are interested in petri are comparing it to the libraries listed below
- ☆37 Jul 4, 2025 Updated 8 months ago
- Inspect: A framework for large language model evaluations ☆1,800 Updated this week
- Repository for "Training Language Models To Explain Their Own Computations" ☆21 Dec 22, 2025 Updated 2 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆134 Feb 15, 2026 Updated 3 weeks ago
- Open Source Replication of Anthropic's Alignment Faking Paper ☆54 Apr 4, 2025 Updated 11 months ago
- ☆21 Jun 22, 2025 Updated 8 months ago
- James' cookbook of evaluations and finetuning experiments ☆21 Feb 19, 2026 Updated 2 weeks ago
- A python sdk for LLM finetuning and inference on runpod infrastructure ☆20 Feb 16, 2026 Updated 3 weeks ago
- ☆144 Sep 29, 2025 Updated 5 months ago
- A toolkit that provides a range of model diffing techniques including a UI to visualize them interactively. ☆66 Updated this week
- 💬 Simple TUI for ChatGPT. ☆20 Feb 9, 2024 Updated 2 years ago
- ☆20 Jan 21, 2023 Updated 3 years ago
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments. ☆160 Feb 27, 2026 Updated last week
- Residual Quantization Autoencoder, used for interpreting LLMs ☆14 Jan 1, 2025 Updated last year
- Using pre-trained YOLO algorithm to detect faces in photo ID documents for ID verification ☆10 Apr 3, 2018 Updated 7 years ago
- Streamline on-policy/off-policy distillation workflows in a few lines of code ☆96 Feb 26, 2026 Updated last week
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) with… ☆24 Jan 7, 2024 Updated 2 years ago
- A Python-based security assessment tool for continuous automated security scanning and monitoring of domains. ☆13 Apr 4, 2025 Updated 11 months ago
- ☆25 Jan 17, 2026 Updated last month
- ☆26 Sep 3, 2025 Updated 6 months ago
- A library for training crosscoders ☆16 May 28, 2025 Updated 9 months ago
- A guide on how to provide external configuration to microservices using MicroProfile Config: https://openliberty.io/guides/microprofile-c… ☆13 Mar 1, 2026 Updated last week
- Code for simulations in "Computational mechanisms of curiosity and goal-directed exploration" ☆10 May 22, 2020 Updated 5 years ago
- Collection of evals for Inspect AI ☆393 Updated this week
- ☆37 Nov 14, 2025 Updated 3 months ago
- Introduction to Game Development in Unity ☆10 Jun 21, 2016 Updated 9 years ago
- ☆22 Updated this week
- ☆12 Jul 12, 2024 Updated last year
- Code for the "Overcoming Sparsity Artifacts in Crosscoders to Interpret Chat-Tuning" paper. ☆16 Nov 21, 2025 Updated 3 months ago
- Bluesky MCP server ☆30 Aug 10, 2025 Updated 7 months ago
- Sparsify transformers with SAEs and transcoders ☆699 Mar 2, 2026 Updated last week
- ☆979 Updated this week
- ☆11 Jun 2, 2021 Updated 4 years ago
- ☆15 Jun 7, 2024 Updated last year
- Agentkube - Run Kubernetes Like Never Before ☆36 Mar 1, 2026 Updated last week
- Investigate the speed of adaptation of structural causal models ☆15 Feb 11, 2021 Updated 5 years ago
- ☆21 Jul 21, 2025 Updated 7 months ago
- Tools for optimizing steering vectors in LLMs. ☆20 Apr 10, 2025 Updated 11 months ago
- A lightweight static site generator with built-in CMS that creates microblog-style content feeds. ☆17 Jan 18, 2026 Updated last month