aidecentralized / sonar
SONAR - Self-Organizing Network of Aggregated Representations
☆17 Updated 2 weeks ago
Alternatives and similar repositories for sonar
Users who are interested in sonar are comparing it to the libraries listed below.
- Open source interpretability artefacts for R1. ☆144 Updated last month
- ☆180 Updated 2 months ago
- ☆133 Updated 2 months ago
- ☆14 Updated this week
- A catalogue of existing Nanda servers ☆133 Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆173 Updated 3 months ago
- 🧠 Starter templates for doing interpretability research ☆70 Updated last year
- Simulation framework for accelerating research in Private Federated Learning ☆326 Updated last week
- A package for defining deep learning models using categorical algebraic expressions. ☆61 Updated 10 months ago
- ☆128 Updated 2 months ago
- A reading list of relevant papers and projects on foundation model annotation ☆27 Updated 3 months ago
- ☆24 Updated 8 months ago
- Proof-of-concept of global switching between numpy/jax/pytorch in a library. ☆18 Updated 11 months ago
- METR Task Standard ☆148 Updated 4 months ago
- ☆16 Updated last year
- A better way of testing, inspecting, and analyzing AI Agent traces. ☆38 Updated last week
- ControlArena is a suite of realistic settings, mimicking complex deployment environments, for running control evaluations. This is an alp… ☆61 Updated this week
- Steering vectors for transformer language models in Pytorch / Huggingface ☆103 Updated 3 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆95 Updated this week
- Improving Alignment and Robustness with Circuit Breakers ☆209 Updated 8 months ago
- ☆43 Updated 6 months ago
- Sphynx Hallucination Induction ☆54 Updated 4 months ago
- in this repository, i'm going to implement increasingly complex llm inference optimizations ☆59 Updated 2 weeks ago
- ⚖️ Awesome LLM Judges ⚖️ ☆104 Updated last month
- Mechanistic Interpretability Visualizations using React ☆253 Updated 5 months ago
- Collection of evals for Inspect AI ☆147 Updated this week
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training". ☆106 Updated last year
- Transformers for Mathematics Tutorial | Simons/SLMath Workshop on AI for Mathematics 2025 ☆33 Updated 2 months ago
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper ☆126 Updated 2 years ago
- A lightweight framework for building research agents designed for developers ☆94 Updated this week