aidecentralized / sonarLinks
SONAR - Self-Organizing Network of Aggregated Representations
☆22Updated 3 months ago
Alternatives and similar repositories for sonar
Users that are interested in sonar are comparing it to the libraries listed below
Sorting:
- ☆232Updated 4 months ago
- Training-Ready RL Environments + Evals☆174Updated this week
- Repository with sample code using Apollo's suggested engineering practices☆13Updated 11 months ago
- Simulation framework for accelerating research in Private Federated Learning☆344Updated 3 weeks ago
- Benchmarks for the Evaluation of LLM Supervision☆32Updated last month
- ☆143Updated 2 months ago
- Open source interpretability artefacts for R1.☆163Updated 6 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆129Updated 7 months ago
- ☆17Updated last week
- A catalogue of existing Nanda servers☆189Updated 6 months ago
- ☆98Updated this week
- ⚖️ Awesome LLM Judges ⚖️☆133Updated 6 months ago
- METR Task Standard☆167Updated 9 months ago
- open source interpretability platform 🧠☆486Updated this week
- ☆104Updated 3 months ago
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments.☆128Updated this week
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆120Updated last week
- 🧠 Starter templates for doing interpretability research☆75Updated 2 years ago
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆122Updated last year
- anything you want can be built with morph cloud☆25Updated last month
- Resources for skilling up in AI alignment research engineering. Covers basics of deep learning, mechanistic interpretability, and RL.☆232Updated 3 months ago
- PyTorch-native post-training at scale☆532Updated this week
- ☆111Updated this week
- large population models☆444Updated last month
- Collection of evals for Inspect AI☆284Updated this week
- An open-source compliance-centered evaluation framework for Generative AI models☆170Updated this week
- Tools for studying developmental interpretability in neural networks.☆114Updated 4 months ago
- Open-source release accompanying Gao et al. 2025☆57Updated this week
- This repository collects all relevant resources about interpretability in LLMs☆382Updated last year
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 8 months ago