aidecentralized / sonarLinks
SONAR - Self-Organizing Network of Aggregated Representations
☆22Updated last month
Alternatives and similar repositories for sonar
Users that are interested in sonar are comparing it to the libraries listed below
Sorting:
- Open source interpretability artefacts for R1.☆157Updated 4 months ago
- ☆222Updated 2 months ago
- ☆16Updated this week
- open source interpretability platform 🧠☆375Updated this week
- ☆141Updated 2 weeks ago
- A catalogue of existing Nanda servers☆185Updated 4 months ago
- METR Task Standard☆159Updated 7 months ago
- Public repository containing METR's DVC pipeline for eval data analysis☆104Updated 4 months ago
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆651Updated last week
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆110Updated last week
- Simulation framework for accelerating research in Private Federated Learning☆334Updated 2 weeks ago
- large population models☆400Updated this week
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments.☆88Updated last week
- ☆85Updated last month
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆112Updated last year
- Starter SDK for full-stack EVM applications, built for TreeHacks 2025 Web3 Workshop☆13Updated 6 months ago
- Collection of evals for Inspect AI☆215Updated this week
- ☆122Updated 3 weeks ago
- Benchmarks for the Evaluation of LLM Supervision☆32Updated last month
- ☆475Updated last month
- Machine Learning for Alignment Bootcamp☆78Updated 3 years ago
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆54Updated 6 months ago
- A 7B parameter model for mathematical reasoning☆40Updated 6 months ago
- A curated list of awesome resources, tools, research papers, and projects related to the concept of Large Language Model Operating System…☆131Updated 4 months ago
- Tools for studying developmental interpretability in neural networks.☆101Updated 2 months ago
- Training-Ready RL Environments + Evals☆65Updated this week
- ☆98Updated 4 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆182Updated 5 months ago
- ☆28Updated 10 months ago
- Repository with sample code using Apollo's suggested engineering practices☆12Updated 8 months ago