aidecentralized / sonarLinks
SONAR - Self-Organizing Network of Aggregated Representations
☆22Updated 2 months ago
Alternatives and similar repositories for sonar
Users that are interested in sonar are comparing it to the libraries listed below
Sorting:
- Open source interpretability artefacts for R1.☆161Updated 5 months ago
- ☆225Updated 3 months ago
- ☆16Updated this week
- ☆142Updated last month
- Atropos is a Language Model Reinforcement Learning Environments framework for collecting and evaluating LLM trajectories through diverse …☆702Updated this week
- open source interpretability platform 🧠☆442Updated this week
- Simulation framework for accelerating research in Private Federated Learning☆341Updated last month
- Collection of evals for Inspect AI☆250Updated this week
- ☆475Updated 2 months ago
- ☆28Updated last year
- A catalogue of existing Nanda servers☆187Updated 5 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆115Updated this week
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆118Updated last year
- Training-Ready RL Environments + Evals☆121Updated this week
- METR Task Standard☆163Updated 8 months ago
- ☆230Updated 3 months ago
- Post-training with Tinker☆912Updated this week
- ☆97Updated 2 months ago
- A Mechanistic Interpretability Analysis of Grokking☆22Updated 3 years ago
- ☆35Updated 2 weeks ago
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments.☆99Updated this week
- ☆109Updated 5 months ago
- Testing baseline LLMs performance across various models☆313Updated this week
- A 7B parameter model for mathematical reasoning☆40Updated 7 months ago
- Repository with sample code using Apollo's suggested engineering practices☆13Updated 9 months ago
- A reading list of relevant papers and projects on foundation model annotation☆28Updated 7 months ago
- ☆69Updated last week
- Public repository containing METR's DVC pipeline for eval data analysis☆117Updated 6 months ago
- Benchmarks for the Evaluation of LLM Supervision☆32Updated this week
- Transformers for Mathematics Tutorial | Simons/SLMath Workshop on AI for Mathematics 2025☆40Updated 6 months ago