aidecentralized / sonarLinks
SONAR - Self-Organizing Network of Aggregated Representations
☆22Updated 4 months ago
Alternatives and similar repositories for sonar
Users that are interested in sonar are comparing it to the libraries listed below
Sorting:
- ☆234Updated 5 months ago
- Simulation framework for accelerating research in Private Federated Learning☆345Updated last month
- Open source interpretability artefacts for R1.☆164Updated 7 months ago
- ☆17Updated last week
- Benchmarks for the Evaluation of LLM Supervision☆32Updated 2 months ago
- ☆144Updated 3 months ago
- A Mechanistic Interpretability Analysis of Grokking☆23Updated 3 years ago
- Training-Ready RL Environments + Evals☆190Updated this week
- A catalogue of existing Nanda servers☆190Updated 7 months ago
- ☆29Updated last year
- Repository with sample code using Apollo's suggested engineering practices☆14Updated 11 months ago
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆122Updated last year
- A 7B parameter model for mathematical reasoning☆40Updated 9 months ago
- METR Task Standard☆168Updated 10 months ago
- ControlArena is a collection of settings, model organisms and protocols - for running control experiments.☆134Updated last week
- Stochastic Parameter Decomposition☆58Updated last week
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆121Updated last month
- ☆18Updated last year
- Collection of evals for Inspect AI☆305Updated this week
- ☆62Updated last week
- open source interpretability platform 🧠☆532Updated this week
- Notebooks accompanying Anthropic's "Toy Models of Superposition" paper☆131Updated 3 years ago
- Test equality between a black-box LLM API and a reference distribution☆12Updated last year
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆85Updated 9 months ago
- ☆38Updated last week
- Training API and CLI☆253Updated last week
- An interface library for RL post training with environments.☆829Updated this week
- ☆104Updated 4 months ago
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file.☆189Updated 9 months ago
- Solidity contracts for the decentralized Prime Network protocol☆27Updated 5 months ago