aidecentralized / sonar
SONAR - Self-Organizing Network of Aggregated Representations
☆15Updated last week
Alternatives and similar repositories for sonar:
Users that are interested in sonar are comparing it to the libraries listed below
- Verdict is a library for scaling judge-time compute.☆199Updated last week
- ☆160Updated 2 weeks ago
- ☆128Updated 3 weeks ago
- A catalogue of existing Nanda servers☆101Updated this week
- Improving Alignment and Robustness with Circuit Breakers☆197Updated 7 months ago
- Benchmarks for the Evaluation of LLM Supervision☆32Updated 2 weeks ago
- Federated Transformer (NeurIPS 24): a framework to enhance the performance of multi-party Vertical Federated Learning involving fuzzy ide…☆37Updated 4 months ago
- ☆122Updated last month
- METR Task Standard☆146Updated 2 months ago
- Open source interpretability artefacts for R1.☆95Updated this week
- ☆14Updated last week
- ☆137Updated last month
- ☆64Updated this week
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System☆117Updated 10 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research.☆90Updated this week
- ☆54Updated 7 months ago
- A better way of testing, inspecting, and analyzing AI Agent traces.☆35Updated this week
- The Granite Guardian models are designed to detect risks in prompts and responses.☆78Updated last month
- Public repository containing METR's DVC pipeline for eval data analysis☆49Updated 3 weeks ago
- Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".☆102Updated last year
- A 7B parameter model for mathematical reasoning☆29Updated 2 months ago
- Simulation framework for accelerating research in Private Federated Learning☆323Updated 2 months ago
- ☆13Updated last week
- Guardrails for secure and robust agent development☆237Updated last week
- Official Repo for the 30DaysOfFLCode Challenge Initiative☆72Updated 4 months ago
- Blueprint for federated finetuning, enabling multiple data owners to collaboratively fine-tune models without sharing raw data. Developed…☆34Updated last week
- ⚖️ Awesome LLM Judges ⚖️☆93Updated 2 months ago
- Red-Teaming Language Models with DSPy☆183Updated 2 months ago
- open source interpretability platform 🧠☆97Updated this week
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs"☆46Updated 2 months ago