Phylliida / OpenClio
Open source version of Anthropic's Clio: A system for privacy-preserving insights into real-world AI use
☆53 Updated 4 months ago
Alternatives and similar repositories for OpenClio
Users who are interested in OpenClio compare it to the libraries listed below.
- A toolkit for describing model features and intervening on those features to steer behavior. ☆223 Updated last week
- ☆233 Updated 3 weeks ago
- Steering vectors for transformer language models in Pytorch / Huggingface ☆134 Updated 10 months ago
- ☆112 Updated 10 months ago
- ☆144 Updated 3 months ago
- Open source interpretability artefacts for R1. ☆165 Updated 8 months ago
- ☆74 Updated this week
- Create feature-centric and prompt-centric visualizations for sparse autoencoders (like those from Anthropic's published research). ☆232 Updated last year
- Inference-time scaling for LLMs-as-a-judge. ☆317 Updated last month
- Archon provides a modular framework for combining different inference-time techniques and LMs with just a JSON config file. ☆190 Updated 9 months ago
- ☆88 Updated last week
- ☆79 Updated last year
- Code for "Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs" ☆85 Updated 9 months ago
- Official repo for Learning to Reason for Long-Form Story Generation ☆73 Updated 8 months ago
- ⚖️ Awesome LLM Judges ⚖️ ☆146 Updated 7 months ago
- ☆105 Updated 11 months ago
- Vivaria is METR's tool for running evaluations and conducting agent elicitation research. ☆123 Updated last month
- ☆207 Updated this week
- Persona Vectors: Monitoring and Controlling Character Traits in Language Models ☆314 Updated 4 months ago
- ☆29 Updated 5 months ago
- Curated collection of community environments ☆195 Updated last week
- ☆124 Updated 2 months ago
- ☆316 Updated last year
- Functional Benchmarks and the Reasoning Gap ☆90 Updated last year
- Collection of evals for Inspect AI ☆313 Updated this week
- ☆57 Updated 2 months ago
- ControlArena is a collection of settings, model organisms and protocols for running control experiments. ☆143 Updated this week
- Delphi was the home of a temple to Phoebus Apollo, which famously had the inscription, 'Know Thyself.' This library lets language models … ☆234 Updated last week
- A DSPy-based implementation of the tree of thoughts method (Yao et al., 2023) for generating persuasive arguments ☆94 Updated 2 months ago
- Governance of the Commons Simulation (GovSim) ☆62 Updated 11 months ago