mlcommons / modelgauge
Make it easy to automatically and uniformly measure the behavior of many AI Systems.
☆27Updated 7 months ago
Alternatives and similar repositories for modelgauge
Users that are interested in modelgauge are comparing it to the libraries listed below
Sorting:
- Run safety benchmarks against AI models and view detailed reports showing how well they performed.☆90Updated this week
- Understanding how features learned by neural networks evolve throughout training☆34Updated 6 months ago
- ☆29Updated last year
- ☆26Updated 2 years ago
- Documenting large text datasets 🖼️ 📚☆12Updated 5 months ago
- ☆12Updated 3 years ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆26Updated 11 months ago
- Experiments to assess SPADE on different LLM pipelines.☆16Updated last year
- A mechanistic approach for understanding and detecting factual errors of large language models.☆44Updated 10 months ago
- ☆17Updated 2 years ago
- ☆43Updated last year
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories☆26Updated 5 months ago
- Finding semantically meaningful and accurate prompts.☆46Updated last year
- ☆24Updated 3 months ago
- The Foundation Model Transparency Index☆79Updated 11 months ago
- ☆28Updated last year
- ☆54Updated last year
- Sparse and discrete interpretability tool for neural networks☆62Updated last year
- ☆60Updated 3 years ago
- 🧠 Starter templates for doing interpretability research☆70Updated last year
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents☆49Updated 7 months ago
- Official Repository for Dataset Inference for LLMs☆33Updated 9 months ago
- ☆129Updated last month
- Minimum Description Length probing for neural network representations☆19Updated 3 months ago
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆22Updated 3 months ago
- The repository contains code for Adaptive Data Optimization☆24Updated 5 months ago
- ☆19Updated 10 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆74Updated 5 months ago
- Cross-field empirical trends analysis of XAI literature☆20Updated last year
- ModelDiff: A Framework for Comparing Learning Algorithms☆56Updated last year