mlcommons / modelgauge
Make it easy to automatically and uniformly measure the behavior of many AI Systems.
☆26 · Updated 5 months ago
Alternatives and similar repositories for modelgauge:
Users who are interested in modelgauge are comparing it to the libraries listed below.
- Run safety benchmarks against AI models and view detailed reports showing how well they performed. ☆81 · Updated this week
- Sparse and discrete interpretability tool for neural networks ☆59 · Updated last year
- A mechanistic approach for understanding and detecting factual errors of large language models. ☆41 · Updated 8 months ago
- Official PyTorch Implementation for Meaning Representations from Trajectories in Autoregressive Models (ICLR 2024) ☆20 · Updated 9 months ago
- 🧠 Starter templates for doing interpretability research ☆66 · Updated last year
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆26 · Updated 9 months ago
- Collection of evals for Inspect AI ☆88 · Updated this week
- Official code for the paper: "Metadata Archaeology" ☆19 · Updated last year
- Code for Fooling Contrastive Language-Image Pre-trained Models with CLIPMasterPrints ☆16 · Updated 4 months ago
- The repository contains code for Adaptive Data Optimization ☆20 · Updated 3 months ago
- Data for "Datamodels: Predicting Predictions with Training Data" ☆95 · Updated last year
- [EMNLP 2024 Main] Virtual Personas for Language Models via an Anthology of Backstories ☆26 · Updated 3 months ago
- Code for the ACL 2023 paper: "Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Sc… ☆29 · Updated last year
- Get language models to generate responses in a specific format reliably. Open source implementation of Synchromesh: Reliable code generat… ☆27 · Updated last year
- ReLM is a Regular Expression engine for Language Models ☆103 · Updated last year
- The code for the paper ROUTERBENCH: A Benchmark for Multi-LLM Routing System ☆108 · Updated 9 months ago
- Official Repository for Dataset Inference for LLMs ☆32 · Updated 7 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆108 · Updated last year
- Dolomite Engine is a library for pretraining/finetuning LLMs ☆42 · Updated this week