mlcommons / modelgauge
Make it easy to automatically and uniformly measure the behavior of many AI Systems.
☆26 Updated 11 months ago
Alternatives and similar repositories for modelgauge
Users who are interested in modelgauge are comparing it to the libraries listed below.
- Run safety benchmarks against AI models and view detailed reports showing how well they performed. ☆104 Updated this week
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models" ☆107 Updated last year
- ☆29 Updated 2 years ago
- Code for the ACL 2023 paper: "Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Sc… ☆31 Updated 2 years ago
- We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts… ☆94 Updated last year
- ☆30 Updated 2 years ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions" ☆72 Updated last year
- ☆54 Updated 2 years ago
- A simple evaluation of generative language models and safety classifiers. ☆63 Updated last week
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆28 Updated last year
- ☆28 Updated 7 months ago
- ☆57 Updated last month
- Data for "Datamodels: Predicting Predictions with Training Data" ☆97 Updated 2 years ago
- ☆44 Updated 10 months ago
- Official PyTorch Implementation for Meaning Representations from Trajectories in Autoregressive Models (ICLR 2024) ☆22 Updated last year
- Language models scale reliably with over-training and on downstream tasks ☆99 Updated last year
- ☆75 Updated last year
- Sparse and discrete interpretability tool for neural networks ☆63 Updated last year
- ☆61 Updated 3 years ago
- Finding semantically meaningful and accurate prompts. ☆48 Updated last year
- The repository contains code for Adaptive Data Optimization ☆25 Updated 9 months ago
- ☆26 Updated last year
- SILO Language Models code repository ☆82 Updated last year
- Package to optimize Adversarial Attacks against (Large) Language Models with Varied Objectives ☆70 Updated last year
- PyTorch package to train and audit ML models for Individual Fairness ☆66 Updated this week
- Code for the paper "Fishing for Magikarp" ☆165 Updated 4 months ago
- ModelDiff: A Framework for Comparing Learning Algorithms ☆59 Updated 2 years ago
- LM engine is a library for pretraining/finetuning LLMs ☆66 Updated last week
- Erasing concepts from neural representations with provable guarantees ☆233 Updated 7 months ago
- ☆14 Updated last year