mlcommons / modelgaugeLinks
Make it easy to automatically and uniformly measure the behavior of many AI Systems.
☆26Updated last year
Alternatives and similar repositories for modelgauge
Users that are interested in modelgauge are comparing it to the libraries listed below
Sorting:
- Run safety benchmarks against AI models and view detailed reports showing how well they performed.☆114Updated this week
- Erasing concepts from neural representations with provable guarantees☆239Updated 10 months ago
- Sparse and discrete interpretability tool for neural networks☆65Updated last year
- ☆56Updated 2 years ago
- ☆63Updated this week
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆28Updated last year
- ☆30Updated 2 years ago
- git extension for {collaborative, communal, continual} model development☆217Updated last year
- We view Large Language Models as stochastic language layers in a network, where the learnable parameters are the natural language prompts…☆95Updated last year
- ☆144Updated 3 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆258Updated 2 years ago
- The Foundation Model Transparency Index☆83Updated last week
- ☆76Updated last year
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for tr…☆66Updated last month
- codebase release for EMNLP2023 paper publication☆19Updated 3 months ago
- A mechanistic approach for understanding and detecting factual errors of large language models.☆49Updated last year
- Code for the ACL 2023 paper: "Rethinking the Role of Scale for In-Context Learning: An Interpretability-based Case Study at 66 Billion Sc…☆35Updated 2 years ago
- Utilities for the HuggingFace transformers library☆72Updated 2 years ago
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents☆51Updated last year
- ☆112Updated 10 months ago
- Measuring the situational awareness of language models☆39Updated last year
- 🧠 Starter templates for doing interpretability research☆74Updated 2 years ago
- ☆26Updated 2 years ago
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆71Updated last year
- Latest Weight Averaging (NeurIPS HITY 2022)☆32Updated 2 years ago
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Act☆93Updated 2 years ago
- Experiments for efforts to train a new and improved t5☆76Updated last year
- Code for the paper "Fishing for Magikarp"☆176Updated 7 months ago
- Understanding how features learned by neural networks evolve throughout training☆40Updated last year
- The NDIF server, which performs deep inference and serves nnsight requests remotely☆36Updated this week