stanford-crfm / fmti
The Foundation Model Transparency Index
β78Updated 11 months ago
Alternatives and similar repositories for fmti:
Users that are interested in fmti are comparing it to the libraries listed below
- Stanford CRFM's initiative to assess potential compliance with the draft EU AI Actβ94Updated last year
- π€ Disaggregators: Curated data labelers for in-depth analysis.β65Updated 2 years ago
- β230Updated last month
- β128Updated 3 weeks ago
- π A curated list of papers & technical articles on AI Quality & Safetyβ178Updated last week
- Your buddy in the (L)LM space.β64Updated 7 months ago
- This is the reproduction repository for my π€ Hugging Face blog post on synthetic dataβ68Updated last year
- β264Updated 3 months ago
- git extension for {collaborative, communal, continual} model developmentβ211Updated 5 months ago
- Run safety benchmarks against AI models and view detailed reports showing how well they performed.β88Updated this week
- Functional Benchmarks and the Reasoning Gapβ85Updated 6 months ago
- β90Updated 2 months ago
- Fiddler Auditor is a tool to evaluate language models.β179Updated last year
- Erasing concepts from neural representations with provable guaranteesβ227Updated 2 months ago
- Scaling is a distributed training library and installable dependency designed to scale up neural networks, with a dedicated module for trβ¦β58Updated 5 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"β108Updated last year
- β67Updated 8 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β49Updated 9 months ago
- β49Updated last year
- A library for working with prompt templates locally or on the Hugging Face Hub.β45Updated last month
- codebase release for EMNLP2023 paper publicationβ19Updated last year
- Manage scalable open LLM inference endpoints in Slurm clustersβ254Updated 9 months ago
- β92Updated last year
- A repository containing the code for translating popular LLM benchmarks to German.β25Updated last year
- Notebooks for training universal 0-shot classifiers on many different tasksβ124Updated 3 months ago
- π A curated list of resources dedicated to synthetic dataβ127Updated 2 years ago
- π Reference-Free automatic summarization evaluation with potential hallucination detectionβ100Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptionsβ69Updated 2 years ago
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domainβ¦β52Updated last year
- Evaluating LLMs with CommonGen-Liteβ89Updated last year