stanford-crfm / fmti
The Foundation Model Transparency Index
☆85 · Dec 9, 2025 · Updated 2 months ago
Alternatives and similar repositories for fmti
Users that are interested in fmti are comparing it to the libraries listed below
- A framework for few-shot evaluation of autoregressive language models. ☆12 · Jul 14, 2025 · Updated 7 months ago
- This repository includes the implementation and results of the paper "ChatGPT is fun, but it is not funny! Humor is still challenging Lar… ☆13 · Jul 13, 2023 · Updated 2 years ago
- ☆33 · Jan 25, 2026 · Updated 3 weeks ago
- Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given… ☆15 · Oct 16, 2023 · Updated 2 years ago
- Adversarial attack comparative assessment Large Language Model ☆13 · May 21, 2025 · Updated 8 months ago
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task. ☆15 · Sep 4, 2024 · Updated last year
- Interface for GenAI-Arena [NeurIPS24] ☆17 · Feb 27, 2024 · Updated last year
- Official PyTorch implementation for "Understanding Instance-based Interpretability of Variational Auto-Encoders." ☆13 · Oct 21, 2021 · Updated 4 years ago
- Training hybrid models for dummies. ☆29 · Nov 1, 2025 · Updated 3 months ago
- Code & data for EMNLP 2020 paper "MOCHA: A Dataset for Training and Evaluating Reading Comprehension Metrics". ☆16 · May 3, 2022 · Updated 3 years ago
- ☆20 · Jun 1, 2022 · Updated 3 years ago
- ☆22 · Jan 25, 2023 · Updated 3 years ago
- RuleRAG: Rule Meets Retrieval-Augmented Generation for Question Answering ☆32 · Oct 8, 2025 · Updated 4 months ago
- [EMNLP'23] Execution-Based Evaluation for Open Domain Code Generation ☆49 · Dec 22, 2023 · Updated 2 years ago
- I2M2: Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning (NeurIPS 2024) ☆22 · Oct 30, 2024 · Updated last year
- Aioli: A unified optimization framework for language model data mixing ☆32 · Jan 17, 2025 · Updated last year
- Learning to route instances for Human vs AI Feedback (ACL Main '25) ☆26 · Jul 23, 2025 · Updated 6 months ago
- Code for COLING22 paper, DPTDR: Deep Prompt Tuning for Dense Passage Retrieval ☆26 · Aug 7, 2023 · Updated 2 years ago
- CypherBench: Towards Precise Retrieval over Full-scale Modern Knowledge Graphs in the LLM Era ☆30 · Jun 18, 2025 · Updated 7 months ago
- This repo contains evaluation code for the paper "AV-Odyssey: Can Your Multimodal LLMs Really Understand Audio-Visual Information?" ☆31 · Dec 23, 2024 · Updated last year
- Model hub for all your DiffeqML needs. Pretrained weights, modules, and basic inference infrastructure ☆28 · Mar 9, 2023 · Updated 2 years ago
- Code for the MTEB leaderboard ☆30 · Feb 4, 2025 · Updated last year
- ☆24 · Jun 25, 2025 · Updated 7 months ago
- Sparse and discrete interpretability tool for neural networks ☆64 · Feb 12, 2024 · Updated 2 years ago
- ☆28 · Sep 21, 2024 · Updated last year
- Unofficial example of the COVID-19 vaccinations dashboard ☆24 · Apr 23, 2024 · Updated last year
- Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024) ☆65 · Jan 11, 2025 · Updated last year
- A ton of fixes/enhancements to upstream SvnBridge project (at http://svnbridge.codeplex.com ). License intended to be identical to upstre… ☆10 · Oct 30, 2015 · Updated 10 years ago
- ☆13 · Apr 29, 2023 · Updated 2 years ago
- A Jupyter Book for sharing resources around open source in academia ☆16 · Jan 17, 2026 · Updated 3 weeks ago
- GPI-Space: Memory Driven Computing and Big Data ☆10 · Jan 2, 2025 · Updated last year
- For calculating Shapley values via linear regression. ☆73 · Jun 6, 2021 · Updated 4 years ago
- ☆33 · Jun 24, 2024 · Updated last year
- ☆42 · Nov 13, 2024 · Updated last year
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs. ☆90 · Jan 29, 2024 · Updated 2 years ago
- Demo repository showcasing how to use reusable workflows to build artifact attestations ☆13 · Feb 2, 2026 · Updated 2 weeks ago
- Python test runner built in Rust ☆17 · Updated this week
- Self-evaluating RAG application on LangCheck docs ☆11 · Sep 10, 2025 · Updated 5 months ago
- [ICLR 2024] Scaling physics-informed hard constraints with mixture-of-experts. ☆38 · Jun 21, 2024 · Updated last year