stanford-crfm / ecosystem-graphsLinks
☆267Updated 7 months ago
Alternatives and similar repositories for ecosystem-graphs
Users that are interested in ecosystem-graphs are comparing it to the libraries listed below
Sorting:
- ☆245Updated 5 months ago
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆256Updated last year
- Website for hosting the Open Foundation Models Cheat Sheet.☆267Updated 3 months ago
- ☆149Updated last year
- ☆301Updated last year
- A joint community effort to create one central leaderboard for LLMs.☆305Updated last year
- Build, evaluate, understand, and fix LLM-based apps☆490Updated last year
- ☆292Updated last year
- Scaling Data-Constrained Language Models☆339Updated 2 months ago
- RuLES: a benchmark for evaluating rule-following in language models☆230Updated 6 months ago
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆219Updated last year
- Evaluation suite for LLMs☆359Updated last month
- Evaluating LLMs with fewer examples☆160Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆244Updated 9 months ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆188Updated last year
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning☆46Updated last year
- PaL: Program-Aided Language Models (ICML 2023)☆504Updated 2 years ago
- Ask Me Anything language model prompting☆546Updated 2 years ago
- ☆337Updated last year
- ☆444Updated 2 years ago
- Extracting spatial and temporal world models from LLMs☆255Updated last year
- ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…☆223Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆219Updated 2 years ago
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆335Updated 8 months ago
- Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".☆638Updated 3 weeks ago
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆290Updated 6 months ago
- Tools for understanding how transformer predictions are built layer-by-layer☆516Updated 2 weeks ago
- ☆536Updated 9 months ago
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆70Updated 2 years ago
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆593Updated last year