stanford-crfm / ecosystem-graphs
☆258Updated this week
Alternatives and similar repositories for ecosystem-graphs:
Users that are interested in ecosystem-graphs are comparing it to the libraries listed below
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆213Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆253Updated last year
- OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.☆543Updated last year
- ☆264Updated 6 months ago
- Build, evaluate, understand, and fix LLM-based apps☆484Updated last year
- ☆206Updated last week
- ☆247Updated 6 months ago
- Website for hosting the Open Foundation Models Cheat Sheet.☆262Updated 6 months ago
- RuLES: a benchmark for evaluating rule-following in language models☆215Updated this week
- ☆484Updated last month
- Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents.☆307Updated last year
- Code and data for "Lost in the Middle: How Language Models Use Long Contexts"☆324Updated last year
- Scaling Data-Constrained Language Models☆330Updated 3 months ago
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆108Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆68Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆297Updated last year
- ☆277Updated last year
- ☆150Updated last year
- [EMNLP 2023] The CoT Collection: Improving Zero-shot and Few-shot Learning of Language Models via Chain-of-Thought Fine-Tuning☆221Updated last year
- Ask Me Anything language model prompting☆544Updated last year
- A joint community effort to create one central leaderboard for LLMs.☆288Updated 4 months ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆192Updated this week
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆217Updated last year
- 📚 A curated list of papers & technical articles on AI Quality & Safety☆166Updated last year
- The official evaluation suite and dynamic data release for MixEval.☆233Updated 2 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆247Updated 6 months ago
- Evaluating LLMs with fewer examples☆141Updated 9 months ago
- TART: A plug-and-play Transformer module for task-agnostic reasoning☆193Updated last year
- Simple next-token-prediction for RLHF☆222Updated last year
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆236Updated last year