stanford-crfm / ecosystem-graphsLinks
☆267Updated 6 months ago
Alternatives and similar repositories for ecosystem-graphs
Users that are interested in ecosystem-graphs are comparing it to the libraries listed below
Sorting:
- ☆244Updated 4 months ago
- Build, evaluate, understand, and fix LLM-based apps☆490Updated last year
- Website for hosting the Open Foundation Models Cheat Sheet.☆267Updated 2 months ago
- A joint community effort to create one central leaderboard for LLMs.☆304Updated 11 months ago
- Project 2 (Building Large Language Models) for Stanford CS324: Understanding and Developing Large Language Models (Winter 2022)☆105Updated 2 years ago
- PaL: Program-Aided Language Models (ICML 2023)☆503Updated 2 years ago
- ☆291Updated last year
- NeurIPS Large Language Model Efficiency Challenge: 1 LLM + 1GPU + 1Day☆256Updated last year
- ☆299Updated last year
- Scaling Data-Constrained Language Models☆338Updated last month
- ☆149Updated last year
- This is the repo for the paper Shepherd -- A Critic for Language Model Generation☆219Updated last year
- A set of utilities for running few-shot prompting experiments on large-language models☆122Updated last year
- Extracting spatial and temporal world models from LLMs☆255Updated last year
- RuLES: a benchmark for evaluating rule-following in language models☆228Updated 5 months ago
- Code and data for "Measuring and Narrowing the Compositionality Gap in Language Models"☆317Updated last year
- [ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets☆218Updated last year
- ☆94Updated last year
- Ask Me Anything language model prompting☆547Updated 2 years ago
- OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.☆569Updated last year
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning☆46Updated last year
- Functional Benchmarks and the Reasoning Gap☆88Updated 10 months ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆187Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆70Updated 2 years ago
- Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".☆627Updated 3 weeks ago
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆335Updated 7 months ago
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆306Updated last year
- ☆529Updated 8 months ago
- Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback☆208Updated 2 years ago
- An open collection of methodologies to help with successful training of large language models.☆507Updated last year