LudwigStumpp / llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
☆297Updated 8 months ago
Alternatives and similar repositories for llm-leaderboard:
Users that are interested in llm-leaderboard are comparing it to the libraries listed below
- ☆270Updated 2 years ago
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆547Updated last year
- Code for fine-tuning Platypus fam LLMs using LoRA☆629Updated last year
- PaL: Program-Aided Language Models (ICML 2023)☆489Updated last year
- This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…☆222Updated 2 years ago
- Official repository for LongChat and LongEval☆519Updated 11 months ago
- Salesforce open-source LLMs with 8k sequence length.☆717Updated 3 months ago
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆941Updated 6 months ago
- This repository implements the chain of verification paper by Meta AI☆168Updated last year
- Generate textbook-quality synthetic LLM pretraining data☆498Updated last year
- ☆515Updated 5 months ago
- 🐙 OctoPack: Instruction Tuning Code Large Language Models☆463Updated 3 months ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆226Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆186Updated last year
- LLMs can generate feedback on their work, use it to improve the output, and repeat this process iteratively.☆682Updated 7 months ago
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning☆305Updated 6 months ago
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆586Updated last year
- SAIL: Search Augmented Instruction Learning☆157Updated last year
- Repository for organizing datasets and papers used in Open LLM.☆95Updated last year
- Data and code for "DocPrompting: Generating Code by Retrieving the Docs" @ICLR 2023☆243Updated last year
- CodeGen2 models for program synthesis☆275Updated last year
- Fine-tune SantaCoder for Code/Text Generation.☆192Updated 2 years ago
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- Build, evaluate, understand, and fix LLM-based apps☆488Updated last year
- ☆444Updated 2 years ago
- 📚 Datasets and models for instruction-tuning☆238Updated last year
- Data and code for FreshLLMs (https://arxiv.org/abs/2310.03214)☆358Updated last week
- ☆356Updated 2 years ago
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…☆315Updated last year
- ☆173Updated last year