LudwigStumpp / llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
☆293Updated 6 months ago
Alternatives and similar repositories for llm-leaderboard:
Users that are interested in llm-leaderboard are comparing it to the libraries listed below
- ☆268Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆584Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents☆543Updated last year
- Build, evaluate, understand, and fix LLM-based apps☆487Updated last year
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆922Updated 4 months ago
- Code for fine-tuning Platypus fam LLMs using LoRA☆628Updated last year
- This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.☆543Updated last year
- This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…☆222Updated last year
- Large Language Models for All, 🦙 Cult and More, Stay in touch !☆442Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆690Updated 11 months ago
- 📚 Datasets and models for instruction-tuning☆235Updated last year
- Generate textbook-quality synthetic LLM pretraining data☆498Updated last year
- Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory".☆785Updated 11 months ago
- Fine-tune SantaCoder for Code/Text Generation.☆190Updated last year
- Fine-Tuning Embedding for RAG with Synthetic Data☆487Updated last year
- Official repository for LongChat and LongEval☆516Updated 9 months ago
- Fast & more realistic evaluation of chat language models. Includes leaderboard.☆185Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆820Updated last year
- Salesforce open-source LLMs with 8k sequence length.☆717Updated last month
- ☆501Updated 4 months ago
- Run evaluation on LLMs using human-eval benchmark☆400Updated last year
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning☆300Updated 4 months ago
- PaL: Program-Aided Language Models (ICML 2023)☆483Updated last year
- Domain Adapted Language Modeling Toolkit - E2E RAG☆316Updated 4 months ago
- The data processing pipeline for the Koala chatbot language model☆117Updated last year
- This repository implements the chain of verification paper by Meta AI☆164Updated last year
- This is the repository for our paper "INTERS: Unlocking the Power of Large Language Models in Search with Instruction Tuning"☆203Updated 3 months ago
- A method to fix GPT-3 after deployment with user feedback, without re-training.☆326Updated last year
- Ongoing research training transformer models at scale☆381Updated 7 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆413Updated last week