LudwigStumpp / llm-leaderboard
A joint community effort to create one central leaderboard for LLMs.
☆294 · Updated 7 months ago
Alternatives and similar repositories for llm-leaderboard:
Users interested in llm-leaderboard are comparing it to the repositories listed below.
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench… ☆585 · Updated last year
- ☆268 · Updated last year
- PaL: Program-Aided Language Models (ICML 2023) ☆488 · Updated last year
- This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c… ☆223 · Updated 2 years ago
- Generate textbook-quality synthetic LLM pretraining data ☆498 · Updated last year
- Tune any FALCON in 4-bit ☆466 · Updated last year
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining ☆693 · Updated last year
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive… ☆931 · Updated 5 months ago
- Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning ☆304 · Updated 5 months ago
- Salesforce open-source LLMs with 8k sequence length. ☆717 · Updated 2 months ago
- NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav… ☆313 · Updated last year
- [ICLR 2024] Lemur: Open Foundation Models for Language Agents ☆545 · Updated last year
- Official repository for LongChat and LongEval ☆518 · Updated 10 months ago
- Code for fine-tuning Platypus fam LLMs using LoRA ☆629 · Updated last year
- Build, evaluate, understand, and fix LLM-based apps ☆488 · Updated last year
- Fast & more realistic evaluation of chat language models. Includes leaderboard. ☆186 · Updated last year
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions ☆821 · Updated last year
- Fine-Tuning Embedding for RAG with Synthetic Data ☆494 · Updated last year
- ☆412 · Updated last year
- A bagel, with everything. ☆318 · Updated last year
- ☆444 · Updated 2 years ago
- Official implementation of our NeurIPS 2023 paper "Augmenting Language Models with Long-Term Memory". ☆789 · Updated last year
- 📚 Datasets and models for instruction-tuning ☆238 · Updated last year
- Customizable implementation of the self-instruct paper. ☆1,043 · Updated last year
- A command-line interface to generate textual and conversational datasets with LLMs. ☆294 · Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers ☆423 · Updated last year
- ☆356 · Updated 2 years ago
- This repository implements the chain of verification paper by Meta AI ☆168 · Updated last year
- Large Language Models for All, 🦙 Cult and More, Stay in touch ! ☆446 · Updated last year
- Build Hierarchical Autonomous Agents through Config. Collaborative Growth of Specialized Agents. ☆312 · Updated last year