LudwigStumpp / llm-leaderboardLinks

A joint community effort to create one central leaderboard for LLMs.

☆304

Alternatives and similar repositories for llm-leaderboard

Users that are interested in llm-leaderboard are comparing it to the libraries listed below

Sorting:

nlpxucan / evol-instruct
☆270Updated 2 years ago
arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆628Updated last year
abacusai / Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆591Updated last year
salesforce / xgen
Salesforce open-source LLMs with 8k sequence length.
☆720Updated 6 months ago
OpenLemur / Lemur
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
☆552Updated last year
shm007g / LLaMA-Cult-and-More
Large Language Models for All, 🦙 Cult and More, Stay in touch !
☆445Updated 2 years ago
dsdanielpark / open-llm-leaderboard-report
Weekly visualization report of Open LLM model performance based on 4 metrics.
☆87Updated last year
taprosoft / llm_finetuning
Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GTPQ, bitsandbytes…
☆146Updated last year
yxuansu / OpenAlpaca
OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA
☆302Updated 2 years ago
conceptofmind / toolformer
☆366Updated 2 years ago
neuml / txtinstruct
📚 Datasets and models for instruction-tuning
☆238Updated last year
h2oai / h2o-wizardlm
Open-Source Implementation of WizardLM to turn documents into Q:A pairs for LLM fine-tuning
☆312Updated 9 months ago
madaan / memprompt
A method to fix GPT-3 after deployment with user feedback, without re-training.
☆329Updated 2 years ago
loubnabnl / santacoder-finetuning
Fine-tune SantaCoder for Code/Text Generation.
☆192Updated 2 years ago
manyoso / haltt4llm
This project is an attempt to create a common metric to test LLM's for progress in eliminating hallucinations which is the most serious c…
☆222Updated 2 years ago
VikParuchuri / textbook_quality
Generate textbook-quality synthetic LLM pretraining data
☆501Updated last year
bigcode-project / Megatron-LM
Ongoing research training transformer models at scale
☆390Updated 11 months ago
radi-cho / datasetGPT
A command-line interface to generate textual and conversational datasets with LLMs.
☆301Updated last year
nexusflowai / NexusRaven
NexusRaven-13B, a new SOTA Open-Source LLM for function calling. This repo contains everything for reproducing our evaluation on NexusRav…
☆316Updated last year
yuchenlin / LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…
☆952Updated 9 months ago
young-geng / koala_data_pipeline
The data processing pipeline for the Koala chatbot language model
☆117Updated 2 years ago
promptslab / LLMtuner
FineTune LLMs in few lines of code (Text2Text, Text2Speech, Speech2Text)
☆240Updated last year
dsdanielpark / open-llm-datasets
Repository for organizing datasets and papers used in Open LLM.
☆99Updated 2 years ago
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆168Updated last year
zeno-ml / zeno-build
Build, evaluate, understand, and fix LLM-based apps
☆489Updated last year
reasoning-machines / pal
PaL: Program-Aided Language Models (ICML 2023)
☆502Updated 2 years ago
FastEval / FastEval
Fast & more realistic evaluation of chat language models. Includes leaderboard.
☆187Updated last year
ritun16 / chain-of-verification
This repository implements the chain of verification paper by Meta AI
☆172Updated last year
FSoft-AI4Code / CodeCapybara
Open-source Self-Instruction Tuning Code LLM
☆169Updated 2 years ago
run-llama / finetune-embedding
Fine-Tuning Embedding for RAG with Synthetic Data
☆504Updated last year