huggingface/lm-evaluation-harness

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huggingface/lm-evaluation-harness)

huggingface / lm-evaluation-harness

A framework for few-shot evaluation of language models.

☆37

Alternatives and similar repositories for lm-evaluation-harness

Users that are interested in lm-evaluation-harness are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

huggingface / pyo3-special-method-derive
View on GitHub
Automatically derive Python dunder methods for your Rust code
☆25May 26, 2026Updated last month
huggingface / leaderboards
View on GitHub
☆23May 26, 2026Updated last month
huggingface / hf-endpoints-documentation
View on GitHub
☆27Jun 23, 2026Updated 3 weeks ago
huggingface / feel
View on GitHub
☆15May 26, 2026Updated last month
huggingface / AIEnergyScore
View on GitHub
AI Energy Score: Initiative to establish comparable energy efficiency ratings for AI models.
☆40Dec 2, 2025Updated 7 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
felixbinder / introspection_self_prediction
View on GitHub
Code for experiments on self-prediction as a way to measure introspection in LLMs
☆16Dec 10, 2024Updated last year
Ammar-Alnagar / Ammar-Alnagar
View on GitHub
☆13Apr 10, 2026Updated 3 months ago
Ammar-Alnagar / Enlightener
View on GitHub
Enlightener, the cutting-edge Retrieval-Augmented Generation (RAG) system that revolutionizes query responses. By combining the power of …
☆13Jul 28, 2025Updated 11 months ago
huggingface / tgi-gaudi
View on GitHub
Large Language Model Text Generation Inference on Habana Gaudi
☆34Mar 20, 2025Updated last year
huggingface / pixparse
View on GitHub
Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data
☆24Jul 30, 2024Updated last year
huggingface / tailscale-action
View on GitHub
Github action to connect to tailscale
☆20Jun 8, 2026Updated last month
sumukshashidhar / yourbench
View on GitHub
Benchmark Large Language Models Reliably On Your Data
☆18Dec 27, 2025Updated 6 months ago
FartyPants / VirtualLora
View on GitHub
extension for text WebUI
☆20Aug 7, 2025Updated 11 months ago
twitter-research / lmsoc
View on GitHub
Code for reproducing our paper: LMSOC: An Approach for Socially Sensitive Pretraining
☆13Oct 22, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Tomorrowdawn / top_nsigma
View on GitHub
The official code repo and data hub of top_nsigma sampling strategy for LLMs.
☆26Feb 11, 2025Updated last year
huggingface / docmatix
View on GitHub
A huge dataset for Document Visual Question Answering
☆24Jul 29, 2024Updated last year
RuishanLiu / GAN-TSC
View on GitHub
☆11Oct 15, 2020Updated 5 years ago
ay27 / RandomGit
View on GitHub
随机扒取古诗文词语作为git的commit msg
☆11Jan 16, 2017Updated 9 years ago
j2kun / mlir-resources
View on GitHub
A list of articles outside of the official MLIR docs that I've found useful for learning MLIR
☆13Aug 16, 2023Updated 2 years ago
huggingface / hub-sync
View on GitHub
A GitHub Action that syncs your GitHub repository to Hugging Face Hub 🤗
☆21Updated this week
zytedata / clear-html
View on GitHub
Remove DIVs, style stuff and normalize HTML preserving structure information
☆14Oct 24, 2025Updated 8 months ago
tscholak / PyConEdward
View on GitHub
Slides for the tutorial talk on Bayesian Machine Learning at PyCon 2017
☆10May 19, 2017Updated 9 years ago
metterian / korean_bert_score
View on GitHub
BERT score for text generation
☆12Jan 15, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
leon-ai / leon-ci-image
View on GitHub
🐳 Docker CI image for Leon projects.
☆13Nov 6, 2021Updated 4 years ago
convei-lab / BotsTalk
View on GitHub
🤖 Code for our EMNLP 2022 paper: "BotsTalk: Machine-sourced Framework for Automatic Curation of Large-scale Multi-skill Dialogue Dataset…
☆16Oct 7, 2024Updated last year
Coolnesss / fada-pytorch
View on GitHub
PyTorch implementation of https://arxiv.org/abs/1711.02536
☆12Jan 11, 2018Updated 8 years ago
TryHard-LL / AFFNet
View on GitHub
AFFNet-Unofficial Implementation
☆14Aug 23, 2023Updated 2 years ago
anthropics / rogue-deploy-eval
View on GitHub
☆16Jan 21, 2025Updated last year
ashwindcruz / dgm
View on GitHub
Deep Generative Models (Chainer)
☆10Oct 12, 2017Updated 8 years ago
apple / pkl-package-docs
View on GitHub
Documentation for Pkl packages
☆18Updated this week
huggingface / optimum-furiosa
View on GitHub
Accelerated inference of 🤗 models using FuriosaAI NPU chips.
☆27May 26, 2026Updated last month
Milimo-Quantum / milimochat
View on GitHub
MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…
☆14Mar 12, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
LAION-AI / Desktop-BUD-E_V1.0
View on GitHub
BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…
☆23Oct 10, 2024Updated last year
princeton-nlp / align-mlm
View on GitHub
☆13Nov 30, 2022Updated 3 years ago
UBC-NLP / octopus
View on GitHub
Octopus is a neural machine generation toolkit for Arabic Natural Lnagauge Generation (NLG)
☆10Apr 29, 2024Updated 2 years ago
Telegram-Mini-Apps / vanillajs-template
View on GitHub
Telegram Mini Apps application template using @telegram-apps/sdk and JavaScript.
☆15Jan 7, 2025Updated last year
klightz / splitting
View on GitHub
Offical Repo for Splitting Steepest Descent for Growing Neural Architectures
☆13May 12, 2021Updated 5 years ago
graphrag / ms-graphrag
View on GitHub
A modular graph-based Retrieval-Augmented Generation (RAG) system
☆16Updated this week
leon-ai / blog.getleon.ai
View on GitHub
🖋️ Blog of Leon.
☆14Mar 25, 2026Updated 3 months ago