🤗 Evaluate: A library for easily evaluating machine learning models and datasets.
☆2,419 · Jan 20, 2026 · Updated last month
Alternatives and similar repositories for evaluate
Users interested in evaluate are comparing it to the libraries listed below.
- 🚀 Accelerate inference and training of 🤗 Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimization… ☆3,296 · Feb 9, 2026 · Updated 2 weeks ago
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ☆9,513 · Updated this week
- 🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools ☆21,228 · Updated this week
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning. ☆20,678 · Updated this week
- Train transformer language models with reinforcement learning. ☆17,460 · Updated this week
- Accessible large language models via k-bit quantization for PyTorch. ☆7,997 · Updated this week
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends ☆2,311 · Feb 20, 2026 · Updated last week
- Efficient few-shot learning with Sentence Transformers ☆2,683 · Dec 11, 2025 · Updated 2 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks. ☆2,903 · Updated this week
- A framework for few-shot evaluation of language models. ☆11,478 · Feb 15, 2026 · Updated last week
- Large Language Model Text Generation Inference ☆10,774 · Jan 8, 2026 · Updated last month
- 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production ☆10,485 · Updated this week
- Simple, safe way to store and distribute tensors ☆3,637 · Updated this week
- State-of-the-Art Text Embeddings ☆18,298 · Feb 20, 2026 · Updated last week
- Robust recipes to align language models with human and AI preferences ☆5,506 · Sep 8, 2025 · Updated 5 months ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF) ☆4,741 · Jan 8, 2024 · Updated 2 years ago
- A Unified Library for Parameter-Efficient and Modular Transfer Learning ☆2,801 · Oct 12, 2025 · Updated 4 months ago
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective. ☆41,648 · Updated this week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes. ☆30,860 · Feb 21, 2026 · Updated last week
- 🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch. ☆32,873 · Updated this week
- Fast and memory-efficient exact attention ☆22,361 · Updated this week
- ☆2,946 · Jan 15, 2026 · Updated last month
- Ongoing research training transformer models at scale ☆15,242 · Updated this week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ☆32,159 · Sep 30, 2025 · Updated 5 months ago
- Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets ☆4,875 · Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆10,353 · Feb 20, 2026 · Updated last week
- PyTorch native post-training library ☆5,689 · Updated this week
- Minimalistic large language model 3D-parallelism training ☆2,569 · Feb 19, 2026 · Updated last week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities ☆22,033 · Jan 23, 2026 · Updated last month
- A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries. ☆1,248 · Updated this week
- The official Python client for the Hugging Face Hub. ☆3,343 · Feb 21, 2026 · Updated last week
- Foundation Architecture for (M)LLMs ☆3,135 · Apr 11, 2024 · Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs ☆71,234 · Updated this week
- Unsupervised text tokenizer for Neural Network-based text generation. ☆11,668 · Updated this week
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models ☆3,207 · Jul 19, 2024 · Updated last year
- 🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal model… ☆157,071 · Updated this week
- Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks. ☆17,889 · Nov 3, 2025 · Updated 3 months ago
- Task-based datasets, preprocessing, and evaluation for sequence models. ☆594 · Feb 3, 2026 · Updated 3 weeks ago
- BertViz: Visualize Attention in Transformer Models ☆7,921 · Jan 8, 2026 · Updated last month