huggingface / evaluateLinks
π€ Evaluate: A library for easily evaluating machine learning models and datasets.
β2,266Updated last week
Alternatives and similar repositories for evaluate
Users that are interested in evaluate are comparing it to the libraries listed below
Sorting:
- π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationβ¦β2,987Updated this week
- Efficient few-shot learning with Sentence Transformersβ2,523Updated 3 months ago
- A Unified Library for Parameter-Efficient and Modular Transfer Learningβ2,735Updated last month
- β1,228Updated 11 months ago
- β1,529Updated 2 weeks ago
- Holistic Evaluation of Language Models (HELM) is an open source Python framework created by the Center for Research on Foundation Models β¦β2,344Updated this week
- Toolkit for creating, sharing and using natural language prompts.β2,904Updated last year
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language modelsβ3,082Updated last year
- Model explainability that works seamlessly with π€ transformers. Explain your transformers model in just 2 lines of code.β1,360Updated last year
- β2,846Updated last month
- A modular RL library to fine-tune language models to human preferencesβ2,330Updated last year
- MTEB: Massive Text Embedding Benchmarkβ2,706Updated this week
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.β1,894Updated this week
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.β1,877Updated last month
- The implementation of DeBERTaβ2,118Updated last year
- Organize your experiments into discrete steps that can be cached and reused throughout the lifetime of your research project.β564Updated last year
- Original Implementation of Prompt Tuning from Lester, et al, 2021β687Updated 4 months ago
- Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.β2,487Updated last week
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β8,951Updated this week
- BERT score for text generationβ1,769Updated 11 months ago
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.β1,003Updated 11 months ago
- General technology for enabling AI capabilities w/ LLMs and MLLMsβ4,067Updated 3 weeks ago
- Efficient, scalable and enterprise-grade CPU/GPU inference server for π€ Hugging Face transformer models πβ1,688Updated 8 months ago
- SGPT: GPT Sentence Embeddings for Semantic Searchβ868Updated last year
- PyTorch extensions for high performance and large scale training.β3,339Updated 2 months ago
- Accessible large language models via k-bit quantization for PyTorch.β7,230Updated last week
- β1,583Updated 2 years ago
- The hub for EleutherAI's work on interpretability and learning dynamicsβ2,570Updated last month
- Measuring Massive Multitask Language Understanding | ICLR 2021β1,457Updated 2 years ago
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"β1,762Updated last month