huggingface / evaluate
๐ค Evaluate: A library for easily evaluating machine learning models and datasets.
โ2,037Updated 2 months ago
Related projects โ
Alternatives and complementary repositories for evaluate
- ๐ Accelerate training and inference of ๐ค Transformers and ๐ค Diffusers with easy to use hardware optimization toolsโ2,576Updated this week
- Efficient few-shot learning with Sentence Transformersโ2,239Updated 2 months ago
- โ1,474Updated 3 weeks ago
- PyTorch extensions for high performance and large scale training.โ3,195Updated last week
- A modular RL library to fine-tune language models to human preferencesโ2,213Updated 8 months ago
- A repo for distributed training of language models with Reinforcement Learning via Human Feedback (RLHF)โ4,502Updated 10 months ago
- A Unified Library for Parameter-Efficient and Modular Transfer Learningโ2,581Updated 2 weeks ago
- Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.โ980Updated 3 months ago
- Toolkit for creating, sharing and using natural language prompts.โ2,700Updated last year
- Holistic Evaluation of Language Models (HELM), a framework to increase the transparency of language models (https://arxiv.org/abs/2211.09โฆโ1,948Updated this week
- The implementation of DeBERTaโ1,991Updated last year
- โ2,686Updated this week
- Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language modelsโ2,871Updated 4 months ago
- ๐ A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iโฆโ7,958Updated this week
- The hub for EleutherAI's work on interpretability and learning dynamicsโ2,282Updated 2 weeks ago
- A Heterogeneous Benchmark for Information Retrieval. Easy to use, evaluate your models across 15+ diverse IR datasets.โ1,620Updated 3 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2โ1,338Updated 8 months ago
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"โ1,624Updated last year
- General technology for enabling AI capabilities w/ LLMs and MLLMsโ3,699Updated last month
- ๐ค A list of wonderful open-source projects & applications integrated with Hugging Face libraries.โ897Updated 6 months ago
- Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.โ1,680Updated last week
- [ACL 2023] One Embedder, Any Task: Instruction-Finetuned Text Embeddingsโ1,869Updated 2 months ago
- Public repo for HF blog postsโ2,377Updated this week
- maximal update parametrization (ยตP)โ1,402Updated 4 months ago
- Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackabโฆโ1,535Updated 9 months ago
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)โ3,072Updated this week
- Measuring Massive Multitask Language Understanding | ICLR 2021โ1,216Updated last year
- โ1,124Updated 3 months ago
- Accessible large language models via k-bit quantization for PyTorch.โ6,299Updated this week