huggingface / datasetsLinks
π€ The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
β20,921Updated this week
Alternatives and similar repositories for datasets
Users that are interested in datasets are comparing it to the libraries listed below
Sorting:
- π€ Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal modelβ¦β153,203Updated this week
- π₯ Fast State-of-the-Art Tokenizers optimized for Research and Productionβ10,252Updated this week
- Unsupervised text tokenizer for Neural Network-based text generation.β11,474Updated last week
- π A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (iβ¦β9,329Updated this week
- Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalitiesβ21,851Updated 5 months ago
- Repo for external large-scale workβ6,548Updated last year
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer"β6,455Updated 3 weeks ago
- State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterβ¦β14,602Updated last year
- DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.β40,890Updated last week
- State-of-the-Art Text Embeddingsβ17,942Updated last week
- Ongoing research training transformer models at scaleβ14,389Updated this week
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.β30,530Updated last week
- A framework for training and evaluating AI models on a variety of openly available dialogue datasets.β10,625Updated 2 years ago
- Trax β Deep Learning with Clear Code and Speedβ8,295Updated 2 months ago
- BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)β7,797Updated 6 months ago
- Notebooks using the Hugging Face libraries π€β4,388Updated this week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.β32,008Updated 2 months ago
- π Accelerate inference and training of π€ Transformers, Diffusers, TIMM and Sentence Transformers with easy to use hardware optimizationβ¦β3,192Updated 2 weeks ago
- Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conveβ¦β4,229Updated 3 months ago
- Google Researchβ36,832Updated this week
- Repository to track the progress in Natural Language Processing (NLP), including the datasets and the current state-of-the-art for the moβ¦β22,966Updated last year
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed librariesβ7,343Updated 2 months ago
- An open-source NLP research library, built on PyTorch.β11,887Updated 3 years ago
- The AI developer platform. Use Weights & Biases to train and fine-tune models, and manage models from experimentation to production.β10,576Updated last week
- π€ PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.β20,157Updated last week
- The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic β¦β3,614Updated this week
- Public repo for HF blog postsβ3,212Updated this week
- GPT-3: Language Models are Few-Shot Learnersβ15,778Updated 5 years ago
- π€ Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.β31,812Updated this week
- ONNX Runtime: cross-platform, high performance ML inferencing and training acceleratorβ18,504Updated this week