huggingface / datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
★19,461 Updated this week
Alternatives and similar repositories for datasets:
Users who are interested in datasets are comparing it to the libraries listed below.
- Pretrain, finetune ANY AI model of ANY size on multiple GPUs, TPUs with zero code changes. ★28,807 Updated last week
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. ★137,641 Updated this week
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python. ★30,809 Updated last week
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i… ★8,178 Updated this week
- 💥 Fast State-of-the-Art Tokenizers optimized for Research and Production ★9,258 Updated this week
- Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more ★30,985 Updated this week
- Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stre… ★8,298 Updated this week
- State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enter… ★13,831 Updated 5 months ago
- Unsupervised text tokenizer for Neural Network-based text generation. ★10,479 Updated last month
- A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training ★20,810 Updated 5 months ago
- Code for the paper "Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer" ★6,240 Updated 3 months ago
- Ongoing research training transformer models at scale ★11,109 Updated this week
- scikit-learn: machine learning in Python ★60,776 Updated this week
- Fast and Accurate ML in 3 Lines of Code ★8,262 Updated this week
- An open-source, low-code machine learning library in Python ★9,075 Updated this week
- Flax is a neural network library for JAX that is designed for flexibility. ★6,272 Updated this week
- 📚 Papers & tech blogs by companies sharing their work on data science & machine learning in production. ★27,540 Updated 5 months ago
- State-of-the-Art Text Embeddings ★15,772 Updated last week
- Google Research ★34,691 Updated this week
- Notebooks using the Hugging Face libraries 🤗 ★3,792 Updated last week
- Hydra is a framework for elegantly configuring complex applications ★8,960 Updated this week
- A hyperparameter optimization framework ★11,236 Updated this week
- This repository contains implementations and illustrative code to accompany DeepMind publications ★13,413 Updated last month
- 🏆 A ranked list of awesome machine learning Python libraries. Updated weekly. ★18,739 Updated last week
- Trax: Deep Learning with Clear Code and Speed ★8,136 Updated this week
- Train transformer language models with reinforcement learning. ★10,609 Updated this week
- A toolkit for developing and comparing reinforcement learning algorithms. ★35,156 Updated 3 months ago
- A data augmentations library for audio, image, text, and video. ★4,983 Updated last month
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) ★8,655 Updated this week
- A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto… ★12,864 Updated this week