explosion / ml-datasetsLinks
π Machine learning dataset loaders for testing and example scripts
β47Updated last month
Alternatives and similar repositories for ml-datasets
Users that are interested in ml-datasets are comparing it to the libraries listed below
Sorting:
- Automatically labeling training dataβ107Updated 7 years ago
- 𧬠A JupyterLab extension for annotating data with Prodigyβ189Updated 2 years ago
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 4 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learningβ43Updated 5 years ago
- An Implementation of ERNIE For Language Understanding (including Pre-training models and Fine-tuning tools)β27Updated 6 years ago
- The fast.ai data ethics courseβ17Updated 2 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stanβ¦β53Updated 2 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or fβ¦β24Updated 4 years ago
- A Python module to convert natural language numerics into ints and floats.β234Updated last year
- Search system on top of Elasticsearch, Kubeflow and Katibβ29Updated 2 years ago
- Presentations & notebooks from our talks /workshops/meetups/etcβ24Updated 7 years ago
- The ntentional blog - a machine learning journeyβ23Updated 3 years ago
- Comparing Polars to Pandas and a small introductionβ44Updated 4 years ago
- Custom Natural Language Processing with big and small models π²π±β66Updated 4 years ago
- Tutorial for Topic Modelling using PySpark and Spark NLPβ17Updated 5 years ago
- β70Updated 3 years ago
- A fully customisable language detection pipeline for spaCyβ93Updated 6 years ago
- A collection of simple tutorials for using Fonduerβ100Updated 5 years ago
- Smarter Manual Annotation for Resource-constrained collection of Training dataβ230Updated last year
- Information for readers of the fastai bookβ69Updated 5 years ago
- Example using Polyaxon to experiment with pre-training spaCyβ65Updated 4 years ago
- An easy to use open-source library for advanced Deep Learning and Natural Language Processingβ113Updated last year
- Python text processing, pattern matching, and NLP frameworkβ67Updated 2 years ago
- spaCy match and replace, maintaining conjugationβ36Updated 3 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methodsβ96Updated 5 years ago
- General Interpretability Packageβ58Updated 3 years ago
- classy is a simple-to-use library for building high-performance Machine Learning models in NLP.β87Updated last week
- allennlp + streamlit demoβ21Updated 6 years ago
- β¨ Web interface for NeuralCoref coreference resolutionβ34Updated 2 years ago
- A clean and easy interface for performing nearest-neighbor lookupsβ50Updated 6 years ago