explosion / ml-datasetsLinks
π Machine learning dataset loaders for testing and example scripts
β47Updated last month
Alternatives and similar repositories for ml-datasets
Users that are interested in ml-datasets are comparing it to the libraries listed below
Sorting:
- Find strings/words in text; convenience and C speedβ126Updated 3 years ago
- 𧬠A JupyterLab extension for annotating data with Prodigyβ189Updated 2 years ago
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 4 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stanβ¦β53Updated 2 years ago
- Automatically labeling training dataβ107Updated 7 years ago
- A Python module to convert natural language numerics into ints and floats.β233Updated last year
- The fast.ai data ethics courseβ17Updated 3 years ago
- Search system on top of Elasticsearch, Kubeflow and Katibβ29Updated 2 years ago
- A fully customisable language detection pipeline for spaCyβ93Updated 6 years ago
- Jupyter Widget for data annotationβ140Updated 3 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing powerβ191Updated 2 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.β104Updated 3 years ago
- A collection of simple tutorials for using Fonduerβ100Updated 5 years ago
- The ntentional blog - a machine learning journeyβ23Updated 3 years ago
- Smarter Manual Annotation for Resource-constrained collection of Training dataβ230Updated last year
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learningβ43Updated 5 years ago
- A machine learning testing framework for sklearn and pandas. The goal is to help folks assess whether things have changed over time.β104Updated 2 weeks ago
- Bag of, not words, but tricks!β68Updated 2 years ago
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text dataβ¦β243Updated last year
- Super lightweight function registries for your libraryβ181Updated last year
- Use ML-Annotate to label data for machine learning purposesβ110Updated 5 years ago
- Python word cloud library for use within Jupyter notebook and Python apps.β49Updated last year
- β70Updated 3 years ago
- General Interpretability Packageβ58Updated 3 years ago
- Production Machine Learning Pipeline for Text Classification with fastTextβ33Updated 4 years ago
- Pyinfer is a model agnostic tool for ML developers and researchers to benchmark the inference statistics for machine learning models or fβ¦β24Updated 4 years ago
- Articles on machine learningβ66Updated 3 years ago
- MLOps simplified. One-stop AI delivery platform, all the features you need.β106Updated last week
- β123Updated 2 years ago
- Dataframe Integration with spaCy.β103Updated 4 years ago