explosion / ml-datasets
Machine learning dataset loaders for testing and example scripts
★47 · Updated 3 years ago
Alternatives and similar repositories for ml-datasets
Users who are interested in ml-datasets are comparing it to the libraries listed below.
- The fast.ai data ethics course ★16 · Updated 2 years ago
- The ntentional blog - a machine learning journey ★23 · Updated 2 years ago
- A VS Code extension for annotating data with Prodigy ★30 · Updated 3 years ago
- A Python module to convert natural language numerics into ints and floats. ★229 · Updated 9 months ago
- Python text processing, pattern matching, and NLP framework ★66 · Updated 2 years ago
- A JupyterLab extension for annotating data with Prodigy ★189 · Updated 2 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs." ★26 · Updated 2 years ago
- Automatically labeling training data ★107 · Updated 6 years ago
- Jupyter Widget for data annotation ★140 · Updated 2 years ago
- Information for readers of the fastai book ★68 · Updated 4 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning ★41 · Updated 5 years ago
- Natural language processing support for Pandas dataframes. ★216 · Updated 4 months ago
- Generate reports for spaCy models. ★29 · Updated 3 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan… ★53 · Updated 2 years ago
- An Implementation of ERNIE For Language Understanding (including Pre-training models and Fine-tuning tools) ★27 · Updated 5 years ago
- Deep learning with text doesn't have to be scary. ★276 · Updated 2 years ago
- Smarter Manual Annotation for Resource-constrained collection of Training data ★229 · Updated 7 months ago
- Parallel and distributed training with spaCy and Ray ★54 · Updated last year
- Bag of, not words, but tricks! ★68 · Updated last year
- Production Machine Learning Pipeline for Text Classification with fastText ★32 · Updated 4 years ago
- allennlp + streamlit demo ★22 · Updated 5 years ago
- A simple NLP library allows profiling datasets with one or more text columns. When given a dataset and a column name containing text data… ★242 · Updated last year
- Super Simple Similarities Service ★151 · Updated 3 months ago
- ★123 · Updated 2 years ago
- Utilities for preprocessing text for deep learning with Keras ★180 · Updated 2 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power ★190 · Updated 2 years ago
- Example using Polyaxon to experiment with pre-training spaCy ★65 · Updated 3 years ago
- Use ML-Annotate to label data for machine learning purposes ★109 · Updated 4 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods ★95 · Updated 4 years ago
- Text Similarity Search Application using Modern NLP and Elasticsearch ★30 · Updated 5 years ago