explosion / ml-datasetsLinks
π Machine learning dataset loaders for testing and example scripts
β47Updated 3 years ago
Alternatives and similar repositories for ml-datasets
Users that are interested in ml-datasets are comparing it to the libraries listed below
Sorting:
- β70Updated 2 years ago
- 𧬠A JupyterLab extension for annotating data with Prodigyβ189Updated 2 years ago
- Automatically labeling training dataβ107Updated 6 years ago
- 𧬠A VS Code extension for annotating data with Prodigyβ30Updated 3 years ago
- An Implementation of ERNIE For Language Understanding (including Pre-training models and Fine-tuning tools)β27Updated 6 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stanβ¦β53Updated 2 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learningβ42Updated 5 years ago
- The fast.ai data ethics courseβ16Updated 2 years ago
- Use ML-Annotate to label data for machine learning purposesβ110Updated 5 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.β105Updated 2 years ago
- A fully customisable language detection pipeline for spaCyβ93Updated 6 years ago
- π Emoji handling and meta data for spaCy with custom extension attributesβ181Updated 2 years ago
- Text Mining and Topic Modeling Toolkit for Python with parallel processing powerβ190Updated 2 years ago
- Utilities for preprocessing text for deep learning with Kerasβ180Updated 2 years ago
- Jupyter Widget for data annotationβ140Updated 2 years ago
- Language detection using Spacy and Fasttextβ57Updated last year
- Smarter Manual Annotation for Resource-constrained collection of Training dataβ229Updated 8 months ago
- Find strings/words in text; convenience and C speedβ127Updated 2 years ago
- Dataframe Integration with spaCy.β103Updated 4 years ago
- A Python module to convert natural language numerics into ints and floats.β229Updated 10 months ago
- πGUI for training spaCy modelsβ55Updated 4 years ago
- A collection of simple tutorials for using Fonduerβ100Updated 4 years ago
- π₯ Browser-based slides or PDFs of our talks and presentationsβ94Updated 6 years ago
- A visualisation tool for Spacy using Hierplane.β65Updated 2 years ago
- Python library for Natural Language Preprocessing (NLPre)β191Updated 2 years ago
- Relatively simple text classification powered by spaCyβ41Updated 9 years ago
- β123Updated 2 years ago
- ReconNER, Debug annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality of your data.β35Updated 5 years ago
- A bit of extra usability for sqlalchemy v2.β77Updated last year
- Clean personally identifiable information from dirty dirty text using spaCy.β41Updated last year