explosion / ml-datasets
Machine learning dataset loaders for testing and example scripts
★47 · Updated 3 years ago
Alternatives and similar repositories for ml-datasets:
Users who are interested in ml-datasets are comparing it to the libraries listed below.
- allennlp + streamlit demo ★22 · Updated 5 years ago
- A VS Code extension for annotating data with Prodigy ★30 · Updated 3 years ago
- A collection of simple tutorials for using Fonduer ★99 · Updated 4 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs." ★26 · Updated 2 years ago
- ★70 · Updated 2 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning ★41 · Updated 4 years ago
- Python text processing, pattern matching, and NLP framework ★63 · Updated last year
- Example using Polyaxon to experiment with pre-training spaCy ★65 · Updated 3 years ago
- Jupyter notebook widget to quickly label text data ★47 · Updated 6 years ago
- EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and E… ★41 · Updated 2 years ago
- Inter-annotator agreement for Doccano ★27 · Updated 4 years ago
- A visualisation tool for spaCy using Hierplane. ★65 · Updated 2 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods ★95 · Updated 4 years ago
- allennlp tutorial for O'Reilly AI Conference, September 2019 ★22 · Updated 5 years ago
- Automatically labeling training data ★105 · Updated 6 years ago
- Python bindings for Stanford CoreNLP's protobufs. ★20 · Updated 6 years ago
- Parallel and distributed training with spaCy and Ray ★53 · Updated last year
- Presentations & notebooks from our talks/workshops/meetups/etc. ★24 · Updated 7 years ago
- Information for readers of the fastai book ★67 · Updated 4 years ago
- Indra is a Web Service which allows easy access to different distributional semantics models in several languages. ★48 · Updated last week
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ★106 · Updated last year
- Dice.com repo to accompany the dice.com 'Vectors in Search' talk by Simon Hughes, from the Activate 2018 search conference, and the 'Sear… ★85 · Updated 3 years ago
- Overview of IR/NLP papers covered in my team's reading group. ★10 · Updated 4 years ago
- The ntentional blog - a machine learning journey ★23 · Updated 2 years ago
- Generate reports for spaCy models. ★29 · Updated 2 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG. ★105 · Updated 2 years ago
- Running Prodigy for a team of annotators ★53 · Updated 4 years ago
- ★30 · Updated 2 years ago
- ★123 · Updated 2 years ago
- Text classification automl ★21 · Updated 3 years ago