explosion / ml-datasets
Machine learning dataset loaders for testing and example scripts
★47 · Updated 2 years ago
Alternatives and similar repositories for ml-datasets:
Users who are interested in ml-datasets compare it to the libraries listed below.
- Example using Polyaxon to experiment with pre-training spaCy ★65 · Updated 3 years ago
- The fast.ai data ethics course ★15 · Updated 2 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning ★41 · Updated 4 years ago
- Parallel and distributed training with spaCy and Ray ★53 · Updated last year
- 🧬 A VS Code extension for annotating data with Prodigy ★30 · Updated 3 years ago
- A utility for labeling clusters of text data. ★28 · Updated 3 years ago
- ★70 · Updated 2 years ago
- allennlp + streamlit demo ★22 · Updated 5 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs." ★26 · Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ★106 · Updated last year
- Jupyter notebook widget to quickly label text data ★47 · Updated 6 years ago
- Bag of, not words, but tricks! ★68 · Updated last year
- Find strings/words in text; convenience and C speed ★126 · Updated 2 years ago
- The ntentional blog - a machine learning journey ★23 · Updated 2 years ago
- Dataframe Integration with spaCy. ★103 · Updated 4 years ago
- Articles on machine learning ★63 · Updated 2 years ago
- Presentations & notebooks from our talks/workshops/meetups/etc ★24 · Updated 7 years ago
- Relatively simple text classification powered by spaCy ★41 · Updated 9 years ago
- Generate reports for spaCy models. ★29 · Updated 2 years ago
- allennlp tutorial for O'Reilly AI Conference, September 2019 ★22 · Updated 5 years ago
- Inter-annotator agreement for Doccano ★27 · Updated 4 years ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do… ★48 · Updated last year
- spaCy + UDPipe ★161 · Updated 2 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods ★95 · Updated 4 years ago
- Information for readers of the fastai book ★67 · Updated 4 years ago
- sequence tagging with spaCy and crfsuite ★19 · Updated 2 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan… ★53 · Updated 2 years ago
- ★30 · Updated 2 years ago
- Converter from UD-trees to BART representation ★36 · Updated last year
- Language detection using Spacy and Fasttext ★55 · Updated last year