explosion / ml-datasets
Machine learning dataset loaders for testing and example scripts
★46 · Updated 2 years ago
Related projects
Alternatives and complementary repositories for ml-datasets
- A VS Code extension for annotating data with Prodigy ★30 · Updated 2 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning ★42 · Updated 4 years ago
- Example using Polyaxon to experiment with pre-training spaCy ★65 · Updated 3 years ago
- A visualisation tool for spaCy using Hierplane. ★65 · Updated last year
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs." ★26 · Updated last year
- Bag of, not words, but tricks! ★68 · Updated last year
- The ntentional blog - a machine learning journey ★23 · Updated 2 years ago
- Generate reports for spaCy models. ★28 · Updated 2 years ago
- The fast.ai data ethics course ★14 · Updated last year
- ★29 · Updated 2 years ago
- ★70 · Updated last year
- Language detection using spaCy and fastText ★54 · Updated 11 months ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do… ★49 · Updated 7 months ago
- EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and E… ★41 · Updated 2 years ago
- Dataframe Integration with spaCy. ★101 · Updated 3 years ago
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods ★96 · Updated 3 years ago
- Parallel and distributed training with spaCy and Ray ★54 · Updated last year
- Relatively simple text classification powered by spaCy ★42 · Updated 9 years ago
- Presentations & notebooks from our talks/workshops/meetups/etc ★24 · Updated 6 years ago
- Web interface for NeuralCoref coreference resolution ★34 · Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ★106 · Updated 8 months ago
- Articles on machine learning ★62 · Updated 2 years ago
- Automatically labeling training data ★106 · Updated 5 years ago
- spaCy match and replace, maintaining conjugation ★34 · Updated last year
- allennlp + streamlit demo ★22 · Updated 5 years ago
- Notebooks configured to be run with Binder, usually found on my blog. ★41 · Updated last year
- Jupyter notebook widget to quickly label text data ★47 · Updated 5 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings ★87 · Updated 3 years ago
- Hypergol is a Data Science/Machine Learning productivity toolkit to accelerate any projects into production with autogenerated code, stan… ★53 · Updated last year
- RASA wrapper for HMTL: Hierarchical Multi-Task Learning ★30 · Updated 5 years ago