explosion / ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
☆47Updated 2 years ago
Alternatives and similar repositories for ml-datasets:
Users that are interested in ml-datasets are comparing it to the libraries listed below
- A visualisation tool for Spacy using Hierplane.☆65Updated 2 years ago
- Python text processing, pattern matching, and NLP framework☆63Updated last year
- ☆70Updated 2 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- The ntentional blog - a machine learning journey☆23Updated 2 years ago
- A fully customisable language detection pipeline for spaCy☆92Updated 5 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 3 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 11 months ago
- Example using Polyaxon to experiment with pre-training spaCy☆65Updated 3 years ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do…☆49Updated 10 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆53Updated last year
- EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and E…☆41Updated 2 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking☆85Updated 2 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 4 years ago
- 🧬 A JupyterLab extension for annotating data with Prodigy☆189Updated last year
- Sentence transformers models for SpaCy☆107Updated last year
- allennlp + streamlit demo☆22Updated 5 years ago
- Dataframe Integration with spaCy.☆103Updated 3 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago
- 📂 Additional lookup tables and data resources for spaCy☆101Updated 3 weeks ago
- Generate reports for spaCy models.☆29Updated 2 years ago
- Bag of, not words, but tricks!☆68Updated last year
- The fast.ai data ethics course☆15Updated 2 years ago
- Inter-annotator agreement for Doccano☆27Updated 4 years ago
- 🚀GUI for training spaCy models☆54Updated 3 years ago
- Notebooks configured to be run with Binder, usually found on my blog.☆42Updated last year
- Super lightweight function registries for your library☆177Updated 8 months ago
- Language detection using Spacy and Fasttext☆55Updated last year
- Text Mining and Topic Modeling Toolkit for Python with parallel processing power☆190Updated last year