explosion / ml-datasets
🌊 Machine learning dataset loaders for testing and example scripts
☆45Updated 2 years ago
Related projects: ⓘ
- Example using Polyaxon to experiment with pre-training spaCy☆65Updated 2 years ago
- Python text processing, pattern matching, and NLP framework☆61Updated last year
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆42Updated 4 years ago
- ☆70Updated last year
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated last year
- Notebooks configured to be run with Binder, usually found on my blog.☆41Updated last year
- Enso: An Open Source Library for Benchmarking Embeddings + Transfer Learning Methods☆95Updated 3 years ago
- allennlp + streamlit demo☆21Updated 4 years ago
- The fast.ai data ethics course☆14Updated last year
- Running Prodigy for a team of annotators☆53Updated 3 years ago
- The ntentional blog - a machine learning journey☆23Updated last year
- Bag of, not words, but tricks!☆68Updated 10 months ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 3 years ago
- Language detection using Spacy and Fasttext☆53Updated 9 months ago
- A collection of simple tutorials for using Fonduer☆100Updated 3 years ago
- EpiTator annotates epidemiological information in text documents. It is the natural language processing framework that powers GRITS and E…☆42Updated 2 years ago
- Inter-annotator agreement for Doccano☆26Updated 4 years ago
- NERtwork is a collection of scripts to help you create a network graph of co-occurring named entities using open source tools. This is do…☆49Updated 5 months ago
- Jupyter notebook widget to quickly label text data☆47Updated 5 years ago
- A visualisation tool for Spacy using Hierplane.☆65Updated last year
- A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python☆99Updated 7 months ago
- Generate reports for spaCy models.☆28Updated 2 years ago
- Use ML-Annotate to label data for machine learning purposes☆104Updated 4 years ago
- 🧬 A VS Code extension for annotating data with Prodigy☆30Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆72Updated 2 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 6 months ago
- Sentence transformers models for SpaCy☆104Updated last year
- Jupyter Widget for data annotation☆140Updated last year
- Materials for the Neural Network tutorial at PyData NYC 2019☆15Updated last year
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆49Updated 3 years ago