PayLead / PyLighter
Annotation tool on Jupyter for Named Entity Recognition tasks
★21 · Updated 8 months ago
Related projects
Alternatives and complementary repositories for PyLighter
- 🧪 Cutting-edge experimental spaCy components and features · ★95 · Updated 6 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts. · ★151 · Updated 5 months ago
- A Python library aimed at dissecting and augmenting NER training data. · ★56 · Updated last year
- Sentence transformers models for SpaCy · ★105 · Updated last year
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13 · ★160 · Updated 2 weeks ago
- A spaCy custom component that extracts and normalizes temporal expressions · ★52 · Updated last year
- Super lightweight function registries for your library · ★173 · Updated 5 months ago
- 🤗 Push your spaCy pipelines to the Hugging Face Hub · ★43 · Updated 5 months ago
- Language Models for Zalando's flair library · ★62 · Updated 4 years ago
- Bag of, not words, but tricks! · ★68 · Updated last year
- ★70 · Updated last year
- spaCy pipeline object for negating concepts in text · ★274 · Updated 5 months ago
- Dataframe Integration with spaCy. · ★101 · Updated 3 years ago
- Source code and data for Like a Good Nearest Neighbor · ★28 · Updated 9 months ago
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy · ★287 · Updated last year
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… · ★76 · Updated 4 months ago
- ☄️ Parallel and distributed training with spaCy and Ray · ★54 · Updated last year
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. · ★88 · Updated 2 years ago
- 🛠️ Tools for Transformers compression using PyTorch Lightning ⚡ · ★79 · Updated last week
- Running Prodigy for a team of annotators · ★53 · Updated 3 years ago
- spaCy match and replace, maintaining conjugation · ★34 · Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … · ★106 · Updated 8 months ago
- Text tokenization and sentence segmentation (segtok v2) · ★203 · Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang… · ★117 · Updated 6 months ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linking · ★84 · Updated 2 years ago
- ★29 · Updated 2 years ago
- ★42 · Updated last year
- Generate reports for spaCy models. · ★28 · Updated 2 years ago
- Robust and Fast tokenizations alignment library for Rust and Python https://tamuhey.github.io/tokenizations/ · ★187 · Updated last year