bjascob / pyInflect
A Python module for word inflections designed for use with spaCy.
☆92 Updated 4 years ago
Alternatives and similar repositories for pyInflect:
Users who are interested in pyInflect are comparing it to the libraries listed below.
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆78 Updated 6 months ago
- A Word Sense Disambiguation system integrating implicit and explicit external knowledge. ☆68 Updated 3 years ago
- Implementation of the ClausIE information extraction system for Python + spaCy ☆220 Updated 2 years ago
- 🧪 Cutting-edge experimental spaCy components and features ☆96 Updated 8 months ago
- A Super-Lightweight Annotation Tool for Experts: Label text in a terminal with just Python ☆101 Updated 3 weeks ago
- spaCy + UDPipe ☆161 Updated 2 years ago
- A Python true casing utility that restores case information for texts ☆88 Updated 2 years ago
- UFSAC is a resource containing all WordNet Sense Annotated Corpora, and a Java library for manipulating them ☆37 Updated 2 years ago
- A minimal, pure Python library to interface with CoNLL-U format files. ☆149 Updated last year
- spacy-wordnet creates annotations that easily allow the use of WordNet and WordNet domains by using the NLTK WordNet interface ☆253 Updated 4 months ago
- A small tool that EXPLains spACY parse results. See what I did there? ☆83 Updated 2 years ago
- Automatic extraction of edited sentences from text edition histories. ☆82 Updated 2 years ago
- Alignment and annotation for comparable documents. ☆22 Updated 6 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆203 Updated 2 years ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses) ☆64 Updated last year
- Mining Discourse Markers for Unsupervised Sentence Representation Learning ☆60 Updated last year
- A Python module for English lemmatization and inflection. ☆265 Updated last year
- Sentence transformers models for spaCy ☆107 Updated last year
- Language independent truecaser in Python. ☆161 Updated 3 years ago
- A module to compute textual lexical richness (aka lexical diversity). ☆98 Updated last year
- CoNLL-U to pandas DataFrame ☆31 Updated 7 years ago
- German Morphological Analyzer ☆47 Updated 3 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆65 Updated 2 years ago
- Python framework for processing Universal Dependencies data ☆56 Updated 3 weeks ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆171 Updated 3 years ago
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spaCy provide state of … ☆61 Updated 4 years ago
- Language Modelling Makes Sense - WSD (and more) with Contextual Embeddings ☆95 Updated last year
- A dataset of atomic Wikipedia edits containing insertions and deletions of a contiguous chunk of text in a sentence. This dataset contai… ☆106 Updated 5 years ago
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions ☆26 Updated 4 years ago
- List of corpora annotated for coreference for different languages ☆17 Updated 5 months ago