aosingh / lexpy
Python package for lexicon; Trie and DAWG implementation.
☆55 Updated 6 months ago
Alternatives and similar repositories for lexpy
Users who are interested in lexpy compare it to the libraries listed below.
- Language independent truecaser in Python. ☆160 Updated 3 years ago
- A Python module for word inflections designed for use with spaCy. ☆92 Updated 5 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance. ☆77 Updated 3 years ago
- Hunspell extension for spaCy 2.0. ☆94 Updated 10 months ago
- General-Purpose Neural Networks for Sentence Boundary Detection ☆73 Updated 2 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs." ☆26 Updated 2 years ago
- ☆171 Updated 3 months ago
- Lightning Fast Language Prediction 🚀 ☆167 Updated 6 years ago
- Fast supervised sentence boundary detection using the averaged perceptron ☆90 Updated 6 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆205 Updated 3 years ago
- Sentence transformers models for spaCy ☆107 Updated 2 years ago
- Stand-alone WordNet API ☆48 Updated 3 years ago
- A minimal, pure Python library to interface with CoNLL-U format files. ☆151 Updated 2 years ago
- LanguageTool-style grammar handling with spaCy 2.0 ☆42 Updated 6 years ago
- spaCy + UDPipe ☆161 Updated 3 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and … ☆51 Updated 6 months ago
- Automatic extraction of edited sentences from text edition histories. ☆83 Updated 3 years ago
- A compound word splitter for Python ☆48 Updated 3 years ago
- Language detection extension for spaCy 2.0+ ☆113 Updated 6 years ago
- Python bindings for libwapiti ☆67 Updated 5 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆80 Updated 11 months ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆65 Updated 2 years ago
- COMBO is a jointly trained tagger, lemmatizer and dependency parser. ☆35 Updated 2 years ago
- A simple client for doccano API. ☆85 Updated last year
- A fully customisable language detection pipeline for spaCy ☆93 Updated 6 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe… ☆170 Updated 3 years ago
- A Python true casing utility that restores case information for texts ☆89 Updated 2 years ago
- ☆27 Updated 8 years ago
- Inter-annotator agreement for Doccano ☆27 Updated 5 years ago
- A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection ☆60 Updated 8 years ago