aosingh / lexpy
Python package for lexicon; Trie and DAWG implementation.
☆55Updated 4 months ago
Alternatives and similar repositories for lexpy:
Users that are interested in lexpy are comparing it to the libraries listed below
- Text tokenization and sentence segmentation (segtok v2)☆201Updated 3 years ago
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated 2 years ago
- Hunspell extension for spaCy 2.0.☆94Updated 8 months ago
- Lightning Fast Language Prediction 🚀☆166Updated 6 years ago
- ☆169Updated 3 weeks ago
- A python module for word inflections designed for use with spaCy.☆92Updated 5 years ago
- Fast supervised sentence boundary detection using the averaged perceptron☆90Updated 6 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆114Updated 2 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆77Updated 3 years ago
- Use ML-Annotate to label data for machine learning purposes☆109Updated 4 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- A minimal, pure Python library to interface with CoNLL-U format files.☆151Updated last year
- Inter-annotator agreement for Doccano☆27Updated 4 years ago
- A small tool that EXPLains spACY parse results. See what I did there?☆83Updated 3 years ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- Stand-alone WordNet API☆48Updated 3 years ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- Language Tool style grammar handling with spaCy 2.0☆42Updated 6 years ago
- Code and data accompanying the paper "Approaching nested named entity recognition with parallel LSTM-CRFs."☆26Updated 2 years ago
- LASER multilingual sentence embeddings as a pip package☆223Updated last year
- Misspelling Oblivious Word Embeddings☆203Updated 5 years ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic fe…☆169Updated 3 years ago
- A simple client for doccano API.☆84Updated 10 months ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- Hidden alignment conditional random field for classifying string pairs.☆36Updated 7 years ago
- ☆33Updated 3 years ago
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 2 years ago
- A lightweight but powerful library to build token indices for NLP tasks, compatible with major Deep Learning frameworks like PyTorch and …☆51Updated 4 months ago