cdpierse / breameLinks
Lightweight utility tools for the detection of multiple spellings, meanings, and language-specific terminology in British and American English
☆17Updated 4 years ago
Alternatives and similar repositories for breame
Users that are interested in breame are comparing it to the libraries listed below
Sorting:
- Learning BPE embeddings by first learning a segmentation model and then training word2vec☆19Updated 3 years ago
- A python true casing utility that restores case information for texts☆88Updated 3 years ago
- Language independent truecaser in Python.☆160Updated 4 years ago
- Hunspell extension for spaCy 2.0.☆94Updated last year
- Compound splitter for German☆112Updated 5 years ago
- The NLPStatTest project☆12Updated 3 years ago
- SegEval Segmentation Evaluation Package☆57Updated 2 years ago
- Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussain☆60Updated 6 years ago
- REMERGE - Multi-Word Expression discovery algorithm☆14Updated 3 years ago
- GLADIS: A General and Large Acronym Disambiguation Benchmark (EACL 23)☆18Updated last year
- Fast supervised sentence boundary detection using the averaged perceptron☆91Updated 7 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- spaCy + UDPipe☆166Updated 3 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆151Updated last year
- C++ implementation of Generalised Brown clustering and python scripts for feature generation☆41Updated 9 years ago
- A simple neural truecaser written in pytorch and allennlp.☆33Updated last year
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆64Updated 3 years ago
- A python module to process data for Frame Semantic Parsing☆23Updated 5 years ago
- Featurize words into orthographic and phonological vectors.☆41Updated 2 years ago
- Normalize text string☆12Updated 7 years ago
- German lemmatization with IWNLP as extension for spaCy☆26Updated 2 years ago
- Tool for parsing and converting various span encoding schemes.☆23Updated 2 years ago
- Tokenizer for Twitter and Reddit data☆45Updated 6 years ago
- ☆69Updated 2 years ago
- Neural Network for Automatic Negation Detection☆20Updated 9 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆56Updated 2 years ago
- COMBO is jointly trained tagger, lemmatizer and dependency parser.☆36Updated 2 years ago
- A compound splitter based on the semantic regularities in the vector space of word embeddings.☆16Updated 8 years ago
- Cython wrapper on Hunspell Dictionary☆66Updated last year
- German Morphological Analyzer☆51Updated 4 years ago