d99kris / spacy-cpp
C++ wrapper library for the NLP library spaCy
☆102Updated 2 years ago
Alternatives and similar repositories for spacy-cpp:
Users that are interested in spacy-cpp are comparing it to the libraries listed below
- General-Purpose Neural Networks for Sentence Boundary Detection☆73Updated 2 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆127Updated 4 months ago
- Fast and customizable text tokenization library with BPE and SentencePiece support☆302Updated last week
- word2vec++ is a Distributed Representations of Words (word2vec) library and tools implementation, written in C++11 from the scratch☆138Updated last year
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.☆316Updated 2 months ago
- A sentence segmenter that actually works!☆306Updated 4 years ago
- spaCy + UDPipe☆161Updated 3 years ago
- Fast Neural Machine Translation in C++ - development repository☆272Updated 6 months ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆223Updated 2 years ago
- LASER multilingual sentence embeddings as a pip package☆223Updated last year
- Lightweight C++ translator for OpenNMT Torch models (deprecated)☆79Updated 5 years ago
- xfspell — the Transformer Spell Checker☆190Updated 4 years ago
- MIT Language Modeling Toolkit☆116Updated 5 years ago
- Corpus preprocessing☆96Updated last year
- Text tokenization and sentence segmentation (segtok v2)☆201Updated 3 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆191Updated 4 years ago
- Various utilities for processing the data.☆208Updated this week
- Fast SymSpell written in c++ and exposes to python via pybind11☆43Updated last month
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Source code for the Apple reproduction☆32Updated 4 years ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated 2 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆114Updated 2 years ago
- State-of-the-art Supervised Sentence Simplification System from ACL 2014☆46Updated 6 years ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- Efficient Low-Memory Aligner☆143Updated 3 months ago
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- Doing things with embeddings☆64Updated 2 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆255Updated 7 months ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data.☆48Updated 6 years ago