d99kris / spacy-cpp
C++ wrapper library for the NLP library spaCy
☆102Updated 2 years ago
Alternatives and similar repositories for spacy-cpp:
Users that are interested in spacy-cpp are comparing it to the libraries listed below
- General-Purpose Neural Networks for Sentence Boundary Detection☆72Updated 2 years ago
- spaCy + UDPipe☆161Updated 2 years ago
- Named Entity Recognition based on dictionaries☆242Updated 6 years ago
- A sentence segmenter that actually works!☆305Updated 4 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆191Updated 4 years ago
- xfspell — the Transformer Spell Checker☆189Updated 4 years ago
- Deep-learning based sentence auto-segmentation from unstructured text w/o punctuation☆36Updated 7 years ago
- Fast and customizable text tokenization library with BPE and SentencePiece support☆304Updated 7 months ago
- Fast SymSpell written in C++ and exposed to Python via pybind11☆42Updated last month
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆254Updated 7 months ago
- Named Entity Recognition data for Europeana Newspapers☆171Updated last year
- word2vec++ is a Distributed Representations of Words (word2vec) library and tools implementation, written in C++11 from scratch☆137Updated last year
- 📂 Additional lookup tables and data resources for spaCy☆105Updated 2 months ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- Unicode tokeniser. Ucto tokenizes text files: it separates words from punctuation, and splits sentences. It offers several other basic pr…☆68Updated last month
- AmbiverseNLU: A Natural Language Understanding suite by Max Planck Institute for Informatics☆210Updated last year
- A minimal, pure Python library to interface with CoNLL-U format files.☆149Updated last year
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆222Updated 2 years ago
- Disambiguate is a tool for training and using state of the art neural WSD models☆59Updated 2 years ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated 10 months ago
- Language detection extension for spaCy 2.0+☆112Updated 6 years ago
- Colibri core is an NLP tool as well as a C++ and Python library for working with basic linguistic constructions such as n-grams and skipg…☆126Updated 3 months ago
- Corpus preprocessing☆95Updated last year
- Implementation of Hobbs' algorithm for coreference resolution in Python☆44Updated 4 years ago
- COMBO is a jointly trained tagger, lemmatizer and dependency parser.☆35Updated 2 years ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆412Updated 2 months ago
- Parse natural language time expressions in Python☆131Updated 2 years ago
- Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)☆158Updated 5 years ago
- Build a dialog dataset from online books in many languages☆72Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2)☆202Updated 3 years ago