nickdavidhaynes / spacy-cld
Language detection extension for spaCy 2.0+
β112Updated 6 years ago
Alternatives and similar repositories for spacy-cld:
Users that are interested in spacy-cld are comparing it to the libraries listed below
- Hunspell extension for spaCy 2.0.β94Updated 8 months ago
- π Emoji handling and meta data for spaCy with custom extension attributesβ181Updated last year
- Server/Client around Spacy to load spacy only onceβ46Updated 7 years ago
- π« Scripts, tools and resources for developing spaCyβ125Updated 6 years ago
- A visualisation tool for Spacy using Hierplane.β65Updated 2 years ago
- Named Entity Recognition data for Europeana Newspapersβ171Updated last year
- Fast supervised sentence boundary detection using the averaged perceptronβ90Updated 6 years ago
- spaCy + UDPipeβ161Updated 2 years ago
- spaCy pipeline component for adding text readability meta data to Doc objects.β56Updated 6 years ago
- π« REST microservices for various spaCy-related tasksβ240Updated 2 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtitiesβ114Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddingsβ88Updated 4 years ago
- Excel Integration with spaCy. Training NER using Excel/XLSX from PDF, DOCX, PPT, PNG or JPG.β105Updated 2 years ago
- Regex like pattern tree matching but on sentence's tree instead of Stringsβ42Updated 7 years ago
- Language Tool style grammar handling with spaCy 2.0β42Updated 6 years ago
- π Additional lookup tables and data resources for spaCyβ105Updated 2 months ago
- FoLiA Linguistic Annotation Tool -- Flat is a web-based linguistic annotation environment based around the FoLiA format (http://proycon.gβ¦β112Updated 2 months ago
- Segtok v2 is here: https://github.com/fnl/syntok -- A rule-based sentence segmenter (splitter) and a word tokenizer using orthographic feβ¦β169Updated 3 years ago
- Temporal Expression Recognition and Normalisation in Pythonβ78Updated 9 years ago
- Named Entity Recognition based on dictionariesβ242Updated 6 years ago
- Example using Polyaxon to experiment with pre-training spaCyβ65Updated 3 years ago
- Running Prodigy for a team of annotatorsβ53Updated 4 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.β77Updated 3 years ago
- A fully customisable language detection pipeline for spaCyβ92Updated 5 years ago
- C++ implementation of Generalised Brown clustering and python scripts for feature generationβ41Updated 8 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern webβ199Updated 6 years ago
- Repository for the word embeddings experiments described in "Evaluating Unsupervised Dutch Word Embeddings as a Linguistic Resource", preβ¦β83Updated 3 years ago
- KenLM extension for spaCy 2.0.β16Updated 7 years ago
- Language independent truecaser in Python.β160Updated 3 years ago
- Socially-Equitable Language Identificationβ78Updated 2 years ago