MathieuLoutre / node-symspell
JavaScript port of SymSpell for Node.js
☆13Updated 2 years ago
Alternatives and similar repositories for node-symspell:
Users that are interested in node-symspell are comparing it to the libraries listed below
- CLDR text segmentation for JavaScript☆38Updated 9 months ago
- English NLP for Node.js and the browser.☆89Updated last year
- Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and …☆9Updated 4 years ago
- ⚙️ [Processor] A better English POS tagger written in JavaScript☆53Updated 7 years ago
- Nodejs binding for fasttext representation and classification.☆42Updated 11 months ago
- Morphological Dictionaries for German Language☆28Updated 6 years ago
- JS Trie / DAWG classes☆30Updated last year
- English Lemma Database - Compiled by Referencing British National Corpus☆29Updated 4 months ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- NLP Functions for amplifying negations, managing elisions, creating ngrams, stems, phonetic codes to tokens and more.☆125Updated 11 months ago
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆17Updated this week
- A semi-unsupervised language independent morphological analyzer useful for stemming unknown language text, or getting a rough estimate of…☆21Updated 7 years ago
- name2nat: a Python package for nationality prediction from a name☆106Updated 4 years ago
- FastText for Node.js☆196Updated last year
- WebAssembly based Javascript bindings for google Compact Language Detector v3☆62Updated last year
- Multilingual tokenizer that automatically tags each token with its type☆61Updated last year
- 🎀 JavaScript API for spaCy with Python REST API☆196Updated last year
- Custom French POS and lemmatizer based on Lefff for spacy☆66Updated last year
- Measure the similarity of text corpora for 74 languages☆13Updated last year
- Distance/Similarity functions for Bag of Words, Strings, Vectors and more.☆23Updated last year
- Wikipedia Bilingual Reference Data (English)☆15Updated 8 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated last year
- German Morphological Analyzer☆47Updated 3 years ago
- Get n-grams from text☆78Updated 2 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆189Updated 4 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago
- The official repository for the The Project Dialogism Novel Corpus, a dataset of annotated quotations in full-length English novels.☆39Updated last year
- Quickly estimate the similarity between many sets☆51Updated 2 years ago
- Extracts plain text, language identification and more metadata from WARC records☆21Updated 2 weeks ago
- Sentence Boundary Detection in javascript for node. http://tessmore.github.io/sbd/☆209Updated last year