MathieuLoutre / node-symspellLinks
JavaScript port of SymSpell for Node.js
☆13Updated 3 years ago
Alternatives and similar repositories for node-symspell
Users that are interested in node-symspell are comparing it to the libraries listed below
Sorting:
- A semi-unsupervised language independent morphological analyzer useful for stemming unknown language text, or getting a rough estimate of…☆22Updated 8 years ago
- The Wikinflection Corpus, from the paper "Wikinflection Corpus: A (Better) Multilingual, Morpheme-Annotated Inflectional Corpus" (Metheni…☆12Updated 2 years ago
- roll a wikipedia dump into mongo☆249Updated last year
- English NLP for Node.js and the browser.☆87Updated 2 years ago
- A list of words from the SUBTLEX movie subtitles corpus, sorted by frequency.☆37Updated 5 years ago
- TweetCaT - a tool for building Twitter corpora of smaller languages or specific geographical regions☆12Updated 8 years ago
- Convert between DOM Range instances and text positions.☆26Updated 5 years ago
- 📝 Hunspell compatible spell-checker☆289Updated 4 years ago
- Quickly estimate the similarity between many sets☆53Updated 3 years ago
- FastText for Node.js☆199Updated 2 years ago
- 🎀 JavaScript API for spaCy with Python REST API☆199Updated 2 years ago
- PhiloLogic4☆39Updated last year
- Nodejs binding for fasttext representation and classification.☆43Updated last year
- A Corpus Data Retrieval Index using Lucene for Look-Ups☆19Updated 3 weeks ago
- Tools for scraping, annotating, and parsing morphological information from Wiktionary☆15Updated 6 years ago
- Get n-grams from text☆84Updated 3 years ago
- SpellcheckerWasm is an extrememly fast spellchecker for WebAssembly based on SymSpell☆62Updated 3 years ago
- Simularity identification in JS☆37Updated last year
- WebAssembly based Javascript bindings for google Compact Language Detector v3☆75Updated 2 years ago
- Speaker count for 450+ languages☆20Updated 3 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆32Updated 6 months ago
- All languages stopwords collection☆475Updated 2 years ago
- Filter and format a newline-delimited JSON stream of Wikibase entities☆105Updated 4 months ago
- CLDR text segmentation for JavaScript☆39Updated last year
- Multilingual tokenizer that automatically tags each token with its type☆65Updated 2 years ago
- A cloud-based, open-source system for writing and publishing dictionaries.☆98Updated 2 years ago
- Custom French POS and lemmatizer based on Lefff for spacy☆68Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆182Updated 7 months ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆57Updated 4 years ago
- varied english texts for modern NLP testing☆78Updated 3 years ago