takafumir / javascript-lemmatizerLinks
JavaScript Lemmatizer is a lemmatization library to retrieve a base form from an English inflected word.
☆66Updated 3 years ago
Alternatives and similar repositories for javascript-lemmatizer
Users that are interested in javascript-lemmatizer are comparing it to the libraries listed below
Sorting:
- English lemmatizer☆67Updated 2 years ago
- Analyzes the given text and determine what's the vocabulary level based on CEFR levels☆45Updated 2 years ago
- CLDR text segmentation for JavaScript☆38Updated last year
- A list of words from the SUBTLEX movie subtitles corpus, sorted by frequency.☆33Updated 5 years ago
- English Lemma Database - Compiled by Referencing British National Corpus☆31Updated 8 months ago
- Tokenizes Chinese texts into words.☆98Updated 2 years ago
- Read-only mirror of https://framagit.org/tuxor1337/stardict.js. Pull requests and issues on GitHub cannot be accepted and will be automat…☆44Updated 2 years ago
- A collection of modules and utilities for doing things with phonemes.☆50Updated 3 years ago
- an opinionated assembly of wordnet for javascript☆56Updated 8 years ago
- A tool to find grammar patterns in Chinese text☆27Updated 5 years ago
- *.mdx/*.mdd interpreter js implements, support mdict index file☆173Updated 2 months ago
- wordpos for the web/browser☆43Updated 4 years ago
- ⚙️ [Processor] A better English POS tagger written in JavaScript☆54Updated 8 years ago
- Javascript text tokenizer that is easy to use and compose.☆32Updated 9 years ago
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionary☆79Updated 3 years ago
- English (natural language) parser☆160Updated 7 months ago
- Japanese data from the Google UDT 2.0.☆38Updated this week
- Tools for extracting data from Apple dictionary files (used by the Dictionary application on Mac).☆117Updated last year
- Natural Language Concrete Syntax Tree format☆221Updated 8 months ago
- WordNet in JSON format.☆91Updated 4 years ago
- Fast Double Metaphone algorithm☆92Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles.☆213Updated last year
- HanziJS is a Chinese character and NLP module for Chinese language processing for Node.js☆381Updated 8 months ago
- Export UNIHAN's database to csv, json or yaml☆58Updated this week
- Implement the supermemo 2 algorithm.☆81Updated 2 years ago
- Open Language Profiles — English profile datasets from CEFR-J☆126Updated 5 years ago
- A library for writing dictionary files in the MDict (.mdx) format☆335Updated 7 years ago
- Converts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation.☆112Updated last year
- Describe and resolve DOM Range objects using XPath☆42Updated 8 years ago
- Put together a multilingual corpus from a variety of sources. Used for wordfreq and word embeddings.☆51Updated 3 years ago