yishn / chinese-tokenizerLinks
Tokenizes Chinese texts into words.
☆100Updated 2 years ago
Alternatives and similar repositories for chinese-tokenizer
Users that are interested in chinese-tokenizer are comparing it to the libraries listed below
Sorting:
- A tool to find grammar patterns in Chinese text☆28Updated 5 years ago
 - HanziJS is a Chinese character and NLP module for Chinese language processing for Node.js☆393Updated last year
 - Converts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation.☆116Updated 2 years ago
 - CLDR text segmentation for JavaScript☆38Updated last year
 - Python module that identifies Chinese text as being Simplified or Traditional☆102Updated 11 months ago
 - English Lemma Database - Compiled by Referencing British National Corpus☆32Updated last year
 - The 134,000+ words and their pronunciations in the CMU pronouncing dictionary☆80Updated 4 years ago
 - Free, open-source Chinese handwriting recognition in Javascript☆164Updated 6 years ago
 - 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆202Updated last year
 - Han character library for CJKV languages☆163Updated 4 years ago
 - Draw animated Japanese characters (Kanji and Kana), Korean characters (Hanja) and Chinese characters (Hanzi) in correct stroke order usin…☆351Updated 3 weeks ago
 - Chrome extension that translates Chinese words when hovering on them.☆40Updated 2 years ago
 - Open Language Profiles — English profile datasets from CEFR-J☆153Updated 5 years ago
 - FastText for Node.js☆198Updated 2 years ago
 - Analyzes the given text and determine what's the vocabulary level based on CEFR levels☆47Updated 2 years ago
 - Enter only simplified characters and create word meaning with Traditional, Pinyin, Meaning, Audio and example sentences☆31Updated 4 years ago
 - rime-cantonese 上游詞表倉庫☆30Updated last year
 - Node.js Interface for CC-CEDICT (http://cc-cedict.org/)☆27Updated 8 years ago
 - Text to IPA converter in JavaScript☆58Updated 3 years ago
 - Converts English text to IPA notation☆391Updated 2 years ago
 - Implement the supermemo 2 algorithm.☆81Updated 3 years ago
 - Offline bilingual dictionaries made using data from Wiktionary☆61Updated 10 years ago
 - Cantonese Romanization Converter☆17Updated 4 years ago
 - JavaScript Lemmatizer is a lemmatization library to retrieve a base form from an English inflected word.☆67Updated 4 years ago
 - English lemmatizer☆67Updated 2 years ago
 - 開放漢語字典 - 現代漢語字音數據庫☆24Updated 5 years ago
 - Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆93Updated last week
 - Chinese language vocabulary graph generation. Python/Flask tool that performs dictionary search and analysis on Chinese Hanzi characters.…☆149Updated 2 years ago
 - ☆28Updated last year
 - A natural language detection library based on trigram statistical analysis for Node.js and the Web.☆212Updated 10 years ago