yishn / chinese-tokenizerLinks
Tokenizes Chinese texts into words.
☆100Updated 2 years ago
Alternatives and similar repositories for chinese-tokenizer
Users that are interested in chinese-tokenizer are comparing it to the libraries listed below
Sorting:
- HanziJS is a Chinese character and NLP module for Chinese language processing for Node.js☆398Updated last year
- A tool to find grammar patterns in Chinese text☆28Updated 6 years ago
- Converts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation.☆117Updated 2 years ago
- CLDR text segmentation for JavaScript☆38Updated last year
- Han character library for CJKV languages☆164Updated 4 years ago
- 開放漢語字典 - 現代漢語字音數據庫☆24Updated 5 years ago
- Python module that identifies Chinese text as being Simplified or Traditional☆104Updated last year
- English Lemma Database - Compiled by Referencing British National Corpus☆33Updated last year
- 中文词典 / 中文詞典。Chinese / Chinese-English dictionaries.☆214Updated last year
- Free, open-source Chinese handwriting recognition in Javascript☆168Updated 6 years ago
- JavaScript Lemmatizer is a lemmatization library to retrieve a base form from an English inflected word.☆67Updated 4 years ago
- A JavaScript Chinese word segmentation tool based on Python Jieba☆51Updated 12 years ago
- FastText for Node.js☆199Updated 2 years ago
- Analyzes the given text and determine what's the vocabulary level based on CEFR levels☆48Updated 2 years ago
- Cantonese Romanization Converter☆18Updated 4 years ago
- cc-kedict: Creative Commons Korean-English Dictionary☆41Updated 4 years ago
- Text to IPA converter in JavaScript☆58Updated 3 years ago
- Draw animated Japanese characters (Kanji and Kana), Korean characters (Hanja) and Chinese characters (Hanzi) in correct stroke order usin…☆367Updated 2 months ago
- 臺灣閩南語常用詞辭典 資料檔☆78Updated 2 years ago
- The 134,000+ words and their pronunciations in the CMU pronouncing dictionary☆80Updated 4 years ago
- Chrome extension that translates Chinese words when hovering on them.☆40Updated 2 years ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com)☆56Updated 3 weeks ago
- Chinese (zh-cnm) opendata audio files for 8,596 hsk words and 1,707 syllabs.☆57Updated 4 years ago
- Open Language Profiles — English profile datasets from CEFR-J☆156Updated 5 years ago
- Cantonese Linguistics and NLP☆393Updated last year
- Sentence Boundary Detection in javascript for node. http://tessmore.github.io/sbd/☆217Updated 2 years ago
- Gather modern English word frequencies from all enwiki articles.☆227Updated last year
- An experimental webpage for observing Chinese natural language processing. It demonstrates the processes of decomposition, transformation…☆69Updated last year
- 這棵橡木是松鼠的☆26Updated 9 years ago
- Implement the supermemo 2 algorithm.☆81Updated 3 years ago