yishn / chinese-tokenizer
Tokenizes Chinese texts into words.
☆96 · Updated 2 years ago
Alternatives and similar repositories for chinese-tokenizer:
Users who are interested in chinese-tokenizer compare it with the libraries listed below.
- A tool to find grammar patterns in Chinese text ☆27 · Updated 5 years ago
- Converts from Chinese characters to pinyin, between simplified and traditional, and does word segmentation. ☆111 · Updated last year
- HanziJS is a Chinese character and NLP module for Chinese language processing for Node.js ☆381 · Updated 6 months ago
- A JavaScript Chinese word segmentation tool based on Python Jieba ☆45 · Updated 11 years ago
- Draw animated Japanese characters (Kanji and Kana), Korean characters (Hanja) and Chinese characters (Hanzi) in correct stroke order usin… ☆317 · Updated 3 months ago
- CLDR text segmentation for JavaScript ☆38 · Updated 11 months ago
- Python module that identifies Chinese text as being Simplified or Traditional ☆91 · Updated 4 months ago
- JavaScript Lemmatizer is a lemmatization library to retrieve a base form from an English inflected word. ☆66 · Updated 3 years ago
- Chinese (zh-cnm) open-data audio files for 8,596 HSK words and 1,707 syllables. ☆45 · Updated 4 years ago
- Data files for the Dictionary of Frequently-Used Taiwan Minnan (臺灣閩南語常用詞辭典) ☆78 · Updated last year
- Chrome extension that translates Chinese words when hovering over them. ☆40 · Updated 2 years ago
- Implements the SuperMemo 2 algorithm. ☆81 · Updated 2 years ago
- An experimental webpage for observing Chinese natural language processing. It demonstrates the processes of decomposition, transformation… ☆64 · Updated 10 months ago
- Gather modern English word frequencies from all enwiki articles. ☆212 · Updated last year
- Node.js Interface for CC-CEDICT (http://cc-cedict.org/) ☆26 · Updated 8 years ago
- English Lemma Database - Compiled by Referencing British National Corpus ☆30 · Updated 6 months ago
- Han character library for CJKV languages ☆156 · Updated 4 years ago
- 《国际中文教育中文水平等级标准》 查询系统 Query System of Chinese Proficiency Grading Standards for International Chinese Language Education, New HSK Levels … ☆29 · Updated last year
- Hanzipy is a Chinese character and NLP module for Chinese language processing for Python. It is primarily written to help provide a frame… ☆21 · Updated last year
- Chinese language vocabulary graph generation. Python/Flask tool that performs dictionary search and analysis on Chinese Hanzi characters.… ☆133 · Updated last year
- Open Chinese Dictionary (開放漢語字典): a database of modern Chinese character pronunciations ☆22 · Updated 4 years ago
- Analyzes the given text and determines its vocabulary level based on CEFR levels ☆45 · Updated 2 years ago
- List of Chinese characters ordered by frequency rank (from most common to least common). Based on Jun Da's Modern Chinese Character Frequ… ☆32 · Updated last year
- Search data for the Shuowen Jiezi (說文解字) ☆93 · Updated 3 months ago
- Chinese lexicon containing definitions, character origins, and statistics, built for Dong Chinese (https://www.dong-chinese.com) ☆45 · Updated 4 years ago
- Stroke order SVG files for Chinese Hanzi characters ☆40 · Updated last year
- Free, open-source Chinese handwriting recognition in JavaScript ☆151 · Updated 5 years ago
- Chinese and Chinese-English dictionaries. ☆168 · Updated last year
- Practice Chinese language grammar ☆17 · Updated 3 years ago
- A free, open source, cross-platform, Chinese-To-English dictionary for desktops. ☆162 · Updated 7 months ago