larsyencken / cjktools
Tools for processing CJK strings in Python
☆20Updated 7 years ago
Related projects ⓘ
Alternatives and complementary repositories for cjktools
- Python library for CJK (Chinese, Japanese, and Korean) language dictionary☆82Updated this week
- unihandecode is a transliteration library to convert all characters/words in Unicode into ASCII alphabet that aware with Language prefere…☆69Updated 2 years ago
- a script and anki addon to turn KanjiVG data into colored stroke order diagrams☆121Updated 4 months ago
- Search-by-similarity for Japanese kanji☆10Updated 7 years ago
- HDIC : Integrated Database of Hanzi Dictionaries in Early Japan☆34Updated this week
- A Romaji/Kana conversion library for Python☆112Updated 2 years ago
- Various data for Ideographs☆41Updated 10 years ago
- Unidic packaged for installation via pip.☆77Updated last year
- natto-py combines the Python programming language with MeCab, the part-of-speech and morphological analyzer for the Japanese language.☆92Updated 5 months ago
- Small example scripts for working with Japanese texts in Python☆26Updated 5 years ago
- Sane data exporter for an insane dictionary format.☆99Updated last year
- Database for various Ideographic Variants Data☆59Updated last year
- Chinese Character Frequencies☆17Updated 7 years ago
- tokenizer specified for Japanese☆48Updated 3 years ago
- Export UNIHAN's database to csv, json or yaml☆52Updated this week
- 漢字データベースの辞書関連データ☆88Updated last year
- 様々な漢字表のデータベース☆91Updated 5 years ago
- Anki2 Add-On to look-up the pronunciation of Japanese expressions.☆70Updated 3 years ago
- Lightweight converter from Japanese Kana-kanji sentences into Kana-Roman.☆421Updated 2 years ago
- Hy-phen-ation made easy☆202Updated last week
- Japanese Natural Langauge Processing Libraries☆148Updated 4 years ago
- Han character library for CJKV languages☆150Updated 3 years ago
- This repo contains a list of the 44,998 most common Japanese words in order of frequency, as determined by the University of Leeds Corpus…☆66Updated 6 years ago
- Kanji usage frequency data collected from various sources☆131Updated 3 weeks ago
- PanCJKV IVD Collection (UNREGISTERED)☆23Updated 7 years ago
- Trying to consolidate japanese phonetic, and in particular pitch accent resources into one list☆105Updated 9 months ago
- The ultimate kanji resource☆278Updated 4 months ago
- Open source and updatable JLPT Vocabulary Anki Decks☆117Updated 4 months ago
- Translation of the MeCab documentation to English☆42Updated 8 years ago