A small version of UniDic for easy pip installs.
☆49Sep 1, 2020Updated 5 years ago
Alternatives and similar repositories for unidic-lite
Users that are interested in unidic-lite are comparing it to the libraries listed below
Sorting:
- Unidic packaged for installation via pip.☆108Feb 26, 2025Updated last year
- A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.☆511Oct 24, 2025Updated 4 months ago
- English loanwords in Japanese☆19Oct 24, 2024Updated last year
- mecab-python. you can find original version here//taku910.github.io/mecab/☆580Nov 25, 2025Updated 3 months ago
- Speaker embedding for anime speech domain based on ECAPA_TDNN☆17Jun 22, 2025Updated 8 months ago
- A tool for visualizing the internal structures of morphological analyzer Sudachi☆18Jun 9, 2022Updated 3 years ago
- MeCab model trained with OpenKorPos.☆23Jun 19, 2022Updated 3 years ago
- 🐍 pymecab-ko. you can find original version here: https://bitbucket.org/eunjeon/mecab-ko, https://github.com/SamuraiT/mecab-python3☆22Sep 23, 2025Updated 5 months ago
- 🌸De-inflect Japanese words☆15Nov 24, 2025Updated 3 months ago
- ☆11Jan 11, 2022Updated 4 years ago
- ☆11Oct 24, 2021Updated 4 years ago
- 🤖✨🗺 charites-ai - AI that can generate json files according to MapLibre style specification based on natural language instructions☆27Feb 26, 2026Updated last week
- baikal.ai's pre-trained BERT models: descriptions and sample codes☆12Jun 24, 2021Updated 4 years ago
- CC-CEDICT-MeCab is a MeCab dictionary for Chinese (Mandarin) text segmentation☆13Apr 9, 2020Updated 5 years ago
- A Japanese accent dictionary generator☆123Mar 21, 2024Updated last year
- Generate SKK/MeCab dictionary from Wikipedia(Japanese edition)☆59Feb 8, 2026Updated 3 weeks ago
- A lexicon for Sudachi☆279Jan 20, 2026Updated last month
- 青空文庫及びサピエの点字データから作成した振り仮名コーパスのデータセット☆17Jan 17, 2024Updated 2 years ago
- Convert Korean to Katakana☆13Dec 13, 2023Updated 2 years ago
- Kanji Converter to Hiragana, Katakana, Roman alphabet.☆17Oct 30, 2025Updated 4 months ago
- Japanese to romaji converter in Python☆372Jun 2, 2025Updated 9 months ago
- BERT with SentencePiece for Japanese text.☆33Oct 28, 2021Updated 4 years ago
- Dictionary Library for Kagome v2☆15Feb 10, 2026Updated 3 weeks ago
- A lidera japanese tokenizer wrapper for javascript and typescript☆16Dec 29, 2021Updated 4 years ago
- ☆36Sep 20, 2022Updated 3 years ago
- ☆11Jun 19, 2022Updated 3 years ago
- Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese with BERT models☆20Updated this week
- 青空文庫振り仮名注釈付き音声コーパスのデータセット☆45Mar 7, 2025Updated 11 months ago
- mirror of git://source.ffmpeg.org/ffmpeg.git☆16Jul 10, 2024Updated last year
- Reading mdict files, support MDX/MDD file formats.☆18Feb 3, 2026Updated last month
- Automatic Korean Hanja tagging tool powered by Hanjaro (hanjaro.juntong.or.kr)☆18Feb 22, 2019Updated 7 years ago
- ☆24Jan 14, 2021Updated 5 years ago
- Annotated Fuman Kaitori Center Corpus☆18Dec 18, 2023Updated 2 years ago
- Client software for the AIST 3DDB system to be published as open source software.☆23Jun 13, 2024Updated last year
- Monokakido to Yomitan☆30Aug 18, 2025Updated 6 months ago
- ☆43Feb 2, 2024Updated 2 years ago
- A Japanese tokenizer based on recurrent neural networks☆412Feb 12, 2026Updated 3 weeks ago
- Python version of Sudachi, a Japanese tokenizer.☆427Oct 7, 2022Updated 3 years ago
- This is a repository for the Travatar forest-to-string translation decoder☆29Aug 7, 2021Updated 4 years ago