polm/unidic-lite

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/polm/unidic-lite)

polm / unidic-lite

A small version of UniDic for easy pip installs.

☆52

Alternatives and similar repositories for unidic-lite

Users that are interested in unidic-lite are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

polm / unidic-py
View on GitHub
Unidic packaged for installation via pip.
☆111Feb 26, 2025Updated last year
polm / fugashi
View on GitHub
A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
☆533Oct 24, 2025Updated 9 months ago
SamuraiT / mecab-python3
View on GitHub
mecab-python. you can find original version here//taku910.github.io/mecab/
☆584Nov 25, 2025Updated 8 months ago
jamesohortle / loanwords_gairaigo
View on GitHub
English loanwords in Japanese
☆19Oct 24, 2024Updated last year
WorksApplications / ViSudachi
View on GitHub
A tool for visualizing the internal structures of morphological analyzer Sudachi
☆18Jun 9, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
higumachan / lindera-js
View on GitHub
A lidera japanese tokenizer wrapper for javascript and typescript
☆16Dec 29, 2021Updated 4 years ago
openkorpos / model-mecab
View on GitHub
MeCab model trained with OpenKorPos.
☆23Jun 19, 2022Updated 4 years ago
NoUnique / pymecab-ko
View on GitHub
🐍 pymecab-ko. you can find original version here: https://bitbucket.org/eunjeon/mecab-ko, https://github.com/SamuraiT/mecab-python3
☆25Sep 23, 2025Updated 10 months ago
polm / cutlet
View on GitHub
Japanese to romaji converter in Python
☆381Jul 1, 2026Updated 3 weeks ago
litagin02 / anime_speaker_embedding
View on GitHub
Speaker embedding for anime speech domain based on ECAPA_TDNN
☆21Jun 22, 2025Updated last year
baikalai / baikal-bert
View on GitHub
baikal.ai's pre-trained BERT models: descriptions and sample codes
☆12Jun 24, 2021Updated 5 years ago
PKSHATechnology-Research / tdmelodic
View on GitHub
A Japanese accent dictionary generator
☆126Mar 21, 2024Updated 2 years ago
zbller / Mecari
View on GitHub
☆40Oct 21, 2025Updated 9 months ago
WorksApplications / SudachiDict
View on GitHub
A lexicon for Sudachi
☆301Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
yuiseki / charites-ai
View on GitHub
🤖✨🗺 charites-ai - AI that can generate json files according to MapLibre style specification based on natural language instructions
☆27Updated this week
6gsn / marine
View on GitHub
☆38Sep 20, 2022Updated 3 years ago
AIRC-KETI / Korean-Copora
View on GitHub
☆14Dec 9, 2021Updated 4 years ago
tokuhirom / jawiki-kana-kanji-dict
View on GitHub
Generate SKK/MeCab dictionary from Wikipedia(Japanese edition)
☆65Updated this week
Language-Media-Lab / commonsense-moral-ja
View on GitHub
☆15Nov 20, 2025Updated 8 months ago
taku910 / mecab
View on GitHub
Yet another Japanese morphological analyzer
☆1,103Feb 22, 2025Updated last year
kdrkdrkdr / ko2kana
View on GitHub
Convert Korean to Katakana
☆13Dec 13, 2023Updated 2 years ago
gotutiyan / GEC-Info-ja
View on GitHub
文法誤り訂正に関する日本語文献を収集・分類するためのリポジトリ
☆14Apr 17, 2025Updated last year
systemd / systemd-centos-ci
View on GitHub
CI scripts for systemd upstream/downstream testing using the CentOS CI infrastructure
☆13May 20, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lighttransport / jagger-python
View on GitHub
Python binding for Jagger(C++ implementation of Pattern-based Japanese Morphological Analyzer)
☆13Dec 16, 2025Updated 7 months ago
yahoojapan / VFD-Dataset
View on GitHub
☆11Nov 10, 2020Updated 5 years ago
ndl-lab / huriganacorpus-aozora
View on GitHub
青空文庫及びサピエの点字データから作成した振り仮名コーパスのデータセット
☆22Jan 17, 2024Updated 2 years ago
WorksApplications / SudachiPy
View on GitHub
Python version of Sudachi, a Japanese tokenizer.
☆442Oct 7, 2022Updated 3 years ago
codeforjapan / Gussuri
View on GitHub
睡眠記録シートのアプリ化のプロジェクト
☆16Jul 16, 2026Updated last week
alinear-corp / albert-japanese
View on GitHub
BERT with SentencePiece for Japanese text.
☆33Oct 28, 2021Updated 4 years ago
nlp-waseda / Kanbun-LM
View on GitHub
Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"
☆21Jul 10, 2023Updated 3 years ago
taishi-i / nagisa
View on GitHub
A Japanese tokenizer based on recurrent neural networks
☆418Jul 6, 2026Updated 2 weeks ago
neubig / travatar
View on GitHub
This is a repository for the Travatar forest-to-string translation decoder
☆29Aug 7, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wtetsu / deinja
View on GitHub
🌸De-inflect Japanese words
☆16Nov 24, 2025Updated 8 months ago
s-nlp / mutual_implication_score
View on GitHub
☆12May 18, 2022Updated 4 years ago
yuni-shinogami / VRChatEngineerWorkingAndDrinkingMeetup
View on GitHub
VRCエンジニア作業飲み集会のためのGithubリポジトリです。Issueを立ててワールドの不具合を報告したり、ご意見提案などを自由に投げられます。
☆15Dec 14, 2021Updated 4 years ago
utanaka2000 / fairseq
View on GitHub
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆25Mar 16, 2021Updated 5 years ago
kaniblu / hanja-tagger
View on GitHub
Automatic Korean Hanja tagging tool powered by Hanjaro (hanjaro.juntong.or.kr)
☆19Feb 22, 2019Updated 7 years ago
WorksApplications / chikkarpy
View on GitHub
Japanese synonym library
☆55Feb 7, 2022Updated 4 years ago
DHRI-Curriculum / command-line
View on GitHub
@DHRI-Curriculum Session on the command line, a means of interacting with your computer programmatically through text.
☆15May 8, 2024Updated 2 years ago