Kuromoji is a self-contained and very easy to use Japanese morphological analyzer designed for search
☆1,044Jan 23, 2023Updated 3 years ago
Alternatives and similar repositories for kuromoji
Users that are interested in kuromoji are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- JavaScript implementation of Japanese morphological analyzer☆983Nov 12, 2023Updated 2 years ago
- Yet another Japanese morphological analyzer☆1,093Feb 22, 2025Updated last year
- A Japanese Tokenizer for Business☆972Updated this week
- Neologism dictionary based on the language resources on the Web for mecab-ipadic☆2,783Dec 27, 2023Updated 2 years ago
- Japanese language library for converting Japanese sentence to Hiragana, Katakana or Romaji with furigana and okurigana modes supported.☆972Jun 7, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Rakuten MA - morphological analyzer (word segmentor + PoS Tagger) for Chinese and Japanese written purely in JavaScript.☆472Feb 2, 2019Updated 7 years ago
- Self-contained Japanese Morphological Analyzer written in pure Go☆965Updated this week
- Kuromoji server and demo that shows Japanese morphological analyzer capabilities☆26Nov 11, 2017Updated 8 years ago
- These scripts to build a Lucene Kuromoji or Atilika Kuromoji with bundled mecab-ipadic-NEologd.☆23Apr 16, 2020Updated 6 years ago
- Juman++ (a Morphological Analyzer Toolkit)☆414Apr 17, 2026Updated last month
- Japanese Natural Langauge Processing Libraries☆147Sep 9, 2020Updated 5 years ago
- Japanese morphological analysis engine written in pure Python☆913Oct 13, 2025Updated 7 months ago
- The Kyoto Text Analysis Toolkit for word segmentation and pronunciation estimation, etc.☆213Apr 3, 2020Updated 6 years ago
- Japanese (kuromoji) Analysis Plugin☆171Feb 5, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Elasticsearch's Analyzer for Kuromoji with Neologd☆115Nov 22, 2023Updated 2 years ago
- I converted WanaKana (https://github.com/wanikani/wanakana) to Java for use in Android projects.☆39Sep 14, 2015Updated 10 years ago
- A lexicon for Sudachi☆296Apr 30, 2026Updated last month
- Linguistic tools for texts in Japanese language☆394Jan 18, 2026Updated 4 months ago
- A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.☆521Oct 24, 2025Updated 7 months ago
- Neologism dictionary based on the language resources on the Web for mecab-unidic☆87Sep 14, 2020Updated 5 years ago
- Javascript library for detecting and transforming between Hiragana, Katakana, and Romaji☆923Sep 10, 2025Updated 8 months ago
- A Japanese NLP Library using spaCy as framework based on Universal Dependencies☆851Mar 30, 2024Updated 2 years ago
- Lightweight converter from Japanese Kana-kanji sentences into Kana-Roman.☆453Apr 26, 2026Updated last month
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- JMdict, JMnedict, Kanjidic, KRADFILE/RADKFILE in JSON format☆364May 18, 2026Updated last week
- A library that helps conjugate Japanese verbs. This repo contains two Qt projects: libjpconj witch is the library, and jpconj implements …☆12Aug 29, 2017Updated 8 years ago
- mecab-python. you can find original version here//taku910.github.io/mecab/☆582Nov 25, 2025Updated 6 months ago
- Swift port (well, wrapper) of WanaKana.js☆13Jan 4, 2021Updated 5 years ago
- 🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.☆261May 19, 2026Updated last week
- Trying to consolidate japanese phonetic, and in particular pitch accent resources into one list☆126Feb 10, 2024Updated 2 years ago
- Unidic packaged for installation via pip.☆109Feb 26, 2025Updated last year
- The ultimate kanji resource☆331Jun 16, 2024Updated last year
- Python version of Sudachi, a Japanese tokenizer.☆436Oct 7, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- A Japanese tokenizer and morphological analysis engine written in Kotlin☆59Aug 30, 2020Updated 5 years ago
- tokenizer specified for Japanese☆51Apr 20, 2021Updated 5 years ago
- A morphological analyzer with a small memory footprint.☆19Dec 2, 2011Updated 14 years ago
- Japanese analyzer uses kuromoji japanese tokenizer for ElasticSearch☆29Feb 11, 2020Updated 6 years ago
- A self-contained morphological analyzer (including dictionary data).☆33Jul 30, 2015Updated 10 years ago
- A Japanese tokenizer based on recurrent neural networks☆417May 6, 2026Updated 3 weeks ago
- A linguistic framework that's easy to use.☆231Apr 3, 2026Updated last month