This repository is archived! The maintained MeCab can be found https://github.com/shogo82148/mecab
☆272Oct 15, 2024Updated last year
Alternatives and similar repositories for mecab
Users that are interested in mecab are comparing it to the libraries listed below
Sorting:
- natto-py combines the Python programming language with MeCab, the part-of-speech and morphological analyzer for the Japanese language.☆95Jun 6, 2024Updated last year
- mecab-python. you can find original version here//taku910.github.io/mecab/☆579Nov 25, 2025Updated 3 months ago
- Yet another sentence-level tokenizer for the Japanese text☆24Nov 27, 2025Updated 3 months ago
- Annotated Fuman Kaitori Center Corpus☆18Dec 18, 2023Updated 2 years ago
- Zunda: Japanese Enhanced Modality Analyzer client for Python.☆10Nov 30, 2019Updated 6 years ago
- Japanese text normalizer for mecab-neologd☆287Dec 2, 2025Updated 3 months ago
- A lexicon for Sudachi☆283Jan 20, 2026Updated 2 months ago
- ☆10Aug 13, 2012Updated 13 years ago
- Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"☆21Jul 10, 2023Updated 2 years ago
- A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.☆515Oct 24, 2025Updated 4 months ago
- Kana-Kanji converter using Mozc dictionary☆47Feb 14, 2025Updated last year
- Japanese morphological analysis engine written in pure Python☆907Oct 13, 2025Updated 5 months ago
- Python version of Sudachi, a Japanese tokenizer.☆429Oct 7, 2022Updated 3 years ago
- lists of text corpus and more (mainly Japanese)☆118Jul 25, 2024Updated last year
- 🧨 Japanese Sentence Breaker 🧨☆14Jun 6, 2021Updated 4 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆19Feb 28, 2026Updated 3 weeks ago
- A Japanese Tokenizer for Business☆948Jun 17, 2025Updated 9 months ago
- Japanese word embedding with Sudachi and NWJC 🌿☆171Mar 1, 2024Updated 2 years ago
- Mecab + NEologd + Docker + Python3☆36May 10, 2022Updated 3 years ago
- Neologism dictionary based on the language resources on the Web for mecab-ipadic☆2,785Dec 27, 2023Updated 2 years ago
- A Japanese NLP Library using spaCy as framework based on Universal Dependencies☆837Mar 30, 2024Updated last year
- Async incremental migemo search.☆13May 4, 2015Updated 10 years ago
- ☆28Apr 5, 2022Updated 3 years ago
- Kyoto University Text Corpus☆70Jul 14, 2023Updated 2 years ago
- Japanese BERT trained on Aozora Bunko and Wikipedia, pre-tokenized by MeCab with UniDic & SudachiPy☆40Aug 8, 2020Updated 5 years ago
- Use custom tokenizers in spacy-transformers☆16Aug 9, 2022Updated 3 years ago
- Testing of Neural Topic Modeling for Japanese articles☆13Jul 24, 2019Updated 6 years ago
- Emotion analyzer for Japanese text☆116Jul 25, 2024Updated last year
- ☆161Oct 19, 2020Updated 5 years ago
- Japanese data from the Google UDT 2.0.☆28Mar 24, 2023Updated 2 years ago
- A tool for visualizing the internal structures of morphological analyzer Sudachi☆18Jun 9, 2022Updated 3 years ago
- A package that can be locally executed to generate minutes in Japanese☆10Sep 11, 2023Updated 2 years ago
- Flatten nested iterable object for Python (Pure-Python implementation)☆28Aug 15, 2025Updated 7 months ago
- A localized word dictionary asset for University of Tsukuba☆12Sep 19, 2025Updated 6 months ago
- A single-document summarizer in JavaScript.☆20Mar 21, 2017Updated 8 years ago
- 形態素解析したtokenからネガティブ/ポジティブを判定したスコアを返すJavaScriptライブラリ☆27Jan 7, 2017Updated 9 years ago
- Juman++ (a Morphological Analyzer Toolkit)☆409Oct 3, 2023Updated 2 years ago
- Pure-Python Japanese character interconverter for Hiragana, Katakana, Hankaku, and Zenkaku☆342Feb 8, 2026Updated last month
- Python port of Igo Japanese morphological analyzer☆18Sep 22, 2018Updated 7 years ago