Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese with BERT models
☆20Feb 28, 2026Updated last week
Alternatives and similar repositories for SuPar-UniDic
Users that are interested in SuPar-UniDic are comparing it to the libraries listed below
Sorting:
- Japanese verb/adjective inflections tool☆12Mar 10, 2025Updated 11 months ago
- Japanese BERT trained on Aozora Bunko and Wikipedia, pre-tokenized by MeCab with UniDic & SudachiPy☆40Aug 8, 2020Updated 5 years ago
- Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese☆38Dec 29, 2025Updated 2 months ago
- Anki add-on providing support for adding or removing furigana on Japanese text☆11Jan 7, 2022Updated 4 years ago
- 🌸De-inflect Japanese words☆15Nov 24, 2025Updated 3 months ago
- Kana-Kanji converter using Mozc dictionary☆46Feb 14, 2025Updated last year
- ☆11Sep 7, 2021Updated 4 years ago
- Trials of pre-trained BERT models for the medical domain in Japanese.☆12Nov 21, 2020Updated 5 years ago
- Facebook AI Research Sequence-to-Sequence Toolkit written in Python.☆25Mar 16, 2021Updated 4 years ago
- Windows graphic user interface for mdict-utils☆13Apr 6, 2025Updated 11 months ago
- Automatic Korean Hanja tagging tool powered by Hanjaro (hanjaro.juntong.or.kr)☆18Feb 22, 2019Updated 7 years ago
- A tool for comparing tokenizers☆121Nov 9, 2025Updated 4 months ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Apr 9, 2024Updated last year
- ☆21Feb 28, 2022Updated 4 years ago
- ☆24Aug 14, 2025Updated 6 months ago
- Native messaging component for https://github.com/yomidevs/yomitan☆42Mar 1, 2026Updated last week
- An easy to use tokenizer for Japanese text, aimed at language learners and non-linguists☆25Nov 21, 2021Updated 4 years ago
- 🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.☆21Jun 1, 2025Updated 9 months ago
- Monorepo for Kanji, Furigana, Japanese DB, and others☆62Mar 5, 2023Updated 3 years ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆125Nov 13, 2025Updated 3 months ago
- A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.☆511Oct 24, 2025Updated 4 months ago
- 译者编程进阶指南☆14Jan 21, 2024Updated 2 years ago
- 日本語文法誤り訂正ツール☆29Jun 22, 2022Updated 3 years ago
- 桌面划词翻译/查词工具,支持Windows和Linux,支持多种词典,支持将单词添加到Anki。☆33Apr 13, 2023Updated 2 years ago
- Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)☆199Mar 26, 2024Updated last year
- Computation Graph framework implemented using only NumPy☆10Mar 31, 2024Updated last year
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- Japanese data from the Google UDT 2.0.☆38Nov 12, 2025Updated 3 months ago
- Wikipediaから作成した日本語名寄せデータセット☆35Mar 10, 2020Updated 5 years ago
- ☆86Nov 4, 2023Updated 2 years ago
- 文庫本スタイルのゲラをテキストファイルから作る、github actionsのワークフローです。☆11Sep 29, 2021Updated 4 years ago
- Repository for the course "JavaScript Object Oriented Programming"☆11Jun 30, 2019Updated 6 years ago
- Karaoke lyrics plugins for Flutter☆10Apr 30, 2022Updated 3 years ago
- A dependency visualizer for Japanese to help beginners deconstruct complex sentences. Also my first Vue 3 project c:☆11Jun 20, 2021Updated 4 years ago
- Multi-view learning approaches for stock return prediction with tweets.☆11Jun 17, 2020Updated 5 years ago
- 使用Puppeteer快速导出QQ阅读的内容到TXT内 Quickly export the content of book.qq.com (QQ Reading) to TXT by using puppeteer 微信读书太难导出了 不妨试试平替QQ阅读🤓☆17Sep 4, 2025Updated 6 months ago
- ☆10Sep 15, 2024Updated last year
- ボケて電笑戦 (bokete DENSHOSEN) Workshop☆43May 16, 2022Updated 3 years ago
- ☆161Oct 19, 2020Updated 5 years ago