KoichiYasuoka/SuPar-UniDic

Tokenizer, POS-tagger, lemmatizer, and dependency-parser for modern and contemporary Japanese with BERT models

☆21

Alternatives and similar repositories for SuPar-UniDic

Users interested in SuPar-UniDic are comparing it to the libraries listed below.

akirakubo / bert-japanese-aozora
Japanese BERT trained on Aozora Bunko and Wikipedia, pre-tokenized by MeCab with UniDic & SudachiPy
☆40 · Aug 8, 2020 · Updated 5 years ago
lighttransport / jagger-python
Python binding for Jagger (C++ implementation of Pattern-based Japanese Morphological Analyzer)
☆13 · Dec 16, 2025 · Updated 6 months ago
KoichiYasuoka / UniDic2UD
Tokenizer, POS-tagger, lemmatizer, and dependency-parser for modern and contemporary Japanese
☆38 · Dec 29, 2025 · Updated 6 months ago
SmashinFries / PyKatsuyou
Japanese verb/adjective inflections tool
☆13 · Mar 10, 2025 · Updated last year
nobu-g / cohesion-analysis
Code for COLING 2020 Paper
☆13 · Feb 3, 2026 · Updated 5 months ago
ujiuji1259 / shinra-attribute-extraction
☆11 · Sep 7, 2021 · Updated 4 years ago
jcsirot / anki-simple-furigana
Anki add-on providing support for adding or removing furigana on Japanese text
☆11 · Jan 7, 2022 · Updated 4 years ago
utanaka2000 / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆25 · Mar 16, 2021 · Updated 5 years ago
singletongue / wikipedia-utils
Utility scripts for preprocessing Wikipedia texts for NLP
☆78 · Apr 9, 2024 · Updated 2 years ago
ishiko732 / WordSearch
Parses dictionaries for word meanings and provides word libraries for Anki's Fast Words Query plugin
☆12 · Apr 1, 2021 · Updated 5 years ago
yahoojapan / VFD-Dataset
☆11 · Nov 10, 2020 · Updated 5 years ago
ou-medinfo / medbertjp
Trials of pre-trained BERT models for the medical domain in Japanese.
☆13 · Nov 21, 2020 · Updated 5 years ago
ikegami-yukino / mozcpy
Kana-Kanji converter using Mozc dictionary
☆49 · Feb 14, 2025 · Updated last year
wtetsu / deinja
🌸 De-inflect Japanese words
☆16 · Nov 24, 2025 · Updated 7 months ago
HojiChar / HojiChar
The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.
☆127 · Apr 10, 2026 · Updated 3 months ago
kaniblu / hanja-tagger
Automatic Korean Hanja tagging tool powered by Hanjaro (hanjaro.juntong.or.kr)
☆19 · Feb 22, 2019 · Updated 7 years ago
himkt / awesome-bert-japanese
📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information
☆132 · Mar 15, 2023 · Updated 3 years ago
jekovcar / mdict-utils-Gui
Windows graphical user interface for mdict-utils
☆15 · Apr 6, 2025 · Updated last year
DeepApps91 / Kindai-OCR
OCR system for recognizing modern Japanese magazines
☆153 · Jul 12, 2023 · Updated 3 years ago
yagays / nayose-wikipedia-ja
Japanese name-matching (nayose) dataset created from Wikipedia
☆35 · Mar 10, 2020 · Updated 6 years ago
takapy0210 / geek_blog
Repository for the code and utility scripts introduced on the tech blog https://www.takapy.work/
☆14 · Dec 4, 2025 · Updated 7 months ago
daac-tools / python-vaporetto
🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. (Python wrapper)
☆21 · May 30, 2026 · Updated last month
taishi-i / toiro
A tool for comparing tokenizers
☆122 · Nov 9, 2025 · Updated 8 months ago
megagonlabs / bunkai
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
☆199 · Mar 26, 2024 · Updated 2 years ago
sarulab-speech / Coco-Nut
Coco-Nut (Corpus of connecting NIHONGO utterance and text) corpus
☆20 · Jun 12, 2024 · Updated 2 years ago
colorfulscoop / sbert-ja
Code to train Sentence BERT Japanese model for Hugging Face Model Hub
☆11 · Aug 8, 2021 · Updated 4 years ago
asakura-data-science / finance
☆21 · Feb 28, 2022 · Updated 4 years ago
polm / fugashi
A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.
☆531 · Oct 24, 2025 · Updated 8 months ago
ids-cv / wrime
☆177 · Sep 11, 2025 · Updated 10 months ago
WorksApplications / chikkarpy
Japanese synonym library
☆55 · Feb 7, 2022 · Updated 4 years ago
megagonlabs / instruction_ja
Japanese instruction data (日本語指示データ)
☆24 · Jul 13, 2023 · Updated 2 years ago
yomidevs / yomitan-api
Native messaging component for https://github.com/yomidevs/yomitan
☆61 · Jun 28, 2026 · Updated 2 weeks ago
k-takano0423 / BiClass-Definition-Generator
☆11 · Oct 20, 2024 · Updated last year
facebookresearch / romqa
A Benchmark for Robust, Multi-evidence, Multi-answer Question Answering
☆18 · Jan 7, 2023 · Updated 3 years ago
retarfi / language-pretraining
Pre-training Language Models for Japanese
☆50 · Jul 2, 2023 · Updated 3 years ago
intellygenta / InteractiveParallelCoordinates
Python code for interactive parallel coordinates visualization in Jupyter Notebook.
☆12 · Sep 8, 2019 · Updated 6 years ago
ikawaha / kagome-dict
Dictionary Library for Kagome v2
☆15 · Jun 9, 2026 · Updated last month
Ino-Ichan / AIMedical2021-2nd
☆11 · Mar 27, 2021 · Updated 5 years ago
tkuri / albumentations_test
albumentations test
☆11 · Jun 23, 2020 · Updated 6 years ago