ku-nlp/kwja

An integrated Japanese analyzer based on foundation models

☆145

Alternatives and similar repositories for kwja

Users that are interested in kwja are comparing it to the libraries listed below.

ku-nlp / bertknp
A Japanese dependency parser based on BERT
☆23 · Oct 26, 2022 · Updated 3 years ago
nobu-g / cohesion-analysis
Code for COLING 2020 Paper
☆13 · Feb 3, 2026 · Updated 5 months ago
ku-nlp / rhoknp
Yet another Python binding for Juman++/KNP/KWJA
☆40 · Jul 15, 2026 · Updated last week
ku-nlp / KWDLC
Kyoto University Web Document Leads Corpus
☆84 · Dec 18, 2023 · Updated 2 years ago
nobu-g / JGLUE-evaluation-scripts
Training and evaluation scripts for JGLUE, a Japanese language understanding benchmark
☆18 · Updated this week
himkt / awesome-bert-japanese
📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information
☆132 · Mar 15, 2023 · Updated 3 years ago
WorksApplications / chikkarpy
Japanese synonym library
☆55 · Feb 7, 2022 · Updated 4 years ago
chakki-works / Japanese-Company-Lexicon
☆99 · Jul 23, 2023 · Updated 3 years ago
kajyuuen / daaja
This repository has implementations of data augmentation for NLP for Japanese.
☆64 · Feb 16, 2023 · Updated 3 years ago
aiishii / JEMHopQA
☆30 · Apr 10, 2025 · Updated last year
osekilab / JCoLA
☆19 · Apr 21, 2026 · Updated 3 months ago
megagonlabs / jrte-corpus
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
☆77 · Jun 23, 2023 · Updated 3 years ago
yahoojapan / JGLUE
JGLUE: Japanese General Language Understanding Evaluation
☆346 · Mar 31, 2025 · Updated last year
teaspn / teaspn-server
A sample implementation of the TEASPN server
☆18 · Oct 31, 2019 · Updated 6 years ago
nlp-waseda / comet-atomic-ja
COMET-ATOMIC ja
☆31 · Mar 8, 2024 · Updated 2 years ago
ku-nlp / KyotoCorpus
Kyoto University Text Corpus
☆71 · Jul 14, 2023 · Updated 3 years ago
daac-tools / find-simdoc
Finding all pairs of similar documents time- and memory-efficiently
☆62 · Mar 13, 2025 · Updated last year
ku-nlp / ja-vicuna-qa-benchmark
☆33 · Jul 31, 2024 · Updated last year
HojiChar / HojiChar
The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.
☆128 · Jul 17, 2026 · Updated last week
nttcslab / japanese-dialog-transformers
Code for evaluating Japanese pretrained models provided by NTT Ltd.
☆246 · Jun 21, 2023 · Updated 3 years ago
nandenjin / itfdic
A localized word dictionary asset for University of Tsukuba
☆12 · Sep 19, 2025 · Updated 10 months ago
ku-nlp / knp
A Japanese Parser
☆34 · Nov 1, 2023 · Updated 2 years ago
yagays / nayose-wikipedia-ja
A Japanese name-matching (nayose) dataset built from Wikipedia
☆35 · Mar 10, 2020 · Updated 6 years ago
himkt / konoha
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
☆263 · Jul 19, 2026 · Updated last week
wwwcojp / ja_sentence_segmenter
Japanese sentence segmentation library for Python
☆75 · Updated this week
megagonlabs / bunkai
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
☆200 · Mar 26, 2024 · Updated 2 years ago
taishi-i / toiro
A tool for comparing tokenizers
☆122 · Nov 9, 2025 · Updated 8 months ago
verypluming / JSICK
Repository for JSICK
☆46 · May 31, 2023 · Updated 3 years ago
stockmarkteam / ner-wikipedia-dataset
A Japanese named entity recognition dataset built using Wikipedia
☆143 · Sep 2, 2023 · Updated 2 years ago
sbintuitions / JMTEB
The evaluation scripts of JMTEB (Japanese Massive Text Embedding Benchmark)
☆93 · Mar 16, 2026 · Updated 4 months ago
conditional / jawikify
Software for wikification of Japanese text
☆17 · Mar 14, 2017 · Updated 9 years ago
izuna385 / Wikia-and-Wikipedia-EL-Dataset-Creator
You can create datasets from Wikia/Wikipedia that can be used for entity recognition and Entity Linking. Dumps for ja-wiki and VTuber-wik…
☆18 · May 2, 2021 · Updated 5 years ago
chakki-works / entitypedia
Entitypedia is an Extended Named Entity Dictionary from Wikipedia.
☆13 · Dec 7, 2022 · Updated 3 years ago
ku-nlp / AnnotatedFKCCorpus
Annotated Fuman Kaitori Center Corpus
☆18 · Dec 18, 2023 · Updated 2 years ago
taishi-i / awesome-japanese-nlp-resources
A curated list of resources for Japanese natural language processing (NLP): Python libraries, LLMs, dictionaries, corpora, and datasets. …
☆999 · Updated this week
ku-nlp / text-cleaning
A powerful text cleaner for Japanese web texts
☆12 · Jan 20, 2024 · Updated 2 years ago
daac-tools / vaporetto
🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer
☆297 · Updated this week
megagonlabs / UD_Japanese-GSD
Japanese data from the Google UDT 2.0.
☆28 · Mar 24, 2023 · Updated 3 years ago
ku-nlp / kyoto-reader
A processor for KyotoCorpus, KWDLC, and AnnotatedFKCCorpus
☆10 · Jun 26, 2024 · Updated 2 years ago