tmu-nlp/simple-jppdb

A paraphrase database for Japanese text simplification

☆32

Alternatives and similar repositories for simple-jppdb

Users who are interested in simple-jppdb are comparing it to the repositories listed below.

skozawa / Comainu
COrpus based Morphological Analyzer with INtegrated User dictionary
☆21 · Mar 30, 2025 · Updated last year
ikegami-yukino / pymlask
Emotion analyzer for Japanese text
☆118 · Jul 25, 2024 · Updated 2 years ago
tmu-nlp / JapaneseWordSimilarityDataset
Japanese Word Similarity Dataset
☆103 · Dec 7, 2021 · Updated 4 years ago
ikegami-yukino / zunda-python
Zunda: Japanese Enhanced Modality Analyzer client for Python.
☆10 · Nov 30, 2019 · Updated 6 years ago
nandenjin / itfdic
A localized word dictionary asset for University of Tsukuba
☆12 · Sep 19, 2025 · Updated 10 months ago
wwwcojp / ja_sentence_segmenter
Japanese sentence segmentation library for Python
☆75 · Jul 18, 2026 · Updated last week
Takeuchi-Lab-LM / python_asa
Python version of the Japanese semantic role labeling system (ASA)
☆22 · Nov 11, 2022 · Updated 3 years ago
hiroki13 / neural-pasa-system
☆13 · Apr 23, 2017 · Updated 9 years ago
ikegami-yukino / asa-python
Japanese Argument Structure Analyzer (ASA) client for Python
☆11 · Feb 16, 2019 · Updated 7 years ago
hiroshi-manabe / CRFSegmenter
A multi-language segmenter using high-order CRF.
☆17 · Feb 27, 2020 · Updated 6 years ago
TEI-EAJ / jp_guidelines
Explains, in Japanese, how to conform to the TEI Guidelines.
☆12 · Feb 15, 2021 · Updated 5 years ago
yagays / swem
Python implementation of SWEM (Simple Word-Embedding-based Methods)
☆30 · Jun 21, 2022 · Updated 4 years ago
musyoku / hpylm
C++ implementation of HPYLM
☆11 · May 2, 2017 · Updated 9 years ago
yagays / alacarte_embedding
Python implementation of A La Carte Embedding
☆10 · Dec 7, 2018 · Updated 7 years ago
HojiChar / HojiChar
The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.
☆128 · Jul 17, 2026 · Updated last week
youichiro / transformer-copy
Japanese grammatical error correction tool
☆29 · Jun 22, 2022 · Updated 4 years ago
knok / make-meidai-dialogue
Get Japanese dialogue corpus
☆40 · Sep 28, 2017 · Updated 8 years ago
ikegami-yukino / rakutenma-python
Rakuten MA (Python version)
☆23 · May 22, 2017 · Updated 9 years ago
upura / papers
What I read
☆23 · Jun 15, 2018 · Updated 8 years ago
yagays / nayose-wikipedia-ja
Japanese name-matching (nayose) dataset built from Wikipedia
☆35 · Mar 10, 2020 · Updated 6 years ago
ku-nlp / AnnotatedFKCCorpus
Annotated Fuman Kaitori Center Corpus
☆18 · Dec 18, 2023 · Updated 2 years ago
ku-nlp / text-cleaning
A powerful text cleaner for Japanese web texts
☆12 · Jan 20, 2024 · Updated 2 years ago
megagonlabs / ebe-dataset
Evidence-based Explanation Dataset (AACL-IJCNLP 2020)
☆18 · Dec 17, 2020 · Updated 5 years ago
inspection-ai / japanese-toxic-dataset
☆22 · Jan 11, 2023 · Updated 3 years ago
ku-nlp / KWDLC
Kyoto University Web Document Leads Corpus
☆84 · Dec 18, 2023 · Updated 2 years ago
ku-nlp / kwja
An integrated Japanese analyzer based on foundation models
☆145 · Jul 18, 2026 · Updated last week
gotutiyan / gec-metrics
A library for evaluation of Grammatical Error Correction (GEC). Accepted to ACL'25 Demo: "gec-metrics: A Unified Library for Grammatical …
☆14 · Jan 25, 2026 · Updated 6 months ago
tmu-nlp / sscorpus
A monolingual parallel corpus for sentence simplification
☆11 · Jul 4, 2016 · Updated 10 years ago
megagonlabs / UD_Japanese-GSD
Japanese data from the Google UDT 2.0.
☆28 · Mar 24, 2023 · Updated 3 years ago
katryo / wordnet_python
A wrapper for working with the Japanese WordNet in Python
☆26 · Jan 20, 2014 · Updated 12 years ago
ikegami-yukino / sengiri
Yet another sentence-level tokenizer for Japanese text
☆24 · Nov 27, 2025 · Updated 7 months ago
megagonlabs / bunkai
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
☆200 · Mar 26, 2024 · Updated 2 years ago
tuem / resembla
☆74 · Aug 3, 2025 · Updated 11 months ago
1never / open2ch-dialogue-corpus
Dialogue corpus created by crawling Open2ch (おーぷん2ちゃんねる)
☆101 · Jun 6, 2021 · Updated 5 years ago
megagonlabs / asdc
Accommodation Search Dialog Corpus (宿泊施設探索対話コーパス)
☆25 · Jan 19, 2024 · Updated 2 years ago
ku-nlp / jumanpp
Juman++ (a Morphological Analyzer Toolkit)
☆414 · Apr 17, 2026 · Updated 3 months ago
icoxfog417 / yans-2019-annotation-hackathon
Yans2019 Annotation hackathon
☆14 · May 22, 2023 · Updated 3 years ago
ikegami-yukino / oseti
Dictionary based Sentiment Analysis for Japanese
☆99 · Aug 2, 2025 · Updated 11 months ago
buruzaemon / natto-py
natto-py combines the Python programming language with MeCab, the part-of-speech and morphological analyzer for the Japanese language.
☆95 · Jun 6, 2024 · Updated 2 years ago