WorksApplications/chikkarpy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/WorksApplications/chikkarpy)

WorksApplications / chikkarpy

Japanese synonym library

☆55

Alternatives and similar repositories for chikkarpy

Users that are interested in chikkarpy are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WorksApplications / ViSudachi
View on GitHub
A tool for visualizing the internal structures of morphological analyzer Sudachi
☆18Jun 9, 2022Updated 4 years ago
megagonlabs / ginza-transformers
View on GitHub
Use custom tokenizers in spacy-transformers
☆16Aug 9, 2022Updated 3 years ago
megagonlabs / bunkai
View on GitHub
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
☆200Mar 26, 2024Updated 2 years ago
WorksApplications / SudachiTra
View on GitHub
Japanese tokenizer for Transformers
☆80Dec 15, 2023Updated 2 years ago
daac-tools / vaporetto
View on GitHub
🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer
☆294Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ku-nlp / kwja
View on GitHub
An integrated Japanese analyzer based on foundation models
☆145Updated this week
yagays / ja-timex
View on GitHub
自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器
☆141Feb 27, 2025Updated last year
eggplants / awesome-japanese-censored-words
View on GitHub
Awesome List of Sources of Japanese Censored Words
☆19Sep 11, 2022Updated 3 years ago
stockmarkteam / ner-wikipedia-dataset
View on GitHub
Wikipediaを用いた日本語の固有表現抽出データセット
☆143Sep 2, 2023Updated 2 years ago
chakki-works / Japanese-Company-Lexicon
View on GitHub
☆99Jul 23, 2023Updated 2 years ago
jojonki / Taiyaki
View on GitHub
PythonとCythonで出来てる日本語形態素解析エンジン🚧
☆13Dec 4, 2019Updated 6 years ago
kzinmr / transformers_ner_ja
View on GitHub
Japanese NER with Transformers + PyTorch-Lightning + MLflow Tracking
☆15Nov 20, 2022Updated 3 years ago
taishi-i / toiro
View on GitHub
A tool for comparing tokenizers
☆122Nov 9, 2025Updated 8 months ago
sonoisa / t5-japanese
View on GitHub
日本語T5モデル
☆118Sep 15, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
takuyaa / yada
View on GitHub
Yada is a yet another double-array trie library aiming for fast search and compact data representation.
☆48Jun 7, 2026Updated last month
WorksApplications / SudachiDict
View on GitHub
A lexicon for Sudachi
☆301Apr 30, 2026Updated 2 months ago
kajyuuen / funer
View on GitHub
Funer is Rule based Named Entity Recognition tool.
☆22Apr 21, 2022Updated 4 years ago
kampersanda / sif-embedding
View on GitHub
Rust implementation of SIF and uSIF: Simple and fast sentence embedding
☆19Jan 22, 2025Updated last year
daac-tools / find-simdoc
View on GitHub
Finding all pairs of similar documents time- and memory-efficiently
☆62Mar 13, 2025Updated last year
uribo / bucky
View on GitHub
Helpers for literature management as GitHub actions
☆13May 7, 2021Updated 5 years ago
chemicaltree / tetra
View on GitHub
☆10Sep 14, 2022Updated 3 years ago
laboroai / Laboro-ParaCorpus
View on GitHub
Scripts for creating a Japanese-English parallel corpus and training NMT models
☆19Nov 9, 2021Updated 4 years ago
polm / ipadic-py
View on GitHub
IPAdic packaged for easy use from Python.
☆24Oct 31, 2021Updated 4 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
colorfulscoop / sbert-ja
View on GitHub
Code to train Sentence BERT Japanese model for Hugging Face Model Hub
☆11Aug 8, 2021Updated 4 years ago
oreilly-japan / building-search-app-w-ml
View on GitHub
『機械学習による検索ランキング改善ガイド』のサンプルコードのリポジトリ
☆23Aug 3, 2023Updated 2 years ago
Katsumata420 / wikihow_japanese
View on GitHub
☆35Dec 17, 2020Updated 5 years ago
WorksApplications / chiVe
View on GitHub
Japanese word embedding with Sudachi and NWJC 🌿
☆177Mar 1, 2024Updated 2 years ago
osuossu8 / CommonLitReadabilityPrize
View on GitHub
☆14Aug 3, 2021Updated 4 years ago
verypluming / JSICK
View on GitHub
Repository for JSICK
☆46May 31, 2023Updated 3 years ago
octanove / shiba
View on GitHub
Pytorch implementation and pre-trained Japanese model for CANINE, the efficient character-level transformer.
☆89Nov 3, 2023Updated 2 years ago
wareya / notmecab-rs
View on GitHub
notmecab-rs is a very basic mecab clone, designed only to do parsing, not training.
☆18Jul 25, 2020Updated 5 years ago
p-geon / DropoutCheatSheet
View on GitHub
☆33Apr 27, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
glassmonkey / seimei
View on GitHub
☆17Jul 17, 2023Updated 3 years ago
yoichi1484 / subspace
View on GitHub
An implementation of "Subspace Representations for Soft Set Operations and Sentence Similarities" (NAACL 2024)
☆10May 31, 2024Updated 2 years ago
altescy / colt
View on GitHub
🐎 Colt: Effortlessly configure and construct Python objects with colt, a lightweight library inspired by AllenNLP and Tango
☆26Jul 13, 2026Updated last week
tatHi / optok
View on GitHub
☆10Aug 26, 2021Updated 4 years ago
yagays / nayose-wikipedia-ja
View on GitHub
Wikipediaから作成した日本語名寄せデータセット
☆35Mar 10, 2020Updated 6 years ago
megagonlabs / ginza
View on GitHub
A Japanese NLP Library using spaCy as framework based on Universal Dependencies
☆862Jul 10, 2026Updated last week
singletongue / wikipedia-utils
View on GitHub
Utility scripts for preprocessing Wikipedia texts for NLP
☆78Apr 9, 2024Updated 2 years ago