PKSHATechnology-Research/camphr

PKSHATechnology-Research / camphr

Camphr - NLP library for creating pipeline components

☆336

Alternatives and similar repositories for camphr

Users that are interested in camphr are comparing it to the libraries listed below.

yagays / nayose-wikipedia-ja
A Japanese entity-name normalization (nayose) dataset built from Wikipedia
☆35 · Mar 10, 2020 · Updated 6 years ago
himkt / awesome-bert-japanese
📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information
☆132 · Mar 15, 2023 · Updated 3 years ago
taishi-i / toiro
A tool for comparing tokenizers
☆122 · Nov 9, 2025 · Updated 8 months ago
chakki-works / Japanese-Company-Lexicon
☆99 · Jul 23, 2023 · Updated 3 years ago
himkt / konoha
🌿 An easy-to-use Japanese Text Processing tool, which makes it possible to switch tokenizers with small changes of code.
☆263 · Updated this week
ikegami-yukino / neologdn
Japanese text normalizer for mecab-neologd
☆289 · May 6, 2026 · Updated 2 months ago
chakki-works / chariot
Deliver the ready-to-train data to your NLP model.
☆123 · Jul 15, 2022 · Updated 4 years ago
megagonlabs / ginza
A Japanese NLP Library using spaCy as framework based on Universal Dependencies
☆862 · Jul 10, 2026 · Updated 2 weeks ago
ku-nlp / knp
A Japanese Parser
☆34 · Nov 1, 2023 · Updated 2 years ago
ikegami-yukino / sengiri
Yet another sentence-level tokenizer for the Japanese text
☆24 · Nov 27, 2025 · Updated 7 months ago
UniversalDependencies / UD_Japanese-GSD
Japanese data from the Google UDT 2.0.
☆40 · May 6, 2026 · Updated 2 months ago
yagays / ja-timex
A rule-based analyzer that extracts and normalizes temporal expressions written in natural language
☆141 · Feb 27, 2025 · Updated last year
ku-nlp / KWDLC
Kyoto University Web Document Leads Corpus
☆84 · Dec 18, 2023 · Updated 2 years ago
p-geon / DropoutCheatSheet
☆33 · Apr 27, 2020 · Updated 6 years ago
conditional / jawikify
Software for wikification of Japanese text
☆17 · Mar 14, 2017 · Updated 9 years ago
PKSHATechnology-Research / tdmelodic
A Japanese accent dictionary generator
☆126 · Mar 21, 2024 · Updated 2 years ago
taishi-i / nagisa
A Japanese tokenizer based on recurrent neural networks
☆418 · Jul 6, 2026 · Updated 2 weeks ago
megagonlabs / bunkai
Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)
☆200 · Mar 26, 2024 · Updated 2 years ago
pfnet / pysen
Python linting made easy. Also a casual yet honorific way to address individuals who have entered an organization prior to you.
☆492 · Updated this week
daac-tools / vaporetto
🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer
☆296 · Updated this week
aistairc / trf
This is the repository for TRF (text readability features) publication.
☆37 · Aug 27, 2019 · Updated 6 years ago
ymym3412 / acl-papers
paper summary of Association for Computational Linguistics
☆185 · Sep 16, 2019 · Updated 6 years ago
chakki-works / chABSA-dataset
chakki's Aspect-Based Sentiment Analysis dataset
☆142 · Feb 25, 2022 · Updated 4 years ago
megagonlabs / jrte-corpus
Japanese Realistic Textual Entailment Corpus (NLP 2020, LREC 2020)
☆77 · Jun 23, 2023 · Updated 3 years ago
ku-nlp / bertknp
A Japanese dependency parser based on BERT
☆23 · Oct 26, 2022 · Updated 3 years ago
BandaiNamcoResearchInc / DistilBERT-base-jp
☆161 · Oct 19, 2020 · Updated 5 years ago
buruzaemon / natto-py
natto-py combines the Python programming language with MeCab, the part-of-speech and morphological analyzer for the Japanese language.
☆95 · Jun 6, 2024 · Updated 2 years ago
chemicaltree / tetra
☆10 · Sep 14, 2022 · Updated 3 years ago
WorksApplications / ViSudachi
A tool for visualizing the internal structures of morphological analyzer Sudachi
☆18 · Jun 9, 2022 · Updated 4 years ago
cl-tohoku / bert-japanese
BERT models for Japanese text.
☆550 · Mar 23, 2024 · Updated 2 years ago
ikegami-yukino / zunda-python
Zunda: Japanese Enhanced Modality Analyzer client for Python.
☆10 · Nov 30, 2019 · Updated 6 years ago
ujiuji1259 / shinra-attribute-extraction
☆11 · Sep 7, 2021 · Updated 4 years ago
altescy / colt
🐎 Colt: Effortlessly configure and construct Python objects with colt, a lightweight library inspired by AllenNLP and Tango
☆26 · Jul 13, 2026 · Updated last week
yahoojapan / JGLUE
JGLUE: Japanese General Language Understanding Evaluation
☆346 · Mar 31, 2025 · Updated last year
tatHi / optok
☆10 · Aug 26, 2021 · Updated 4 years ago
explosion / spacy-transformers
🛸 Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
☆1,408 · Mar 27, 2026 · Updated 3 months ago
wwwcojp / ja_sentence_segmenter
Japanese sentence segmentation library for Python
☆75 · Updated this week
explosion / spacy-stanza
💥 Use the latest Stanza (StanfordNLP) research models directly in spaCy
☆747 · Aug 15, 2024 · Updated last year
WorksApplications / SudachiPy
Python version of Sudachi, a Japanese tokenizer.
☆442 · Oct 7, 2022 · Updated 3 years ago