nlp-waseda / Kanbun-LM
Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"
☆16Updated last year
Alternatives and similar repositories for Kanbun-LM:
Users that are interested in Kanbun-LM are comparing it to the libraries listed below
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆50Updated 2 weeks ago
- A powerful text cleaner for Japanese web texts☆12Updated 11 months ago
- A small version of UniDic for easy pip installs.☆41Updated 4 years ago
- Unidic packaged for installation via pip.☆83Updated last year
- Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese with BERT models☆17Updated 6 months ago
- Yet another Python binding for Juman++/KNP/KWJA☆31Updated 3 months ago
- ☆21Updated 2 months ago
- 全国書誌データから作成した振り仮名のデータセット☆23Updated 3 years ago
- ☆9Updated 4 months ago
- 日本語文法誤り訂正ツール☆28Updated 2 years ago
- ☆28Updated 2 months ago
- 日本語マルチタスク言語理解ベンチマーク Japanese Massive Multitask Language Understanding Benchmark☆28Updated last month
- Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese☆34Updated last month
- python版日本語意味役割付与システム(ASA)☆23Updated 2 years ago
- Classical Chinese to Modern Japanese Translator☆27Updated last year
- Utility scripts for preprocessing Wikipedia texts for NLP☆75Updated 9 months ago
- ☆47Updated last year
- Kyoto University Text Corpus☆60Updated last year
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆64Updated last month
- COMET-ATOMIC ja☆29Updated 10 months ago
- NDL古典籍OCRのアプリケーション(ソースコードを含む)☆51Updated 2 months ago
- 青空文庫及びサピエの点字データから作成した振り仮名コーパスのデータセット☆12Updated last year
- ☆18Updated last year
- This repository has implementations of data augmentation for NLP for Japanese.☆64Updated last year
- ☆95Updated 6 years ago
- English loanwords in Japanese☆17Updated 2 months ago
- Sentence boundary disambiguation tool for Japanese texts (日本語文境界判定器)☆189Updated 9 months ago
- Trials of pre-trained BERT models for the medical domain in Japanese.☆12Updated 4 years ago
- An integrated Japanese analyzer based on foundation models☆131Updated 3 months ago
- A Japanese accent dictionary generator☆112Updated 9 months ago