nlp-waseda / Kanbun-LM
Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"
☆16Updated last year
Alternatives and similar repositories for Kanbun-LM:
Users that are interested in Kanbun-LM are comparing it to the libraries listed below
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆51Updated last month
- Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese with BERT models☆17Updated 7 months ago
- A small version of UniDic for easy pip installs.☆42Updated 4 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆65Updated 3 months ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆13Updated last year
- Japanese-BPEEncoder Version 2☆41Updated 2 years ago
- ☆23Updated 3 months ago
- Unidic packaged for installation via pip.☆85Updated last year
- Kyoto University Text Corpus☆62Updated last year
- English loanwords in Japanese☆17Updated 3 months ago
- ☆28Updated 3 months ago
- ☆9Updated 5 months ago
- A summarizer for Japanese articles (but ChatGPT is better)☆10Updated 2 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆15Updated 3 months ago
- Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese☆34Updated 3 months ago
- Classical Chinese to Modern Japanese Translator☆27Updated last year
- COMET-ATOMIC ja☆29Updated 11 months ago
- Yet another Python binding for Juman++/KNP/KWJA☆31Updated 4 months ago
- 全国書誌データから作成した振り仮名のデータセット☆26Updated 3 years ago
- 日本語文法誤り訂正ツール☆28Updated 2 years ago
- Japanese data from the Google UDT 2.0.☆37Updated 3 months ago
- Hanja Understanding Evaluation Dataset☆13Updated 2 years ago
- Hanzipy is a Chinese character and NLP module for Chinese language processing for python. It is primarily written to help provide a frame…☆18Updated last year
- CCL 2023 古汉语通假字语料库的构建及应用研究:通假字资源库☆12Updated last year
- NDL古典籍OCRのアプリケーション(ソースコードを含む)☆53Updated 3 months ago
- Codes to pre-train Japanese T5 models☆41Updated 3 years ago
- A powerful text cleaner for Japanese web texts☆12Updated last year
- 🌸De-inflect Japanese words☆12Updated 2 years ago
- An example usage of JParaCrawl pre-trained Neural Machine Translation (NMT) models.☆104Updated 3 years ago
- 青空文庫及びサピエの点字データから作成した振り仮名コーパスのデータセット☆13Updated last year