nlp-waseda / Kanbun-LMLinks
Code for paper "Kanbun-LM: Reading and Translating Classical Chinese in Japanese Method by Language Models"
☆16Updated last year
Alternatives and similar repositories for Kanbun-LM
Users that are interested in Kanbun-LM are comparing it to the libraries listed below
Sorting:
- Tokenizer POS-Tagger and Dependency-parser with BERT/RoBERTa/DeBERTa/GPT models for Japanese and other languages☆50Updated last month
- Classical Chinese to Modern Japanese Translator☆29Updated last year
- A small version of UniDic for easy pip installs.☆43Updated 4 years ago
- COMET-ATOMIC ja☆29Updated last year
- 日本語文法誤り訂正ツール☆29Updated 2 years ago
- ☆28Updated last week
- ☆51Updated 2 years ago
- Scripts for creating a Japanese-English parallel corpus and training NMT models☆17Updated 3 years ago
- Trials of pre-trained BERT models for the medical domain in Japanese.☆12Updated 4 years ago
- ☆24Updated last week
- Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese☆37Updated 6 months ago
- A powerful text cleaner for Japanese web texts☆12Updated last year
- Annotated Fuman Kaitori Center Corpus☆18Updated last year
- Tokenizer POS-tagger Lemmatizer and Dependency-parser for modern and contemporary Japanese with BERT models☆20Updated 2 months ago
- ☆9Updated 9 months ago
- Unidic packaged for installation via pip.☆96Updated 3 months ago
- Kyoto University Text Corpus☆62Updated last year
- python版日本語意味役割付与システム(ASA)☆23Updated 2 years ago
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆13Updated last year
- A Japanese accent dictionary generator☆117Updated last year
- Codes to pre-train Japanese T5 models☆41Updated 3 years ago
- 🛥 Vaporetto is a fast and lightweight pointwise prediction based tokenizer. This is a Python wrapper for Vaporetto.☆20Updated last week
- Utility scripts for preprocessing Wikipedia texts for NLP☆77Updated last year
- Tokenizer POS-tagger and Dependency-parser for Classical Chinese☆64Updated 6 months ago
- DIRECT: Direct and Indirect REsponses in Conversational Text Corpus☆16Updated 3 years ago
- Yet another sentence-level tokenizer for the Japanese text☆22Updated 2 years ago
- ☆17Updated 2 years ago
- AllenNLP integration for Shiba: Japanese CANINE model☆12Updated 3 years ago
- 全国書誌データから作成した振り仮名のデータセット☆27Updated 3 years ago
- The Business Scene Dialogue corpus☆68Updated 3 years ago