levelevel / AozoraTxt
Text files from Aozora Bunko
☆9Updated last year
Alternatives and similar repositories for AozoraTxt
Users that are interested in AozoraTxt are comparing it to the libraries listed below
- ☆41Updated 4 months ago
- JMultiWOZ: A Large-Scale Japanese Multi-Domain Task-Oriented Dialogue Dataset, LREC-COLING 2024☆24Updated last year
- Python version of the Japanese semantic role labeling system (ASA)☆23Updated 2 years ago
- Japanese Massive Multitask Language Understanding Benchmark☆36Updated 7 months ago
- Japanese tokenizer for Transformers☆79Updated last year
- Scripts and related tools for converting Japanese Wikipedia sentences into various Japanese embeddings and faiss indexes.☆11Updated last year
- Yet another Python binding for Juman++/KNP/KWJA☆33Updated last week
- Zunda: Japanese Enhanced Modality Analyzer client for Python.☆10Updated 5 years ago
- ☆51Updated 2 years ago
- ☆15Updated last year
- Japanese grammatical error correction tool☆29Updated 3 years ago
- A paraphrase database for Japanese text simplification☆32Updated 8 years ago
- An integrated Japanese analyzer based on foundation models☆133Updated last week
- ☆23Updated last year
- Namelti: An automatic transcription generation library for person names in Katakana☆21Updated 2 years ago
- Accommodation Search Dialog Corpus (宿泊施設探索対話コーパス)☆25Updated last year
- RealPersonaChat: A Realistic Persona Chat Corpus with Interlocutors' Own Personalities☆60Updated last year
- Japanese Word Similarity Dataset☆101Updated 3 years ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆122Updated this week
- Flexible evaluation tool for language models☆49Updated last week
- COMET-ATOMIC ja☆30Updated last year
- Utility scripts for preprocessing Wikipedia texts for NLP☆77Updated last year
- Dialogue corpus created by crawling Open2ch (おーぷん2ちゃんねる)☆97Updated 4 years ago
- Neologism dictionary based on the language resources on the Web for mecab-unidic☆87Updated 4 years ago
- Japanese LLaMa experiment☆53Updated 7 months ago
- ☆85Updated last year
- ☆14Updated 3 weeks ago
- ☆61Updated last year
- ☆9Updated 10 months ago
- A project to make Aozora Bunko texts more convenient (more machine-readable)☆21Updated 2 years ago