全国書誌データから作成した振り仮名のデータセット
☆28Sep 21, 2021Updated 4 years ago
Alternatives and similar repositories for huriganacorpus-ndlbib
Users that are interested in huriganacorpus-ndlbib are comparing it to the libraries listed below
Sorting:
- 青空文庫及びサピエの点字データから作成した振り仮名コーパスのデータセット☆17Jan 17, 2024Updated 2 years ago
- 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer☆252Feb 7, 2026Updated last month
- Japanese-BPEEncoder☆41Sep 12, 2021Updated 4 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- Juliusを使ったセグメンテーション支援ツール☆13Feb 13, 2020Updated 6 years ago
- MYCOEIROINK作成用のコーパスを管理☆14Mar 21, 2023Updated 2 years ago
- 44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。☆16May 21, 2023Updated 2 years ago
- マルコフ連鎖を使った文章自動生成プログラム+はてなブログ投稿スクリプト☆19Jan 18, 2016Updated 10 years ago
- Dictionary Library for Kagome v2☆15Feb 10, 2026Updated 3 weeks ago
- 漢語常用字詞表☆16Jun 3, 2023Updated 2 years ago
- Twitter access token generator for CLI☆15Nov 24, 2021Updated 4 years ago
- 青空文庫振り仮名注釈付き音声コーパスのデータセット☆45Mar 7, 2025Updated last year
- recpt1をベースにしたLinux用BonDriver録画コマンド☆14Apr 3, 2015Updated 10 years ago
- ☆16May 21, 2019Updated 6 years ago
- The corpus of Japanese spam messages of invitation Mama Katu.☆42Aug 1, 2025Updated 7 months ago
- ☆29Feb 12, 2026Updated 3 weeks ago
- The repository contains scripts and merge scripts that have been modified to adapt an Alpaca-Lora adapter for LoRA tuning when assuming t…☆18May 24, 2023Updated 2 years ago
- ジャンプ系列の漫画をダウンロードするライブラリ/ソフトウェア☆17Oct 19, 2024Updated last year
- Annotated Fuman Kaitori Center Corpus☆18Dec 18, 2023Updated 2 years ago
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch☆20Mar 2, 2024Updated 2 years ago
- A fast character conversion and transliteration library based on the scheme defined for Japan National Tax Agency (国税庁) 's corporate numb…☆21Updated this week
- 📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information☆131Mar 15, 2023Updated 2 years ago
- Alpaca-LoRAをlivedoorニュースコーパスでFineTuningさせるサンプルコード☆21Mar 19, 2023Updated 2 years ago
- InputMethodKit Sample App with macOS12, Xcode13, Swift5.6 in 2022.☆56Mar 18, 2024Updated last year
- A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.☆514Oct 24, 2025Updated 4 months ago
- Neologism dictionary based on the language resources on the Web for mecab-unidic☆87Sep 14, 2020Updated 5 years ago
- 自作フォント「Xim Sans」の配布場所☆32Sep 16, 2023Updated 2 years ago
- 🍊 PAUSE (Positive and Annealed Unlabeled Sentence Embedding), accepted by EMNLP'2021 🌴☆26May 30, 2024Updated last year
- python版日本語意味役割付与システム(ASA)☆22Nov 11, 2022Updated 3 years ago
- ☆39Oct 21, 2025Updated 4 months ago
- programming contests, problems, et cetera☆25Feb 21, 2026Updated 2 weeks ago
- Delete your 2 or more days ago tweets Automatically.☆31Jul 19, 2023Updated 2 years ago
- 粵語對話語料☆29May 12, 2023Updated 2 years ago
- COEIROINK v2 を VOICEVOX のマルチエンジンで読み込めるようにするためのブリッジ。☆34Jan 13, 2026Updated last month
- Google Input Tools for macOS☆32Updated this week
- 🦞 Rust library of natural language dictionaries using character-wise double-array tries.☆37Jan 13, 2025Updated last year
- ☆72Sep 30, 2022Updated 3 years ago
- Japanese data from the Google UDT 2.0.☆28Mar 24, 2023Updated 2 years ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆125Nov 13, 2025Updated 3 months ago