全国書誌データから作成した振り仮名のデータセット
☆31Sep 21, 2021Updated 4 years ago
Alternatives and similar repositories for huriganacorpus-ndlbib
Users that are interested in huriganacorpus-ndlbib are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 青空文庫及びサピエの点字データから作成した振り仮名コーパスのデータセット☆22Jan 17, 2024Updated 2 years ago
- Disambiguate japanese heteronyms☆32Oct 3, 2023Updated 2 years ago
- VOICEVOX ENGINE、VOICEVOX NEMO ENGINE、COEIROINK用コマンドラインクライアント。複数のエンジンを使用した並列処理もできます☆11May 4, 2024Updated last year
- Japanese-BPEEncoder☆41Sep 12, 2021Updated 4 years ago
- 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer☆256Feb 7, 2026Updated 2 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Jan 11, 2022Updated 4 years ago
- This is a server to use voicepeak as api. By using openAI's API, emotions, voice pitch, and speed are automatically adjusted.☆18Nov 12, 2023Updated 2 years ago
- Annotated Fuman Kaitori Center Corpus☆18Dec 18, 2023Updated 2 years ago
- Juliusを使ったセグメンテーション支援ツール☆13Feb 13, 2020Updated 6 years ago
- ☆29Feb 12, 2026Updated 2 months ago
- This is a server to use voicepeak as api.☆26Nov 12, 2023Updated 2 years ago
- MYCOEIROINK作成用のコーパスを管理☆14Mar 21, 2023Updated 3 years ago
- 次世代デジタルライブラリーのソースコード(Programs of the Next Digital Library.)☆26Apr 27, 2023Updated 2 years ago
- Pre-train Embedding in LightFM Recommender System Framework☆11Apr 28, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch☆20Mar 2, 2024Updated 2 years ago
- 📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information☆132Mar 15, 2023Updated 3 years ago
- PDFからテキストデータを抽出して機械学習等に適用するためのツール群☆12Aug 4, 2021Updated 4 years ago
- ☆72Sep 30, 2022Updated 3 years ago
- This is an implementation of "Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention" wit…☆28Dec 23, 2017Updated 8 years ago
- NDL-DocLデータセット(資料画像レイアウトデータセット)☆30Mar 2, 2023Updated 3 years ago
- The repository contains scripts and merge scripts that have been modified to adapt an Alpaca-Lora adapter for LoRA tuning when assuming t…☆19May 24, 2023Updated 2 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆13Jun 7, 2023Updated 2 years ago
- ☆13Jun 4, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A fast character conversion and transliteration library based on the scheme defined for Japan National Tax Agency (国税庁) 's corporate numb…☆21Mar 11, 2026Updated last month
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing.☆126Apr 10, 2026Updated last week
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark)☆20Jan 13, 2025Updated last year
- 議事録メタデータセット☆12Jun 10, 2018Updated 7 years ago
- ☆15Aug 3, 2024Updated last year
- ☆16May 21, 2019Updated 6 years ago
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆10Dec 3, 2023Updated 2 years ago
- The corpus of Japanese spam messages of invitation Mama Katu.☆42Aug 1, 2025Updated 8 months ago
- ☆11Dec 13, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- python版日本語意味役割付与システム(ASA)☆22Nov 11, 2022Updated 3 years ago
- 44100Hz日本語HuBERTに対応した QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion です。☆16May 21, 2023Updated 2 years ago
- NVIDIA's FastPitch, extracted from the DeepLearningExamples repository☆14Mar 29, 2024Updated 2 years ago
- Utility scripts for preprocessing Wikipedia texts for NLP☆78Apr 9, 2024Updated 2 years ago
- Dictionary Library for Kagome v2☆15Apr 9, 2026Updated last week
- A client of DNSPod☆19Jan 25, 2014Updated 12 years ago
- A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis.☆518Oct 24, 2025Updated 5 months ago