A furigana dataset created from Japanese National Bibliography (全国書誌) data
☆31 · Sep 21, 2021 · Updated 4 years ago
Alternatives and similar repositories for huriganacorpus-ndlbib
Users interested in huriganacorpus-ndlbib are comparing it to the repositories listed below.
- A furigana corpus dataset created from Aozora Bunko and Sapie braille data ☆22 · Jan 17, 2024 · Updated 2 years ago
- Disambiguate Japanese heteronyms ☆32 · Oct 3, 2023 · Updated 2 years ago
- Command-line client for VOICEVOX ENGINE, VOICEVOX NEMO ENGINE, and COEIROINK. Also supports parallel processing across multiple engines. ☆11 · May 4, 2024 · Updated last year
- Japanese-BPEEncoder ☆41 · Sep 12, 2021 · Updated 4 years ago
- 🛥 Vaporetto: Very accelerated pointwise prediction based tokenizer ☆254 · Feb 7, 2026 · Updated last month
- ☆11 · Jan 11, 2022 · Updated 4 years ago
- A server that exposes voicepeak as an API. Emotions, voice pitch, and speed are adjusted automatically using OpenAI's API. ☆18 · Nov 12, 2023 · Updated 2 years ago
- Annotated Fuman Kaitori Center Corpus ☆18 · Dec 18, 2023 · Updated 2 years ago
- ☆29 · Feb 12, 2026 · Updated last month
- A server that exposes voicepeak as an API. ☆26 · Nov 12, 2023 · Updated 2 years ago
- Manages corpora for creating MYCOEIROINK ☆14 · Mar 21, 2023 · Updated 3 years ago
- Source code of the Next Digital Library (次世代デジタルライブラリー) ☆26 · Apr 27, 2023 · Updated 2 years ago
- Pre-train Embedding in LightFM Recommender System Framework ☆11 · Apr 28, 2019 · Updated 6 years ago
- Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in PyTorch ☆20 · Mar 2, 2024 · Updated 2 years ago
- 📝 A list of pre-trained BERT models for Japanese with word/subword tokenization + vocabulary construction algorithm information ☆131 · Mar 15, 2023 · Updated 3 years ago
- ☆72 · Sep 30, 2022 · Updated 3 years ago
- A singing database created by 22 people each singing five children's songs. ☆14 · Aug 7, 2022 · Updated 3 years ago
- NDL-DocL dataset (document image layout dataset) ☆30 · Mar 2, 2023 · Updated 3 years ago
- The repository contains scripts and merge scripts that have been modified to adapt an Alpaca-Lora adapter for LoRA tuning when assuming t… ☆19 · May 24, 2023 · Updated 2 years ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library. ☆13 · Jun 7, 2023 · Updated 2 years ago
- ☆13 · Jun 4, 2024 · Updated last year
- A fast character conversion and transliteration library based on the scheme defined for Japan National Tax Agency (国税庁)'s corporate numb… ☆21 · Mar 11, 2026 · Updated 2 weeks ago
- The robust text processing pipeline framework enabling customizable, efficient, and metric-logged text preprocessing. ☆124 · Nov 13, 2025 · Updated 4 months ago
- AJIMEE-Bench (Advanced Japanese IME Evaluation Benchmark) ☆20 · Jan 13, 2025 · Updated last year
- Meeting minutes meta-dataset (議事録メタデータセット) ☆12 · Jun 10, 2018 · Updated 7 years ago
- ☆15 · Aug 3, 2024 · Updated last year
- ☆16 · May 21, 2019 · Updated 6 years ago
- A corpus of Japanese Mama Katu invitation spam messages. ☆42 · Aug 1, 2025 · Updated 7 months ago
- ☆11 · Dec 13, 2023 · Updated 2 years ago
- A version of QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion that supports 44100 Hz Japanese HuBERT. ☆16 · May 21, 2023 · Updated 2 years ago
- NVIDIA's FastPitch, extracted from the DeepLearningExamples repository ☆14 · Mar 29, 2024 · Updated 2 years ago
- Utility scripts for preprocessing Wikipedia texts for NLP ☆78 · Apr 9, 2024 · Updated last year
- Dictionary Library for Kagome v2 ☆15 · Mar 11, 2026 · Updated 2 weeks ago
- A Cython MeCab wrapper for fast, pythonic Japanese tokenization and morphological analysis. ☆516 · Oct 24, 2025 · Updated 5 months ago
- Tokenizer, POS-tagger, lemmatizer, and dependency parser for modern and contemporary Japanese ☆38 · Dec 29, 2025 · Updated 3 months ago
- Audio Language Examples ☆20 · Dec 26, 2020 · Updated 5 years ago
- Japanese-BPEEncoder Version 2 ☆41 · Jan 15, 2023 · Updated 3 years ago
- Japanese grammatical error correction tool ☆29 · Jun 22, 2022 · Updated 3 years ago
- Neologism dictionary based on the language resources on the Web for mecab-unidic ☆87 · Sep 14, 2020 · Updated 5 years ago