☆81Oct 23, 2024Updated last year
Alternatives and similar repositories for kotoba-whisper
Users who are interested in kotoba-whisper are comparing it to the libraries listed below.
- Massive open Japanese speech corpus☆366Jan 19, 2026Updated last month
- ☆10Dec 10, 2021Updated 4 years ago
- ☆39Oct 21, 2025Updated 4 months ago
- ☆15Nov 10, 2025Updated 4 months ago
- EPGStation 4K fork☆12Dec 15, 2024Updated last year
- narabas: Japanese phoneme forced alignment tool☆13Mar 15, 2023Updated 2 years ago
- Robust Speech Recognition via Large-Scale Weak Supervision☆19Dec 1, 2022Updated 3 years ago
- MPEG-TS section (PSI/SI etc.) archiver☆18Jun 8, 2024Updated last year
- Mora-balanced Japanese corpus☆67Feb 1, 2023Updated 3 years ago
- PyTorch model for contextless phoneme prediction from speech audio☆32Oct 30, 2025Updated 4 months ago
- PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor, adapted for 44100 Hz Japanese audio.☆21May 2, 2023Updated 2 years ago
- This project uses llama.cpp as an LLM server to perform inference and generate speech using Synthetic voice library☆22Mar 5, 2024Updated 2 years ago
- Infer only tts☆45Jan 6, 2026Updated 2 months ago
- GUI for Beatrice Voice Changer☆21Feb 18, 2026Updated 2 weeks ago
- Mirakurun 4K fork☆24Dec 7, 2024Updated last year
- Examples and guides for using the OpenAI API (Japanese translation of https://github.com/openai/openai-cookbook)☆11Nov 13, 2023Updated 2 years ago
- low latency player used Breakout box☆27Oct 22, 2022Updated 3 years ago
- FlexGen with docker☆29Mar 20, 2023Updated 2 years ago
- A TTS Trained on Universal Audio.☆41Jun 6, 2025Updated 9 months ago
- ComfyUI-AniSora is now available in ComfyUI, Index-AniSora is the most powerful open-source animated video generation model. It enables o…☆48May 27, 2025Updated 9 months ago
- Furigana dataset built from Japanese national bibliography data☆28Sep 21, 2021Updated 4 years ago
- Package for inference for punctuation, true-casing, and sentence boundary detection☆29Jun 8, 2024Updated last year
- 💠 Aivis: AI Voice Imitation System☆27Feb 25, 2024Updated 2 years ago
- Otofu-kun Slack stamps (custom emoji)☆28Jul 23, 2024Updated last year
- Bluetooth plugin for Flutter☆10Dec 19, 2022Updated 3 years ago
- Inference code for the deep learning models used inside the VOICEVOX core☆31Dec 3, 2025Updated 3 months ago
- transcribe iPhone VoiceMemo and send it to iPhone Memo application☆46Feb 9, 2025Updated last year
- A tool for aligning phoneme labels to Japanese speech☆36Aug 19, 2025Updated 6 months ago
- JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit☆44May 26, 2025Updated 9 months ago
- La-O-Dan iOS study☆19Oct 11, 2011Updated 14 years ago
- A lightweight .NET Core console program to merge multiple TIFF files into one.☆12Jul 30, 2019Updated 6 years ago
- ISDB-S3 fork☆10Dec 13, 2024Updated last year
- ☆13Jul 17, 2021Updated 4 years ago
- A real-time and light-weight software for generation of non-linguistic behaviors (turn-taking, backchannel, and head-nodding) in conversa…☆81Feb 20, 2026Updated 2 weeks ago
- Simple persistent unique IDs for machines.☆11Aug 27, 2023Updated 2 years ago
- call rwkv v4/v5/v6/v7 raven/world/finch 1B5-14B rwkv.cpp using csharp cpu/gpu (support INT4,8,Float16,32)☆35Feb 21, 2025Updated last year
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆108Jan 17, 2025Updated last year
- AI based singing voice synthesis☆37Jun 10, 2024Updated last year
- Gradio WebUI for training and speech synthesis with Japanese TTS (VITS)☆42Jan 5, 2024Updated 2 years ago