kotoba-tech/kotoba-whisper

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/kotoba-tech/kotoba-whisper)

kotoba-tech / kotoba-whisper

☆97

Alternatives and similar repositories for kotoba-whisper

Users that are interested in kotoba-whisper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kotoba-tech / Open-GPT-4o
View on GitHub
☆10May 16, 2024Updated 2 years ago
unilight / jatts
View on GitHub
JATTS: A modern, research-oriented Japanese Text-to-speech Open-sourced Toolkit
☆43Mar 13, 2026Updated 4 months ago
reazon-research / ReazonSpeech
View on GitHub
Massive open Japanese speech corpus
☆389Jun 10, 2026Updated last month
SakanaAI / TinySwallow-ChatUI
View on GitHub
Browser-based chat UI for TinySwallow-1.5B that runs without API calls.
☆136Dec 1, 2025Updated 7 months ago
ryota-komatsu / slp2025
View on GitHub
Survey of audio language models
☆65Apr 18, 2026Updated 3 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
Atotti / miipher-2
View on GitHub
Googleの音声復元モデルMiipher-2の再現実装の学習および推論コード。学習済みモデルも公開しています。
☆32Feb 7, 2026Updated 5 months ago
kotoba-tech / kotoba-recipes
View on GitHub
Support Continual pre-training & Instruction Tuning forked from llama-recipes
☆34Feb 17, 2024Updated 2 years ago
zbller / Mecari
View on GitHub
☆40Oct 21, 2025Updated 9 months ago
otya128 / EPGStation
View on GitHub
EPGStation 4K fork
☆12Dec 15, 2024Updated last year
kotoba-tech / kotoba-speech-release
View on GitHub
☆49Jul 22, 2024Updated last year
nikhilraghav29 / diarizen-tutorial
View on GitHub
DiariZen Explained: A Tutorial for the Open Source State-of-the-Art Speaker Diarization Pipeline.
☆21Apr 24, 2026Updated 2 months ago
lourson1091 / audiobertscore
View on GitHub
☆15Nov 10, 2025Updated 8 months ago
mmttmte / arib-b61-stream-test
View on GitHub
ARIB STD-B61 implementation
☆15Feb 20, 2025Updated last year
neodyland / sbv2-api
View on GitHub
Infer only tts
☆48Jul 13, 2026Updated last week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
tonnetonne814 / WhisperLive-PEFT
View on GitHub
Whisper系列のPEFTと、PEFT済のモデルを使ったストリーミング書き起こしを実装するためのリポジトリです。
☆15Oct 16, 2025Updated 9 months ago
Wataru-Nakata / latentlm-tts
View on GitHub
☆29Jul 3, 2026Updated 2 weeks ago
ndl-lab / hurigana-speech-corpus-aozora
View on GitHub
青空文庫振り仮名注釈付き音声コーパスのデータセット
☆50Mar 7, 2025Updated last year
mmorise / rohan4600
View on GitHub
モーラバランス型日本語コーパス
☆73Mar 13, 2026Updated 4 months ago
chutaklee / CantoASR
View on GitHub
Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)
☆16May 8, 2022Updated 4 years ago
Hiroshiba / openjtalk-label-getter
View on GitHub
☆10Dec 10, 2021Updated 4 years ago
tonnetonne814 / PITS-44100-Ja
View on GitHub
44100Hz日本語音源に対応した PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor です。
☆21May 2, 2023Updated 3 years ago
yashbonde / RNN-sim
View on GitHub
Running massive simulations using RNNs on CPUs for building bots and all kinds of things.
☆12Jun 13, 2021Updated 5 years ago
aq2r / beatrice-client
View on GitHub
GUI for Beatrice Voice Changer
☆31Feb 18, 2026Updated 5 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
michiyasunaga / pos_adv
View on GitHub
[NAACL 2018] Robust Sequence Labeling with Adversarial Training
☆10Sep 30, 2019Updated 6 years ago
axeber01 / wav2pos
View on GitHub
3D Sound Source Localization using Masked Autoencoders
☆21Feb 12, 2025Updated last year
b-sigpro / neural-fcasa
View on GitHub
This is a repository of neural full-rank spatial covariance analysis with speaker activity (neural FCASA).
☆40Mar 12, 2025Updated last year
sarulab-speech / audio-foundation-model-dataset
View on GitHub
☆65Jan 8, 2025Updated last year
lepus-ctf / lepus-api
View on GitHub
Lepus CTF(旧: TDUCTF)で利用されているRESTfulなスコアサーバー
☆11Nov 23, 2015Updated 10 years ago
amane-uehara / cpubook
View on GitHub
書籍「作ろう！CPU」のサポートページ
☆20Sep 12, 2024Updated last year
xtne6f / psisiarc
View on GitHub
MPEG-TS section (PSI/SI etc.) archiver
☆19Jun 8, 2024Updated 2 years ago
sarulab-speech / xvector_jtubespeech
View on GitHub
xvector model on jtubespeech
☆47Nov 5, 2023Updated 2 years ago
SakanaAI / TinySwallow-ChatUI-Local
View on GitHub
Python-based chat demo for TinySwallow-1.5B that works completely offline
☆56Jan 29, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Deep-unlearning / Finetune-Parakeet
View on GitHub
☆25Oct 22, 2025Updated 8 months ago
ayutaz / uni-llm-voice-chat
View on GitHub
This project uses llama.cpp as an LLM server to perform inference and generate speech using Synthetic voice library
☆22Mar 5, 2024Updated 2 years ago
sammthomson / ChuLiuEdmonds
View on GitHub
Tarjan's implementation of the Chu-Liu-Edmonds algorithm for finding min/max spanning trees of dense graphs.
☆11Apr 19, 2015Updated 11 years ago
sevenc-nanashi / coeiroink-v2-bridge
View on GitHub
COEIROINK v2 を VOICEVOX のマルチエンジンで読み込めるようにするためのブリッジ。
☆36Jan 13, 2026Updated 6 months ago
ohuelab / FastLomap
View on GitHub
Alchemical mutation scoring map
☆11May 19, 2024Updated 2 years ago
kaistmm / seed-pytorch
View on GitHub
[INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement Diffusion Model"
☆59Nov 3, 2025Updated 8 months ago
mzyy94 / InkArt
View on GitHub
Display grayscale art on Inkplate!
☆13Apr 15, 2022Updated 4 years ago