yukiarimo / hanasu

Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture

☆33

Alternatives and similar repositories for hanasu

Users who are interested in hanasu are comparing it to the repositories listed below.

RobViren / kvoicewalk
A random walk voice style cloning application for Kokoro text to speech
☆110 Updated last month
thomasgauthier / csm-hf
Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers
☆57 Updated 2 months ago
davidbrowne17 / chatterbox-streaming
Streaming and Fine-tuning for Chatterbox TTS
☆149 Updated last month
taresh18 / TTSizer
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨
☆97 Updated this week
sidharthrajaram / StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
☆160 Updated last year
NeuralVox / StyleTTS2
☆98 Updated last year
nivibilla / local-llasa-tts
Examples of using the llasa-tts models locally
☆177 Updated 3 months ago
taylorchu / 2cent-tts
☆32 Updated last month
IIEleven11 / Automatic-Audio-Dataset-Maker
Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.
☆42 Updated 2 weeks ago
anan235 / dia-multilingual
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆198 Updated 3 months ago
balisujohn / tortoise.cpp
A ggml (C++) re-implementation of tortoise-tts
☆188 Updated 11 months ago
gooofy / zerovox
zero-shot realtime TTS system, fully offline, free and open source
☆41 Updated 3 months ago
stlohrey / dia-finetuning
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆114 Updated 2 weeks ago
kyutai-labs / moshi-finetune
☆272 Updated last month
stlohrey / chatterbox-finetuning
SoTA open-source TTS
☆72 Updated 2 months ago
rsxdalv / chatterbox
SoTA open-source TTS
☆50 Updated last week
manmay-nakhashi / tortoise-tts-fastest
Faster Tortoise inference than Tortoise Fast Fork
☆128 Updated last year
JarodMica / GPT-SoVITS-Package
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆28 Updated 2 months ago
Picovoice / orca
On-device streaming text-to-speech engine powered by deep learning
☆121 Updated this week
EndlessReform / smoltts
Open TTS models, built for streaming on the edge
☆43 Updated 4 months ago
dangtr0408 / StyleTTS2-lite
A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.
☆35 Updated 2 months ago
Bill13579 / beltout
BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC
☆70 Updated 3 weeks ago
clement-pages / gryannote
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
☆65 Updated 2 weeks ago
skirdey / voicerestore
VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration
☆178 Updated 3 months ago
zenforic / csm-multi
Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…
☆23 Updated 4 months ago
Saganaki22 / OrpheusTTS-WebUI
Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]
☆101 Updated 4 months ago
jakariaemon / WSI
Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.
☆22 Updated 4 months ago
efogdev / apollo
Deploy Apollo HF space locally
☆40 Updated 7 months ago
IIEleven11 / StyleTTS2FineTune
☆245 Updated last month
theodorblackbird / lina-speech
Official implementation of the TTS model Lina-Speech
☆167 Updated 7 months ago