yukiarimo / hanasuLinks
Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture
☆33Updated 3 weeks ago
Alternatives and similar repositories for hanasu
Users that are interested in hanasu are comparing it to the libraries listed below
Sorting:
- A random walk voice style cloning application for Kokoro text to speech☆110Updated last month
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57Updated 2 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆149Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆97Updated this week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- ☆98Updated last year
- Examples of using the llasa-tts models locally☆177Updated 3 months ago
- ☆32Updated last month
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆42Updated 2 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆198Updated 3 months ago
- A ggml (C++) re-implementation of tortoise-tts☆188Updated 11 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆41Updated 3 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆114Updated 2 weeks ago
- ☆272Updated last month
- SoTA open-source TTS☆72Updated 2 months ago
- SoTA open-source TTS☆50Updated last week
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated last year
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆28Updated 2 months ago
- On-device streaming text-to-speech engine powered by deep learning☆121Updated this week
- Open TTS models, built for streaming on the edge☆43Updated 4 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆35Updated 2 months ago
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC☆70Updated 3 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆65Updated 2 weeks ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆178Updated 3 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 4 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆101Updated 4 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆22Updated 4 months ago
- Deploy Apollo HF space locally☆40Updated 7 months ago
- ☆245Updated last month
- Official implementation of the TTS model Lina-Speech☆167Updated 7 months ago