yukiarimo / hanasuLinks
Hanasu is a human-like TTS model based on the multilingual Himitsu V1 transformer-based encoder and VITS architecture
☆32Updated last month
Alternatives and similar repositories for hanasu
Users that are interested in hanasu are comparing it to the libraries listed below
Sorting:
- A random walk voice style cloning application for Kokoro text to speech☆103Updated last month
- Streaming and Fine-tuning for Chatterbox TTS☆128Updated last month
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆90Updated last month
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆56Updated last month
- ☆24Updated 2 weeks ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆41Updated 2 weeks ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆20Updated 4 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆63Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆160Updated last year
- Open TTS models, built for streaming on the edge☆43Updated 4 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆34Updated last month
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX☆20Updated 9 months ago
- Course Project for COMP4471 on RWKV☆17Updated last year
- ☆97Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆188Updated 10 months ago
- SoTA open-source TTS☆63Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆109Updated last month
- Examples of using the llasa-tts models locally☆175Updated 2 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆26Updated last month
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆27Updated 9 months ago
- zero-shot realtime TTS system, fully offline, free and open source☆41Updated 2 months ago
- TTS support with GGML☆127Updated 2 weeks ago
- A cli app for experimenting with kokoro voice creating and mixing using the available voices to interpolate new ones☆27Updated 5 months ago
- ☆51Updated 2 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆189Updated 2 months ago
- StyleTTS 2 Optimized Training Fork☆32Updated 5 months ago
- SoTA open-source TTS☆42Updated 3 weeks ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆11Updated 3 months ago
- Explore, Install, Innovate — in 1 Click.☆27Updated this week
- Joint speech-language model - respond directly to audio!☆30Updated last year