hexgrad / misakiLinks
G2P
☆403Updated 5 months ago
Alternatives and similar repositories for misaki
Users that are interested in misaki are comparing it to the libraries listed below
Sorting:
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆651Updated 2 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆219Updated 9 months ago
- Fine Tune the Style-TTS2 Voice Model☆266Updated 7 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆267Updated 7 months ago
- Running the F5-TTS by ONNX Runtime☆191Updated last month
- A high quality and fast TTS repository☆498Updated last month
- ☆370Updated 4 months ago
- SoTA open-source TTS☆135Updated 8 months ago
- Interface for OuteTTS models.☆1,421Updated 7 months ago
- A random walk voice style cloning application for Kokoro text to speech☆205Updated 7 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆128Updated 6 months ago
- ☆297Updated 6 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆892Updated 8 months ago
- ☆100Updated last year
- ☆476Updated this week
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆187Updated last year
- ☆346Updated 5 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆838Updated 2 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆295Updated 8 months ago
- ☆474Updated 8 months ago
- Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in Pytorch☆514Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- Realtime demo, Streaming and Finetuning code for CSM☆442Updated 4 months ago
- Real-time Speech-Text Foundation Model Toolkit (wip)☆252Updated 10 months ago
- Run Orpheus 3B Locally With LM Studio☆513Updated 10 months ago
- ☆388Updated last year
- Efficient approach to speaker diarization using voice characteristics extraction☆106Updated 7 months ago
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆258Updated last year
- Open source inference code for Rev's model☆435Updated 9 months ago
- Very fast, accurate speaker diarization☆228Updated this week