mush42 / optispeech
A lightweight end-to-end text-to-speech model
☆110Updated 3 weeks ago
Alternatives and similar repositories for optispeech:
Users that are interested in optispeech are comparing it to the libraries listed below
- A toolkit for speaker diarization.☆172Updated 4 months ago
- We Speech Transcript based on LLM, in 300 lines of code.☆149Updated 2 weeks ago
- ☆157Updated 3 months ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆86Updated 5 months ago
- Open source inference code for Rev's model☆383Updated 2 weeks ago
- Running the F5-TTS by ONNX Runtime☆123Updated this week
- Nendo is an open source platform for AI-driven audio management, intelligence, and generation.☆120Updated last year
- F5-TTS 推理加速,速度提升约4倍!☆59Updated 2 months ago
- VoiceBox neural network implementation☆105Updated 7 months ago
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆232Updated 6 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆60Updated last week
- Speech Diarization for scrum automation☆102Updated last year
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆157Updated 9 months ago
- Cantonese Text to Speech with VITS implementation☆20Updated last year
- AMT-APC: AMT-APC: Automatic Piano Cover by Fine-Tuning an Automatic Music Transcription Model☆61Updated last week
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆232Updated last week
- ☆254Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆94Updated 5 months ago
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆34Updated 3 months ago
- Real time faster whisper gradio☆26Updated 5 months ago
- G2P☆171Updated this week
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 11 months ago
- VALL-E 2 reproduction☆117Updated 8 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆89Updated last month