kadirnar / VoiceHubLinks
VoiceHub: A Unified Inference Interface for TTS Models
☆58Updated 3 weeks ago
Alternatives and similar repositories for VoiceHub
Users that are interested in VoiceHub are comparing it to the libraries listed below
Sorting:
- Callytics is an advanced call analytics solution that leverages speech recognition and large language models (LLMs) technologies to analy…☆74Updated 7 months ago
- ☆203Updated last month
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆36Updated 8 months ago
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆276Updated last month
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆295Updated 5 months ago
- Roomey is a multi-purpose Voice Agent designed to run your personal and business life.☆50Updated 5 months ago
- Open TTS models, built for streaming on the edge☆44Updated 8 months ago
- Kyutai with an "eye"☆224Updated 8 months ago
- Orpheus Server with streaming support (TTFB ~160ms)☆18Updated 2 months ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆30Updated last year
- An example repository to use HuggingFace smolagents, Phidata and CrewAI frameworks with local LLMs☆40Updated 10 months ago
- ☆12Updated 7 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated last month
- ☆313Updated 3 months ago
- A lightweight Python library for running TTS models with a unified API.☆21Updated 9 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆23Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆49Updated 6 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆344Updated 7 months ago
- MeloPlus: Advanced Python Library for MeloTts☆11Updated 3 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆215Updated 7 months ago
- Collection of Open Source Speech Data☆163Updated last month
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆103Updated 11 months ago
- An open-source implementation of Whisper☆459Updated last month
- Speech synthesis (TTS) in low-resource languages by training from scratch with Fastpitch and fine-tuning with HifiGan☆61Updated last year
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models☆31Updated 3 years ago
- SoTA open-source TTS☆114Updated 5 months ago
- Inference, Fine Tuning and many more recipes with Gemma family of models☆274Updated 4 months ago
- A package for NeuCodec: a 50hz, 0.8kbps, 24kHz audio codec.☆120Updated last month
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆193Updated 7 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆128Updated 3 months ago