taresh18 / orpheus-streaming
Orpheus Server with streaming support (TTFB ~160ms)
☆13 · Updated last month
Alternatives and similar repositories for orpheus-streaming
Users who are interested in orpheus-streaming are comparing it to the libraries listed below.
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers ☆57 · Updated 4 months ago
- ☆140 · Updated 3 weeks ago
- ☆246 · Updated 2 weeks ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆68 · Updated last week
- Collection of Open Source Speech Data ☆160 · Updated this week
- Open TTS models, built for streaming on the edge ☆42 · Updated 6 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification. ☆24 · Updated 6 months ago
- Video+code lecture on building nanoGPT from scratch ☆69 · Updated last year
- ☆62 · Updated last year
- SoTA open-source TTS ☆86 · Updated 3 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨ ☆123 · Updated last month
- Efficient approach to speaker diarization using voice characteristics extraction ☆100 · Updated 3 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆282 · Updated 3 months ago
- VoiceHub: A Unified Inference Interface for TTS Models ☆52 · Updated 2 weeks ago
- Finetune Sesame's CSM 1B model, for fun and profit ☆17 · Updated 5 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆278 · Updated 4 months ago
- Streaming and Fine-tuning for Chatterbox TTS ☆182 · Updated 3 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output. ☆26 · Updated last year
- Playing with CSM ☆22 · Updated 6 months ago
- SoTA open-source TTS ☆90 · Updated this week
- VLLM Port of the Chatterbox TTS model ☆293 · Updated last week
- a simple system for 2-way interruptible voice interactions between human and LLM ☆30 · Updated last year
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper. ☆27 · Updated 6 months ago
- ☆280 · Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆122 · Updated last month
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI … ☆52 · Updated 7 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model. ☆42 · Updated last week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆159 · Updated last year
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi… ☆192 · Updated 5 months ago
- A random walk voice style cloning application for Kokoro text to speech ☆128 · Updated 3 months ago