fluxions-ai / vuiLinks
☆577Updated this week
Alternatives and similar repositories for vui
Users that are interested in vui are comparing it to the libraries listed below
Sorting:
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆260Updated 3 weeks ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆294Updated 2 months ago
- ☆488Updated last week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆440Updated 2 months ago
- Run Orpheus 3B Locally With LM Studio☆428Updated 3 months ago
- Daily Bots Web Demo showcasing how to build real-time voice AI agents☆243Updated 8 months ago
- List of curated use cases built using Sesame's CSM 1B☆66Updated 3 weeks ago
- Delayed Streams Modeling (DSM) is a flexible formulation for streaming, multimodal sequence-to-sequence learning.☆310Updated this week
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆359Updated last month
- Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗☆661Updated last week
- An experiment in meeting transcription and diarization with just an LLM. Maybe I went a little overboard though☆553Updated 3 weeks ago
- ☆432Updated last month
- Sesame CSM 1B Voice Cloning☆305Updated 3 months ago
- Kyutai with an "eye"☆200Updated 3 months ago
- podcastfy.ai gradio demo app☆334Updated 6 months ago
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…☆255Updated 3 weeks ago
- ☆754Updated 2 months ago
- ☆238Updated 2 months ago
- Gemini Multimodal Live + WebRTC in a single `app.ts`☆206Updated 6 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆230Updated 5 months ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆752Updated 2 weeks ago
- Interface for OuteTTS models.☆1,318Updated last week
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆179Updated 2 months ago
- A Fast TTS Engine☆517Updated 5 months ago
- G2P☆262Updated last month
- Streaming and Fine-tuning for Chatterbox TTS☆109Updated last week
- Youtube API Server used in https://git.new/scira☆328Updated 3 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆579Updated 2 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆259Updated last month
- Example UI implementing the RTVI web client☆477Updated 6 months ago