allvoicelab / AllVoiceLab-MCP
Official AllVoiceLab Model Context Protocol (MCP) server, supporting interaction with powerful text-to-speech and video translation APIs.
☆55Updated 5 months ago
Alternatives and similar repositories for AllVoiceLab-MCP
Users who are interested in AllVoiceLab-MCP are comparing it to the repositories listed below.
- An open source chatbot architecture for voice/vision (and multimodal) assistants that runs locally (CPU/GPU bound) and remotely (I/O bound).☆86Updated 3 weeks ago
- ☆483Updated 8 months ago
- ☆532Updated 3 months ago
- Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.☆695Updated 3 weeks ago
- Open source video call conversational bot☆53Updated 2 months ago
- ☆344Updated 4 months ago
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…☆231Updated last month
- ☆473Updated 8 months ago
- GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning☆889Updated last month
- ☆340Updated 9 months ago
- Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation☆423Updated last month
- The official Python library for the Fish Audio API.☆137Updated this week
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆305Updated 7 months ago
- Open-source voice + video AI assistant built on LiveKit☆75Updated 5 months ago
- PersonaPlex code.☆110Updated this week
- Model Context Protocol (MCP) server that generates lyrics, songs, and background (instrumental) music.☆70Updated 8 months ago
- ☆296Updated 5 months ago
- ☆141Updated 3 weeks ago
- List of curated use cases built using Sesame's CSM 1B☆73Updated 7 months ago
- Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching☆142Updated 2 months ago
- A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics…☆822Updated 3 weeks ago
- ☆455Updated this week
- MiMo-Audio: Audio Language Models are Few-Shot Learners☆955Updated 4 months ago
- A FastAPI service for text-to-speech synthesis using the F5-TTS model. Includes authentication token☆36Updated 8 months ago
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆296Updated 2 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆771Updated last month
- Turn detection for full-duplex dialogue communication☆513Updated 3 weeks ago
- JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment☆147Updated 5 months ago
- ☆635Updated 2 months ago
- A high quality and fast TTS repository☆461Updated 3 weeks ago