microsoft / VibeVoiceLinks
Open-Source Frontier Voice AI
☆22,955Updated this week
Alternatives and similar repositories for VibeVoice
Users that are interested in VibeVoice are comparing it to the libraries listed below
Sorting:
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆5,715Updated 2 weeks ago
- On-device TTS model by Neuphonic☆4,768Updated last week
- State-of-the-art TTS model under 25MB 😻☆9,590Updated this week
- The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trai…☆3,256Updated last month
- Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…☆6,994Updated this week
- Simultaneous speech-to-text model☆9,644Updated 3 weeks ago
- SoTA open-source TTS☆22,346Updated this week
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆5,842Updated this week
- The world's first open-source multimodal creative assistant This is a substitute for Canva and Manus that prioritizes privacy and is usa…☆5,849Updated 2 months ago
- Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation☆4,477Updated 7 months ago
- Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages☆2,620Updated last month
- Wan: Open and Advanced Large-Scale Video Generative Models☆13,991Updated last month
- A TTS that fits in your CPU (and pocket)☆2,995Updated this week
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆19,088Updated 2 months ago
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,832Updated 2 weeks ago
- Lightning-Fast, On-Device, Multilingual TTS — running natively via ONNX.☆2,552Updated 2 weeks ago
- An Open Source implementation of Notebook LM with more flexibility and features☆19,137Updated this week
- https://hf.co/hexgrad/Kokoro-82M☆5,574Updated 6 months ago
- Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model.☆3,485Updated last week
- ☆11,124Updated this week
- Kortix – build, manage and train AI Agents.☆19,325Updated this week
- An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System☆18,601Updated 2 months ago
- LLM agents built for control. Designed for real-world use. Deployed in minutes.☆17,703Updated this week
- Text-audio foundation model from Boson AI☆7,898Updated 3 weeks ago
- An open-source alternative to Claude Cowork, powered by opencode☆8,772Updated this week
- A research prototype of a human-centered web agent☆9,632Updated 2 weeks ago
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆4,750Updated last month
- AGENTS.md — a simple, open format for guiding coding agents☆16,991Updated last month
- Prompt Orchestration Markup Language☆4,846Updated 3 weeks ago
- SkyReels-V2: Infinite-length Film Generative model☆6,212Updated last week