microsoft / VibeVoiceLinks
Open-Source Frontier Voice AI
☆18,756Updated last week
Alternatives and similar repositories for VibeVoice
Users that are interested in VibeVoice are comparing it to the libraries listed below
Sorting:
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆3,098Updated last week
- On-device TTS model by Neuphonic☆4,296Updated this week
- SoTA open-source TTS☆18,069Updated last week
- Send a phone call from AI agent, in an API call. Or, directly call the bot from the configured phone number!☆6,042Updated 2 months ago
- ☆6,053Updated 3 months ago
- Bytebot is a self-hosted AI desktop agent that automates computer tasks through natural language commands, operating within a containeriz…☆10,029Updated 3 months ago
- Simultaneous speech-to-text model☆9,311Updated last week
- Kyutai's Speech-To-Text and Text-To-Speech models based on the Delayed Streams Modeling framework.☆2,671Updated last month
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆3,954Updated last week
- LLM agents built for control. Designed for real-world use. Deployed in minutes.☆16,752Updated last week
- Towards Human-Sounding Speech☆5,828Updated 3 weeks ago
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi…☆19,438Updated last month
- ☆7,867Updated this week
- LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm.☆9,674Updated last week
- The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trai…☆2,506Updated this week
- Wan: Open and Advanced Large-Scale Video Generative Models☆13,045Updated last week
- Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation☆4,417Updated 6 months ago
- A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speec…☆3,122Updated this week
- A free, open source, and extensible speech-to-text application that works completely offline.☆8,844Updated this week
- State-of-the-art TTS model under 25MB 😻☆9,405Updated 4 months ago
- Lightning-Fast, On-Device TTS — running natively via ONNX.☆1,891Updated last week
- Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages☆2,504Updated last week
- Agent S: an open agentic framework that uses computers like a human☆9,078Updated last week
- A simple yet powerful agent framework that delivers with open-source models☆3,991Updated last week
- An Open Source implementation of Notebook LM with more flexibility and features☆16,320Updated last week
- https://hf.co/hexgrad/Kokoro-82M☆5,157Updated 4 months ago
- Open-source platform to build and deploy AI agent workflows.☆24,025Updated this week
- "DeepCode: Open Agentic Coding (Paper2Code & Text2Web & Text2Backend)"☆12,950Updated 2 weeks ago
- An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System☆16,967Updated 3 weeks ago
- ☆6,806Updated this week