A Unified Framework for Expressive Speech Synthesis with Voice Cloning
☆416Dec 3, 2025Updated 5 months ago
Alternatives and similar repositories for Marco-Voice
Users that are interested in Marco-Voice are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆47Sep 2, 2025Updated 8 months ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 10 months ago
- ☆101Jan 19, 2026Updated 4 months ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆35Oct 23, 2025Updated 6 months ago
- Align Anything: Training All-modality Model with Feedback☆4,650Nov 27, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆81Aug 11, 2025Updated 9 months ago
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.☆248Feb 25, 2026Updated 2 months ago
- Descript Audio Codec - VAE Variant (.dac-vae): High-Fidelity Audio Compression with Variational Autoencoder☆36Aug 30, 2025Updated 8 months ago
- This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …☆57Aug 9, 2025Updated 9 months ago
- Voicebox: Text-Guided Multilingual Universal Speech Generation at Scale☆28Aug 4, 2023Updated 2 years ago
- 数字底座是一款面向大型政府、企业数字化转型,基于身份认证、组织架构、岗位职务、应用系统、资源角色、数据目录、安全控制等功能构建的统一且安全的管理支撑平台。数字底座基于三员管理模式,具备微服务、多租户、容器化和国产化,支持用户利用代码生成器快速构建自己的业务应用,同时可关联诸…☆2,591May 12, 2026Updated last week
- poorman's ar-dit tts☆45Dec 31, 2025Updated 4 months ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆53Sep 20, 2025Updated 8 months ago
- AppPlatform 是一个前沿的大模型应用工程,旨在通过集成的声明式编程和低代码配置工具,简化和优化大模型的训练与推理应用的开发过程。本工程为软件工程师和产品经理提供一个强大的、可扩展的环境,以支持从概念到部署的全流程 AI 应用开发。☆1,431May 7, 2026Updated 2 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- FIT: 企业级AI开发框架,提供多语言函数引擎(FIT)、流式编排引擎(WaterFlow)及Java生态的LangChain替代方案(FEL)。原生/Spring双模运行,支持插件热插拔与智能聚散部署,无缝统一大模型与业务系统。☆2,106Mar 13, 2026Updated 2 months ago
- The next generation deep reinforcement learning tookit☆3,464Jun 16, 2023Updated 2 years ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆112Apr 1, 2024Updated 2 years ago
- [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"☆373Sep 3, 2024Updated last year
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.☆190Feb 28, 2026Updated 2 months ago
- ☆70Sep 3, 2024Updated last year
- Variable Bitrate Residual Vector Quantization for Audio Coding☆52May 1, 2025Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆108Jan 17, 2025Updated last year
- ☆25Jan 24, 2023Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- 💰唯一正版💰 minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy minerproxy 矿池抽水 矿池代理 矿池中转 矿池抽…☆3,871May 8, 2026Updated last week
- SDG is a specialized framework designed to generate high-quality structured tabular data.☆2,419May 11, 2026Updated last week
- ☆40Jul 15, 2025Updated 10 months ago
- A Doctor for your data☆3,486Jan 14, 2025Updated last year
- 2024 Latest laughter detection & segmentaion model. Paper: "Robust Laughter Segmentation with Automatic Diverse Data Synthesis", Interspe…☆65Sep 1, 2024Updated last year
- The first open autoregressive foundational video AI model.☆2,892Oct 14, 2024Updated last year
- ☆36Sep 6, 2025Updated 8 months ago
- ☆71Jul 13, 2023Updated 2 years ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆353Jul 21, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- High quality text-to-speech based on StyleTTS 2.☆77Apr 6, 2026Updated last month
- The baselines of ARC-Challenge-Interspeech2026☆59Dec 1, 2025Updated 5 months ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- VoiceLDM: Text-to-Speech with Environmental Context☆192Aug 9, 2024Updated last year
- ☆341Jul 4, 2025Updated 10 months ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆49Feb 17, 2026Updated 3 months ago
- speaker-disentangled speech linguistic content quantizer☆25Mar 19, 2025Updated last year