WWWWxp / M3-TTSLinks
Pytorch Implementation of the paper "M3-TTS: Multi-modal DiT Alignment & Mel-latent for Zero-shot High-fidelity Speech Synthesis"
☆110Updated last month
Alternatives and similar repositories for M3-TTS
Users that are interested in M3-TTS are comparing it to the libraries listed below
Sorting:
- Ultra-low bitrate speech codec (0.27-1 kbps) with cross-modal alignment and real-time capabilities☆213Updated 5 months ago
- ☆68Updated 4 months ago
- [ICLR'26] Scaling Up, Speeding Up: A Benchmark of Speculative Decoding for Efficient LLM Test-Time Scaling☆37Updated last week
- Stream-Omni is a GPT-4o-like language-vision-speech chatbot that simultaneously supports interaction across various modality combinations…☆383Updated 7 months ago
- Official repo for 'Large Multimodal Models Evaluation: A Survey'☆100Updated last month
- 🧩 An open-source multi-agent framework for intelligent health management, powered by the Linkage ecosystem.☆92Updated last week
- This repository contains experimental reports and training results for my research☆103Updated last month
- Ond ESG Intelligence Platform is a cloud-native solution that ingests ESG data, processes it with Azure Data Factory & Databricks, and ap…☆136Updated 2 months ago
- Updating curated list of research advancements on item identification in generative recommender systems.☆50Updated last week
- An efficient and lightweight disk space analysis tool.☆99Updated last month
- YiShape-Math is a Java math library that provides NumPy-like functionalities including vector & matrix operations, data visualization, st…☆201Updated last month
- 🧊 A High-Perf Quantitative Trading Framework for Crypto☆134Updated last month
- AI Agent Development Platform - Supports multiple models (OpenAI/DeepSeek/Wenxin/Tongyi), knowledge base management, workflow automation,…☆525Updated last month
- A modern and efficient travel planning companion.☆50Updated last month
- 以帮助你快速找到 LLM 相关工作,尽快抓住 AI 红利为目标的【LLM 教程】☆114Updated last week
- 我是一个开放平台、开发者平台、单点登录平台、基础管理平台。包含:单点登录、Oauth2 登录、用户身份管理、应用申请、应用接入、主数据维护、主数据订阅、主数据广播、应用接口调用、接口管理管理和查看、系统配置等功能。☆85Updated this week
- ☆76Updated 3 months ago
- ☆75Updated 2 weeks ago
- 导航站☆50Updated 2 weeks ago
- Desktop Pixel Pet(桌面像素宠物)是一个轻量、可扩展的桌面陪伴应用:在你的电脑桌面上展示可爱的像素宠物,让它在屏幕角落里待机、走动、互动,陪你工作与学习。项目内置宠物商城与解锁机制,支持使用运行时间作为货币购买/激活宠物与粮食,并提供本地数据导入/导出能力,…☆64Updated last month
- [ICCV 2025] Official code for "Align Your Rhythm: Generating Highly Aligned Dance Poses with Gating-Enhanced Rhythm-Aware Feature Represe…☆92Updated 3 months ago
- ☆33Updated this week
- 🎓 机器学习与深度学习实战教程 | Comprehensive ML & DL Tutorial with Jupyter Notebooks | 包含线性回归、神经网络、CNN、RNN等完整教程☆318Updated this week
- cf worker reverse proxy☆41Updated 2 months ago
- 心理健康倾诉管理系统☆54Updated last month
- BizSpring Java开发定制,线上商城,购物商城,商城网站,在线购物,免费建站,mall☆75Updated 2 months ago
- 🧠 AI-powered Personalized Exam System: Integrating OpenPangu LLM, Knowledge Graph RAG, and BKT algorithm for adaptive question generatio…☆158Updated this week
- Langgraph V1 入门+进阶教程☆78Updated last month
- GPU-Health-eXpert☆68Updated 3 months ago
- An open-source Vibe platform similar to Claude Cowork / Manus / Clawdbot, with professional rich image document generation.☆74Updated this week