PyTorch implementation of the paper "M3-TTS: Multi-modal DiT Alignment & Mel-latent for Zero-shot High-fidelity Speech Synthesis"
☆117Dec 18, 2025Updated 2 months ago
Alternatives and similar repositories for M3-TTS
Users who are interested in M3-TTS are comparing it to the libraries listed below
- A modern and efficient travel planning companion.☆52Dec 21, 2025Updated 2 months ago
- cf worker reverse proxy☆43Nov 30, 2025Updated 3 months ago
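- A privacy-first data aggregation platform built with Rust, supporting multi-modal search across web search, RSS aggregation, weather, stocks, trending lists, and more; a privately deployed cloud data-retrieval hub.☆71Feb 25, 2026Updated last week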
- An Autonomous AI SRE Agent for Kubernetes, built with Java Spring Boot & LangChain4j. Implements OODA loop for self-healing.☆63Dec 29, 2025Updated 2 months ago
- This repository contains experimental reports and training results for my research☆112Feb 5, 2026Updated 3 weeks ago
- Luagin is a plugin based on the bukkit API and LuaJIT. It allows developers to highly customize the server through Lua scripts in an extr…☆38Nov 10, 2025Updated 3 months ago
- ☆44Dec 24, 2025Updated 2 months ago
- 简小派 job-search methodology: the right way to find a job☆130Nov 30, 2025Updated 3 months ago
- Ultra-low bitrate speech codec (0.27-1 kbps) with cross-modal alignment and real-time capabilities☆216Aug 27, 2025Updated 6 months ago
- 🧩 An open-source multi-agent framework for intelligent health management, powered by the Linkage ecosystem.☆99Jan 29, 2026Updated last month
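- devSphere-chat is a high-performance real-time chat system built on Spring Boot, supporting private chat, group chat, message persistence, offline message push, and more. It uses Netty for WebSocket communication, relies on Redis Stream for reliable message delivery, and integrates comprehensive user authentication and permission management…☆125Feb 23, 2026Updated last week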
- An out-of-the-box local Web UI for DeepSeek-OCR. Built with FastAPI + Vue.js, it supports PDF/Image uploads, progress tracking, and resu…☆80Dec 6, 2025Updated 2 months ago
- Deconstructing RocketMQ☆39Dec 2, 2025Updated 3 months ago
- ☆24Feb 8, 2026Updated 3 weeks ago
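- A tool with integrated chat-history export for both platforms that analyzes QQ and WeChat group chat records and generates a visual report of the year's trending words☆66Jan 4, 2026Updated 2 months ago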
- ai-smart-draw☆160Dec 4, 2025Updated 3 months ago
- Langgraph V1 beginner and advanced tutorial☆82Jan 4, 2026Updated 2 months ago
- Put some Christmas vibes to GitHub profile.☆54Dec 26, 2025Updated 2 months ago
- 📱 Practice algorithms anytime, anywhere | Native Android IDE for Competitive Programming | C++/Java/Python support | ✨ Syntax highlighti…☆112Feb 9, 2026Updated 3 weeks ago
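- Desktop Pixel Pet is a lightweight, extensible desktop companion app: it displays a cute pixel pet on your computer desktop that idles, walks around, and interacts in a corner of the screen, keeping you company while you work and study. The project includes a built-in pet shop and unlock mechanism, supports using run time as currency to buy/activate pets and food, and provides local data import/export,…☆74Feb 3, 2026Updated last month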
- An exciting fruit-slicing game using camera gesture controls, built with Three.js and MediaPipe.☆67Jan 13, 2026Updated last month
- [ICLR'26] Scaling Up, Speeding Up: A Benchmark of Speculative Decoding for Efficient LLM Test-Time Scaling☆38Jan 29, 2026Updated last month
- 小可の聚集地: my blog built with Next.js, TypeScript, MDX, and TailwindCSS.☆162Jan 26, 2026Updated last month
- NebulaKit is a Metal-based iOS 3D scene and terrain rendering engine designed for building interactive 3D browsers, terrain visualization…☆217Dec 24, 2025Updated 2 months ago
- rule☆47Nov 28, 2025Updated 3 months ago
- A management system for mental health confiding sessions☆57Dec 15, 2025Updated 2 months ago
- Official implementation of "Visual Instruction Pretraining for Domain-Specific Foundation Models"☆160Nov 12, 2025Updated 3 months ago
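- 🚀 One-click monitoring of ☁️ CZ's Twitter, ⚡️ extracts 🪙 tokens + 🔗 on-chain addresses within seconds, 🤖 automatically calculates slippage and executes DEX front-running. 📦 Open-source and lightweight; connect it to GitHub Actions for 24 h ⏰ unattended operation. ⚠️ For technical learning only, not investment advice. 🙏☆66Oct 10, 2025Updated 4 months ago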
- ☆31Feb 26, 2026Updated last week
- A fast, local-first web GUI for exploring large CSV/Parquet/JSONL files. Powered by VisiData’s engine. Opens millions of rows instantly. …☆58Dec 1, 2025Updated 3 months ago
- ☆34Nov 23, 2025Updated 3 months ago
- The code for the best headless UI component library tutorial series☆185Feb 13, 2026Updated 2 weeks ago
- A versatile WireGuard client for desktop, highly configurable, based on wg-easy.☆147Nov 29, 2025Updated 3 months ago
- 🧊 A High-Perf Quantitative Trading Framework for Crypto☆151Updated this week
- A library for Android that wraps incremental update algorithms (Bsdiff and HDiffPatch) to make it easy for developers to update apps incrementally☆34Feb 14, 2026Updated 2 weeks ago
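- Latest for 2025: a C++-focused autonomous driving resource repository, with explanations of core technologies such as perception and planning and interview questions for multiple roles, offering full support from technical learning to job hunting.☆245Dec 12, 2025Updated 2 months ago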
- ☆55Dec 31, 2025Updated 2 months ago
- ☆57Dec 18, 2025Updated 2 months ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆44Oct 28, 2024Updated last year