WWWWxp / M3-TTSLinks
Pytorch Implementation of the paper "M3-TTS: Multi-modal DiT Alignment & Mel-latent for Zero-shot High-fidelity Speech Synthesis"
☆113Updated last month
Alternatives and similar repositories for M3-TTS
Users that are interested in M3-TTS are comparing it to the libraries listed below
Sorting:
- Ultra-low bitrate speech codec (0.27-1 kbps) with cross-modal alignment and real-time capabilities☆214Updated 5 months ago
- ☆68Updated 4 months ago
- Stream-Omni is a GPT-4o-like language-vision-speech chatbot that simultaneously supports interaction across various modality combinations…☆386Updated 7 months ago
- [ICLR'26] Scaling Up, Speeding Up: A Benchmark of Speculative Decoding for Efficient LLM Test-Time Scaling☆37Updated last week
- 🧩 An open-source multi-agent framework for intelligent health management, powered by the Linkage ecosystem.☆94Updated last week
- Official repo for 'Large Multimodal Models Evaluation: A Survey'☆100Updated last month
- ☆76Updated 2 weeks ago
- This repository contains experimental reports and training results for my research☆106Updated this week
- An efficient and lightweight disk space analysis tool.☆99Updated last month
- Ond ESG Intelligence Platform is a cloud-native solution that ingests ESG data, processes it with Azure Data Factory & Databricks, and ap…☆142Updated 3 months ago
- 我是一个开放平台、开发者平台、单点登录平台、基础管理平台。包含:单点登录、Oauth2 登录、用户身份管理、应用申请、应用接入、主数据维护、主数据订阅、主数据广播、应用接口调用、接口管理管理和查看、系统配置等功能。☆85Updated last week
- [ACMMM 2025] "Set You Straight: Auto-Steering Denoising Trajectories to Sidestep Unwanted Concepts" (Official Implementation)☆81Updated 7 months ago
- [ICCV 2025] Official code for "Align Your Rhythm: Generating Highly Aligned Dance Poses with Gating-Enhanced Rhythm-Aware Feature Represe…☆92Updated 3 months ago
- ☆76Updated 3 months ago
- ☆33Updated this week
- YiShape-Math is a Java math library that provides NumPy-like functionalities including vector & matrix operations, data visualization, st…☆209Updated last month
- A modern and efficient travel planning companion.☆49Updated last month
- 🧊 A High-Perf Quantitative Trading Framework for Crypto☆142Updated last month
- ☆49Updated last month
- Updating curated list of research advancements on item identification in generative recommender systems.☆50Updated last week
- 以帮助你快速找到 LLM 相关工作,尽快抓住 AI 红利为目标的【LLM 教程】☆114Updated last week
- 导航站☆50Updated last week
- cf worker reverse proxy☆42Updated 2 months ago
- CHATS: Combining Human-Aligned Optimization and Test-Time Sampling for Text-to-Image Generation (ICML2025)☆116Updated 5 months ago
- [NeurIPS2025] AdaDetectGPT: Adaptive Detection of LLM-Generated Text with Statistical Guarantees☆62Updated 2 months ago
- LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement Fine-Tuning Framework☆153Updated this week
- 2025春国科大情感计算大作业☆51Updated 7 months ago
- GPU-Health-eXpert☆69Updated 3 months ago
- 🧠 AI-powered Personalized Exam System: Integrating OpenPangu LLM, Knowledge Graph RAG, and BKT algorithm for adaptive question generatio…☆174Updated this week
- A project which can organize self-driving blockchains☆70Updated 3 months ago