Tencent-Hunyuan/Hunyuan-MT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Tencent-Hunyuan/Hunyuan-MT)

Tencent-Hunyuan / Hunyuan-MT

☆713

Alternatives and similar repositories for Hunyuan-MT

Users that are interested in Hunyuan-MT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Tencent-Hunyuan / Hy-MT
View on GitHub
☆797Jun 1, 2026Updated last month
ByteDance-Seed / Seed-X-7B
View on GitHub
☆170Aug 18, 2025Updated 11 months ago
facebookresearch / omnilingual-asr
View on GitHub
Omnilingual ASR Open-Source Multilingual SpeechRecognition for 1600+ Languages
☆2,863Dec 30, 2025Updated 6 months ago
Tencent-Hunyuan / HunyuanImage-2.1
View on GitHub
HunyuanImage-2.1: An Efficient Diffusion Model for High-Resolution (2K) Text-to-Image Generation
☆674Oct 14, 2025Updated 9 months ago
stepfun-ai / Step-Audio2
View on GitHub
Step-Audio 2 is an end-to-end multi-modal large language model designed for industry-strength audio understanding and speech conversation…
☆1,488Mar 16, 2026Updated 4 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
Tencent-Hunyuan / HunyuanOCR
View on GitHub
HunyuanOCR-1.5: Making Lightweight OCR VLMs Faster and Better
☆1,895Updated this week
Tencent / AngelSlim
View on GitHub
Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.
☆1,491Updated this week
inclusionAI / Ming-UniAudio
View on GitHub
Ming-UniAudio: Speech LLM for Joint Understanding, Generation and Editing with Unified Representation
☆451Nov 27, 2025Updated 8 months ago
QwenLM / Qwen3-Omni
View on GitHub
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…
☆3,917Apr 23, 2026Updated 3 months ago
Tencent / POINTS-Reader
View on GitHub
☆197Dec 7, 2025Updated 7 months ago
krystalan / DRT
View on GitHub
Deep Reasoning Translation (DRT) Project
☆242Sep 1, 2025Updated 10 months ago
zai-org / GLM-TTS
View on GitHub
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
☆1,046Apr 10, 2026Updated 3 months ago
QwenLM / Qwen2-Audio
View on GitHub
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
☆2,097Apr 21, 2025Updated last year
boson-ai / higgs-audio
View on GitHub
Text-audio foundation model from Boson AI
☆8,304Jun 5, 2026Updated last month
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,559Updated this week
QwenAudio / Fun-ASR
View on GitHub
Open-source LLM-based ASR model family for Chinese, dialect, accent, and multilingual speech, with FunASR, vLLM, streaming, and llama.cpp…
☆1,445Updated this week
QwenAudio / SenseVoice
View on GitHub
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,954Updated this week
QwenAudio / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,464May 25, 2026Updated 2 months ago
EIT-NLP / LLaSO
View on GitHub
☆116Oct 21, 2025Updated 9 months ago
QwenLM / Qwen3-ASR-Toolkit
View on GitHub
Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support…
☆984Feb 5, 2026Updated 5 months ago
XiaomiMiMo / MiMo-Audio
View on GitHub
MiMo-Audio: Audio Language Models are Few-Shot Learners
☆1,070Jun 17, 2026Updated last month
TencentARC / AudioStory
View on GitHub
AudioStory: Generating Long-Form Narrative Audio with Large Language Models
☆301Sep 21, 2025Updated 10 months ago
zai-org / GLM-V
View on GitHub
GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
☆2,362Jul 21, 2026Updated last week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
bytedance / USO
View on GitHub
[CVPR 2026] 🔥🔥 Official Repo of USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning
☆1,229Sep 12, 2025Updated 10 months ago
DataoceanAI / Dolphin
View on GitHub
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
☆776Jun 11, 2026Updated last month
alibaba / Logics-Parsing
View on GitHub
☆1,396May 13, 2026Updated 2 months ago
IEIT-AGI / Droplet3D
View on GitHub
☆43Sep 1, 2025Updated 10 months ago
zai-org / CogView4
View on GitHub
CogView4, CogView3-Plus and CogView3(ECCV 2024)
☆1,101Mar 29, 2025Updated last year
QwenAudio / FunCineForge
View on GitHub
☆444Mar 25, 2026Updated 4 months ago
Tencent-Hunyuan / Hunyuan-0.5B
View on GitHub
☆54Aug 5, 2025Updated 11 months ago
playht / PlayDiffusion
View on GitHub
☆539Jun 11, 2026Updated last month
QwenLM / Qwen-Image
View on GitHub
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
☆8,182Feb 10, 2026Updated 5 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Tencent-Hunyuan / Hunyuan-A13B
View on GitHub
Tencent Hunyuan A13B (short as Hunyuan-A13B), an innovative and open-source LLM built on a fine-grained MoE architecture.
☆817Jul 8, 2025Updated last year
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,023Dec 2, 2025Updated 7 months ago
Tongyi-Zhiwen / Qwen-Doc
View on GitHub
☆549May 25, 2026Updated 2 months ago
FireRedTeam / FireRedTTS2
View on GitHub
Long-form streaming TTS system for multi-speaker dialogue generation
☆1,417Oct 26, 2025Updated 9 months ago
Alibaba-NLP / DeepResearch
View on GitHub
Tongyi Deep Research, the Leading Open-source Deep Research Agent
☆19,752Feb 27, 2026Updated 5 months ago
FireRedTeam / FireRedASR
View on GitHub
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…
☆1,947Feb 25, 2026Updated 5 months ago
Tencent-Hunyuan / Hy-MT2
View on GitHub
☆509Jun 30, 2026Updated 3 weeks ago