Tencent-Hunyuan/Hy-MT

☆797

Alternatives and similar repositories for Hy-MT

Users who are interested in Hy-MT are comparing it to the repositories listed below.

Tencent-Hunyuan / Hunyuan-MT
☆713 · Dec 30, 2025 · Updated 6 months ago
Tencent-Hunyuan / Hy-MT2
☆509 · Jun 30, 2026 · Updated last month
LemonSky1995 / DreamStyle
DreamStyle: A Unified Framework for Video Stylization
☆124 · Jan 7, 2026 · Updated 6 months ago
Tencent-Hunyuan / HunyuanOCR
HunyuanOCR-1.5: Making Lightweight OCR VLMs Faster and Better
☆1,895 · Updated this week
guilinhu / proactive_hearing_assistant
Code for the paper Proactive Hearing Assistants that Isolate Egocentric Conversations
☆46 · Nov 19, 2025 · Updated 8 months ago
snowflakewang / CustomX
[ECCV 2026] CustomX: Unified Character, Action, and Scene Customization in Video World Models
☆96 · Jun 25, 2026 · Updated last month
QwenLM / Qwen3-ASR
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music…
☆3,246 · Jun 26, 2026 · Updated last month
XiaokunSun / MorphAny3D
[CVPR 2026] Official repo of "MorphAny3D: Unleashing the Power of Structured Latent in 3D Morphing"
☆110 · Apr 13, 2026 · Updated 3 months ago
Tencent-Hunyuan / HY-Motion-1.0
HY-Motion model for 3D human motion or 3D character animation generation.
☆2,477 · Jul 18, 2026 · Updated last week
QwenAudio / Fun-Audio-Chat
Fun-Audio-Chat is a Large Audio Language Model built for natural, low-latency voice interactions.
☆985 · Feb 27, 2026 · Updated 5 months ago
Tencent / AngelSlim
Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.
☆1,491 · Updated this week
ATH-MaaS / Ovis-Image
Ovis-Image is a 7B text-to-image model specifically optimized for high-quality text rendering, designed to operate efficiently under stri…
☆321 · May 15, 2026 · Updated 2 months ago
zai-org / GLM-TTS
GLM-TTS: Controllable & Emotion-Expressive Zero-shot TTS with Multi-Reward Reinforcement Learning
☆1,046 · Apr 10, 2026 · Updated 3 months ago
ysharma3501 / NovaSR
A lightning fast audio upsampler.
☆775 · Feb 26, 2026 · Updated 5 months ago
SOTAMak1r / VINO-code
A Unified Visual Generator with Interleaved OmniModal Context
☆232 · Mar 5, 2026 · Updated 4 months ago
zai-org / GLM-ASR
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters
☆835 · Mar 6, 2026 · Updated 4 months ago
Francis-Rings / FlashPortrait
[CVPR 2026] We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length vide…
☆480 · Feb 21, 2026 · Updated 5 months ago
QwenAudio / Fun-ASR
Open-source LLM-based ASR model family for Chinese, dialect, accent, and multilingual speech, with FunASR, vLLM, streaming, and llama.cpp…
☆1,445 · Updated this week
yichuanH / GaMO_official
☆73 · Jan 12, 2026 · Updated 6 months ago
facebookresearch / omnilingual-asr
Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages
☆2,863 · Dec 30, 2025 · Updated 6 months ago
ysy31415 / EffectMaker
Code repo for EffectMaker: Unifying Reasoning and Generation for Customized Visual Effect Creation
☆42 · Mar 6, 2026 · Updated 4 months ago
k2-fsa / OmniVoice
High-Quality Voice Cloning TTS for 600+ Languages
☆8,611 · Updated this week
modelscope / FunASR
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,542 · Updated this week
ssj9596 / One-to-All-Animation
[CVPR 2026 Poster] One-to-All Animation: Alignment-Free Character Animation and Image Pose Transfer
☆490 · Apr 19, 2026 · Updated 3 months ago
kszpxxzmc / ViSAudio
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
☆117 · Dec 11, 2025 · Updated 7 months ago
QwenAudio / SenseVoice
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,954 · Updated this week
ysharma3501 / LavaSR
🌋 LavaSR: Fast speech restoration and enhancement
☆566 · Jun 19, 2026 · Updated last month
QwenLM / Qwen3-TTS
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streamin…
☆12,672 · Mar 17, 2026 · Updated 4 months ago
QwenAudio / CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,464 · May 25, 2026 · Updated 2 months ago
Tongyi-MAI / Z-Image
☆11,797 · Feb 9, 2026 · Updated 5 months ago
microsoft / VibeVoice
Open-Source Frontier Voice AI
☆51,268 · Updated this week
ByteDance-Seed / Seed-X-7B
☆170 · Aug 18, 2025 · Updated 11 months ago
inclusionAI / TwinFlow
[ICLR 2026] Taming large-scale few-step training with self-adversarial flows! 👏🏻
☆537 · Feb 24, 2026 · Updated 5 months ago
jamichss / Stream-DiffVSR
The official repository of the paper "Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion"
☆310 · Jan 12, 2026 · Updated 6 months ago
JavisVerse / JavisGPT
[NeurIPS'25 Spotlight] Official implementation of "JavisGPT: A Unified Multi-modal LLM for Sounding-Video Comprehension and Generation"
☆75 · Feb 26, 2026 · Updated 5 months ago
ekwek1 / soprano
Soprano: Instant, Ultra-Realistic Text-to-Speech
☆1,382 · Jan 15, 2026 · Updated 6 months ago
FudanCVL / PSDesigner
[CVPR 2026] PSDesigner: Automated Graphic Design with a Human-Like Creative Workflow
☆149 · Mar 28, 2026 · Updated 4 months ago
PKU-YuanGroup / UltraShape-1.0
High-Fidelity 3D Shape Generation via Scalable Geometric Refinement
☆811 · Jan 6, 2026 · Updated 6 months ago
QwenLM / Qwen3-Omni
Qwen3-Omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, im…
☆3,917 · Apr 23, 2026 · Updated 3 months ago