Tencent / Tencent-Hunyuan-Large

☆1,510

Alternatives and similar repositories for Tencent-Hunyuan-Large

Users who are interested in Tencent-Hunyuan-Large are comparing it to the repositories listed below.

THUDM / CogView4
CogView4, CogView3-Plus and CogView3(ECCV 2024)
☆1,025 Updated last month
MiniMax-AI / MiniMax-01
The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention
☆2,599 Updated this week
MoonshotAI / Moonlight
Muon is Scalable for LLM Training
☆1,044 Updated last month
MoonshotAI / MoBA
MoBA: Mixture of Block Attention for Long-Context LLMs
☆1,776 Updated last month
baaivision / Emu3
Next-Token Prediction is All You Need
☆2,121 Updated 2 months ago
QwenLM / Qwen2.5-Omni
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe…
☆2,913 Updated this week
VITA-MLLM / VITA
✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
☆2,273 Updated last month
Tencent / HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
☆4,088 Updated 4 months ago
PRIME-RL / PRIME
Scalable RL solution for advanced reasoning of language models
☆1,552 Updated 2 months ago
MoonshotAI / Kimi-VL
Kimi-VL: Mixture-of-Experts Vision-Language Model for Multimodal Reasoning, Long-Context Understanding, and Strong Agent Capabilities
☆847 Updated 3 weeks ago
hao-ai-lab / FastVideo
FastVideo is a unified framework for accelerated video generation.
☆1,407 Updated this week
hiyouga / EasyR1
EasyR1: An Efficient, Scalable, Multi-Modality RL Training Framework based on veRL
☆2,355 Updated this week
Open-Reasoner-Zero / Open-Reasoner-Zero
Official Repo for Open-Reasoner-Zero
☆1,916 Updated last month
hkust-nlp / simpleRL-reason
Simple RL training for reasoning
☆3,560 Updated last month
Deep-Agent / R1-V
Witness the aha moment of VLM with less than $3.
☆3,658 Updated 2 months ago
QwenLM / Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
☆1,721 Updated 3 weeks ago
ByteDance-Seed / Seed-Thinking-v1.5
☆746 Updated 3 weeks ago
AIDC-AI / Ovis
A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.
☆913 Updated last month
HumanMLLM / R1-Omni
☆875 Updated last month
stepfun-ai / Step-Video-T2V
☆2,928 Updated 2 months ago
NVlabs / Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
☆4,126 Updated last week
SkyworkAI / Skywork-OR1
Unleashing the Power of Reinforcement Learning for Math and Code Reasoners
☆566 Updated this week
XiaomiMiMo / MiMo
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
☆1,332 Updated this week
agentica-project / rllm
Democratizing Reinforcement Learning for LLMs
☆3,236 Updated this week
NUS-HPC-AI-Lab / VideoSys
VideoSys: An easy and efficient system for video generation
☆1,963 Updated 2 months ago
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,750 Updated 9 months ago
stepfun-ai / Step1X-Edit
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…
☆1,191 Updated this week
Tencent / HunyuanCustom
HunyuanCustom: A Multimodal-Driven Architecture for Customized Video Generation
☆823 Updated this week
SandAI-org / MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
☆3,001 Updated this week
Unakar / Logic-RL
Reproduce R1 Zero on Logic Puzzle
☆2,337 Updated last month