Tencent-Hunyuan / Hunyuan-7B
Tencent Hunyuan 7B (Hunyuan-7B for short) is one of Tencent Hunyuan's dense large language models.
☆41 · Updated this week
Alternatives and similar repositories for Hunyuan-7B
Users interested in Hunyuan-7B are comparing it to the repositories listed below.
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc. ☆39 · Updated 10 months ago
- ☆33 · Updated this week
- Fast LLM training codebase with dynamic strategy choosing [DeepSpeed + Megatron + FlashAttention + CUDA fusion kernels + compiler] ☆40 · Updated last year
- An NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using LLaMA-Factory. ☆63 · Updated 9 months ago
- Code for KaLM-Embedding models ☆86 · Updated last month
- ☆288 · Updated 2 months ago
- Mixture-of-Experts (MoE) language model ☆189 · Updated 11 months ago
- Fused Qwen3 MoE layer for faster training; compatible with HF Transformers, LoRA, 4-bit quantization, and Unsloth ☆142 · Updated last week
- GLM series edge models ☆146 · Updated last month
- A script that automatically toggles Qwen3's reasoning and non-reasoning capabilities, based on an OpenAI-like API. The infere… ☆22 · Updated 3 months ago
- ☆51 · Updated last year
- Submodular optimization for context engineering: query fan-out, text selection, passage reranking ☆67 · Updated 3 weeks ago
- A light proxy solution for the Hugging Face Hub. ☆46 · Updated last year
- GPT-4-level conversational QA trained in a few hours ☆63 · Updated 11 months ago
- Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024] ☆86 · Updated 6 months ago
- vLLM adapter for a TGIS-compatible gRPC server. ☆34 · Updated this week
- ☆90 · Updated 2 months ago
- ☆102 · Updated 11 months ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc. ☆40 · Updated last year
- The newest version of Llama 3, with source code explained line by line in Chinese ☆22 · Updated last year
- Data preparation code for the CrystalCoder 7B LLM ☆45 · Updated last year
- CPM.cu is a lightweight, high-performance CUDA implementation for LLMs, optimized for on-device inference and featuring cutting-edge tec… ☆169 · Updated last week
- ☆111 · Updated last year
- An open-source LLM based on an MoE structure. ☆58 · Updated last year
- Deep Reasoning Translation (DRT) project ☆227 · Updated 2 months ago
- GGML implementation of the BERT model with Python bindings and quantization. ☆56 · Updated last year
- Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models ☆136 · Updated last year
- Inference Llama 2 in C++ ☆43 · Updated last year
- Evaluation of the BM42 sparse indexing algorithm ☆68 · Updated last year
- ☆48 · Updated 6 months ago