Tencent-Hunyuan / Hunyuan-7BLinks
Tencent Hunyuan 7B (short as Hunyuan-7B) is one of the large language dense models of Tencent Hunyuan
☆66Updated 3 months ago
Alternatives and similar repositories for Hunyuan-7B
Users that are interested in Hunyuan-7B are comparing it to the libraries listed below
Sorting:
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆38Updated last year
- ☆187Updated 2 months ago
- ☆45Updated 4 months ago
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆212Updated last month
- GLM Series Edge Models☆153Updated 5 months ago
- xllamacpp - a Python wrapper of llama.cpp☆66Updated this week
- ☆101Updated last year
- Fast LLM Training CodeBase With dynamic strategy choosing [Deepspeed+Megatron+FlashAttention+CudaFusionKernel+Compiler];☆41Updated last year
- Implemented a script that automatically adjusts Qwen3's inference and non-inference capabilities, based on an OpenAI-like API. The infere…☆22Updated 6 months ago
- ☆49Updated 10 months ago
- Its an open source LLM based on MOE Structure.☆58Updated last year
- Bambo is a new proxy framework. Compared with mainstream frameworks, it is more lightweight and flexible and can handle various load task…☆34Updated 9 months ago
- This is an NVIDIA AI Workbench example project that demonstrates an end-to-end model development workflow using Llamafactory.☆68Updated last year
- ☆300Updated 6 months ago
- A Toolkit for Running On-device Large Language Models (LLMs) in APP☆79Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Updated last year
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 5 months ago
- Deep Reasoning Translation (DRT) Project☆239Updated 3 months ago
- ☆91Updated 6 months ago
- Delta-CoMe can achieve near loss-less 1-bit compressin which has been accepted by NeurIPS 2024☆59Updated last year
- A third-party component library based on Gradio. Integrates Ant Design, Ant Design X, Monaco Editor and more advanced components to help…☆132Updated 2 weeks ago
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated last year
- ☆56Updated last year
- Try out HallOumi, a state-of-the-art claim verification model in a simple UI!☆41Updated 8 months ago
- ☆51Updated last year
- Cook up amazing multimodal AI applications effortlessly with MiniCPM-o☆225Updated 3 weeks ago
- Model compression toolkit engineered for enhanced usability, comprehensiveness, and efficiency.☆212Updated last week
- A repository aimed at pruning DeepSeek V3, R1 and R1-zero to a usable size☆79Updated 3 months ago
- ☆19Updated last year
- 🎧 Pod-Helper: Real-time audio transcription and repair on consumer hardware☆77Updated last year