Tencent-Hunyuan / Hunyuan-0.5BLinks
☆33Updated this week
Alternatives and similar repositories for Hunyuan-0.5B
Users that are interested in Hunyuan-0.5B are comparing it to the libraries listed below
Sorting:
- ☆20Updated last year
- Pixel Parsing. A reproduction of OCR-free end-to-end document understanding models with open data☆21Updated last year
- ASTChunk is a Python toolkit for code chunking using Abstract Syntax Trees (ASTs), designed to create structurally sound and meaningful c…☆54Updated last month
- XVERSE-MoE-A36B: A multilingual large language model developed by XVERSE Technology Inc.☆39Updated 10 months ago
- Tencent Hunyuan 7B (short as Hunyuan-7B) is one of the large language dense models of Tencent Hunyuan☆41Updated this week
- My personal implementation of the model from "Qwen-VL: A Frontier Large Vision-Language Model with Versatile Abilities", they haven't rel…☆12Updated last year
- Exploration of the multi modal fuyu-8b model of Adept. 🤓 🔍☆28Updated last year
- Fused Qwen3 MoE layer for faster training, compatible with HF Transformers, LoRA, 4-bit quant, Unsloth☆142Updated last week
- A REST API for vLLM, production ready☆24Updated last week
- ☆19Updated last month
- Rust bindings for CTranslate2☆14Updated 2 years ago
- ☆29Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆35Updated 2 years ago
- Multi-Layer Key-Value sharing experiments on Pythia models☆33Updated last year
- ☆13Updated last year
- AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…☆14Updated 2 weeks ago
- Simple Implementation of TinyGPTV in super simple Zeta lego blocks☆16Updated 8 months ago
- ☆48Updated 6 months ago
- Data preparation code for CrystalCoder 7B LLM☆45Updated last year
- imagetokenizer is a python package, helps you encoder visuals and generate visuals token ids from codebook, supports both image and video…☆35Updated last year
- XVERSE-MoE-A4.2B: A multilingual large language model developed by XVERSE Technology Inc.☆40Updated last year
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Updated last year
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Updated 8 months ago
- ☆16Updated last year
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 2 weeks ago
- [CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding☆25Updated 2 weeks ago
- Official implementations for paper: DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models☆15Updated last year
- ImageSlider custom component for gradio.☆42Updated last year
- PresentAgent: Multimodal Agent for Presentation Video Generation☆91Updated last week
- ☆11Updated 2 months ago