Tencent-Hunyuan / Hunyuan-A13B
Tencent Hunyuan A13B (Hunyuan-A13B for short) is an innovative, open-source LLM built on a fine-grained MoE architecture.
☆719Updated 3 weeks ago
Alternatives and similar repositories for Hunyuan-A13B
Users who are interested in Hunyuan-A13B are comparing it to the repositories listed below.
- Open-source coding LLM for software engineering tasks☆886Updated last month
- GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai☆1,541Updated this week
- ☆207Updated this week
- MiniMax-M1, the world's first open-weight, large-scale hybrid-attention reasoning model.☆2,774Updated 3 weeks ago
- F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content.☆401Updated last month
- Self-Adapting Language Models☆733Updated last month
- ☆361Updated last week
- ☆510Updated last month
- Seed-Coder is a family of lightweight open-source code LLMs comprising base, instruct and reasoning models, developed by ByteDance Seed.☆539Updated 2 months ago
- Kyutai with an "eye"☆212Updated 4 months ago
- A Tree Search Library with Flexible API for LLM Inference-Time Scaling☆423Updated this week
- Make text LLMs listen and speak☆739Updated this week
- 🌐 WebThinker: Empowering Large Reasoning Models with Deep Research Capability☆1,199Updated this week
- ☆155Updated 3 months ago
- ☆628Updated this week
- II-Researcher: a new open-source framework designed to aid building search / research agents☆445Updated last month
- Inference, fine-tuning, and many more recipes for the Gemma family of models☆262Updated 2 weeks ago
- OpenAlpha_Evolve is an open-source Python framework inspired by the groundbreaking research on autonomous coding agents like DeepMind's A…☆861Updated 2 months ago
- Official Python implementation of the UTCP☆364Updated this week
- AlphaGo Moment for Model Architecture Discovery.☆794Updated last week
- An open-source application for building, observing, and collaborating with teams of AI agents.☆355Updated last week
- Inference service for Qwen2.5-VL-7b model☆191Updated 4 months ago
- Hypernetworks that adapt LLMs for specific benchmark tasks using only a textual task description as input☆836Updated last month
- OmniGen2: Exploration to Advanced Multimodal Generation.☆3,540Updated last week
- ☆144Updated 3 weeks ago
- Training teacher models with reinforcement learning so that they can teach LLMs how to reason for test-time scaling.☆324Updated last month
- A powerful Python library for creating and managing isolated desktop environments using Docker containers.☆312Updated last week
- Sparse inference for transformer-based LLMs☆196Updated this week
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆269Updated 2 months ago
- 🚀 MassGen: An Open-source Multi-Agent Scaling System Inspired by Grok Heavy and Gemini Deep Think. Join the discord channel: https://dis…☆167Updated this week