Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b MoE model based on llama3.
☆27Jul 1, 2024Updated last year
Alternatives and similar repositories for llama3-8x8b-MoE
Users that are interested in llama3-8x8b-MoE are comparing it to the libraries listed below
Sorting:
- pre-training llama3 using chinese☆13May 1, 2024Updated last year
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Feb 10, 2026Updated 3 weeks ago
- ☆10Jul 20, 2024Updated last year
- Inference deployment of the llama3☆11Apr 21, 2024Updated last year
- Responsible Robotic Manipulation☆16Aug 31, 2025Updated 6 months ago
- flux1非官方的量化模型(flux1 unofficial quantize model)☆12Aug 14, 2024Updated last year
- 中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.☆14Aug 25, 2024Updated last year
- smart chinese LLm☆19Jan 31, 2024Updated 2 years ago
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆17Apr 24, 2024Updated last year
- An open-source toolkit helping developers build natural language database query solutions☆27May 5, 2025Updated 10 months ago
- ☆11Updated this week
- Llama.cui is a small llama.cpp-based chat application for Node.js☆20Jul 10, 2025Updated 8 months ago
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆19Jun 9, 2024Updated last year
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Feb 11, 2025Updated last year
- 浏览器AI插件,一键把网页文章内容生成为思维导图,很方便。☆26Jul 4, 2024Updated last year
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆27Jul 26, 2025Updated 7 months ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- ☆26Feb 28, 2026Updated last week
- Text-to-Speech (TTS) engine for the Armenian language☆12Sep 29, 2024Updated last year
- A simple WeChat Official Account layout tool based on Dify☆17Jun 27, 2025Updated 8 months ago
- Difyで作る生成AIアプリ完全入門☆17May 25, 2025Updated 9 months ago
- A cross-platform desktop client supporting multiple LLM providers, integrated with AI search, developer tools, and third-party AI tool ac…☆54Aug 6, 2025Updated 7 months ago
- Golang web client for Ollama, fast and easy to use.☆32Jul 18, 2025Updated 7 months ago
- A desktop GUI for Flux 1.1 Pro built using DelphiFMX For Python☆11Oct 5, 2024Updated last year
- Workflow automation, but you just describe what you want and it happens.☆27Nov 22, 2025Updated 3 months ago
- Write the database metadata into the dify knowledge☆12Dec 30, 2025Updated 2 months ago
- ☆28Dec 4, 2025Updated 3 months ago
- LCM Drawing app☆12Dec 1, 2023Updated 2 years ago
- Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)☆30Feb 21, 2026Updated 2 weeks ago
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆28Feb 13, 2026Updated 3 weeks ago
- ☆11Aug 29, 2025Updated 6 months ago
- (撰写ing..)本仓库偏教程性质,以「模型中文化」为一个典型的模型训练问题切入场景,指导读者上手学习LLM二次微调训练。☆37Aug 5, 2024Updated last year
- Dify插件市场应用,开箱即用,用于快速对接微信公众号☆61Oct 5, 2025Updated 5 months ago
- A framework for evaluating function calls made by LLMs☆40Jul 23, 2024Updated last year
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- ☆28Jun 27, 2025Updated 8 months ago
- ☆11Oct 25, 2024Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Updated this week
- An SSH plugin for Dify☆13Jan 16, 2026Updated last month