cooper12121 / llama3-8x8b-MoE
Copies the Llama 3 MLP eight times to serve as eight experts, creates a randomly initialized router, and adds a load-balancing loss to construct an 8x8B MoE model based on Llama 3.
☆27Jul 1, 2024Updated last year
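The construction described above (copy one dense MLP into eight experts, add a randomly initialized router, add a load-balancing loss) can be sketched in plain Python. This is a minimal illustration, not the repository's code: the helper names, the top-2 routing, the 0.02 initialization scale, and the Switch-Transformer-style form of the auxiliary loss are all assumptions, since the description does not specify them.

```python
import copy
import math
import random

NUM_EXPERTS = 8  # from the description: the MLP is copied 8 times
TOP_K = 2        # assumption: the description does not say how many experts run per token


def upcycle_mlp(dense_mlp, num_experts=NUM_EXPERTS):
    """Return independent copies of one dense MLP's weights, one per expert."""
    return [copy.deepcopy(dense_mlp) for _ in range(num_experts)]


def init_router(hidden_size, num_experts=NUM_EXPERTS, std=0.02, seed=0):
    """Randomly initialized router: one weight vector per expert (std is an assumption)."""
    rng = random.Random(seed)
    return [[rng.gauss(0.0, std) for _ in range(hidden_size)] for _ in range(num_experts)]


def softmax(logits):
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def route(router, tokens, top_k=TOP_K):
    """Return per-token router probabilities and the indices of the top-k experts."""
    probs, chosen = [], []
    for tok in tokens:
        logits = [sum(w * x for w, x in zip(row, tok)) for row in router]
        p = softmax(logits)
        probs.append(p)
        chosen.append(sorted(range(len(p)), key=lambda i: p[i], reverse=True)[:top_k])
    return probs, chosen


def load_balancing_loss(probs, chosen, num_experts=NUM_EXPERTS):
    """Auxiliary loss N * sum_i f_i * P_i (Switch-Transformer style, assumed here)."""
    n_tokens = len(probs)
    n_assign = sum(len(c) for c in chosen)
    loss = 0.0
    for i in range(num_experts):
        f_i = sum(c.count(i) for c in chosen) / n_assign  # share of assignments sent to expert i
        p_i = sum(p[i] for p in probs) / n_tokens         # mean router probability of expert i
        loss += f_i * p_i
    return num_experts * loss
```

With this form of the loss, perfectly uniform routing gives a value of 1, and routing every token to a single expert with full probability gives a value equal to the number of experts, so minimizing it pushes the router toward even expert usage.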
Alternatives and similar repositories for llama3-8x8b-MoE
Users that are interested in llama3-8x8b-MoE are comparing it to the libraries listed below
- Pre-training Llama 3 using Chinese☆13May 1, 2024Updated last year
- Inference deployment of Llama 3☆11Apr 21, 2024Updated last year
- ☆10Jul 20, 2024Updated last year
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Feb 10, 2026Updated last week
- Responsible Robotic Manipulation☆16Aug 31, 2025Updated 5 months ago
- Unofficial quantized model for flux1☆12Aug 14, 2024Updated last year
- Chinese version of hf-alignment-handbook: a complete set of LLM training tutorials covering SFT, DPO, ORPO, and CPT.☆14Aug 25, 2024Updated last year
- Smart Chinese LLM☆19Jan 31, 2024Updated 2 years ago
- An open-source toolkit helping developers build natural language database query solutions☆27May 5, 2025Updated 9 months ago
- A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO☆17Apr 24, 2024Updated last year
- ☆11Updated this week
- Terminal Voice Assistant is a powerful and flexible tool designed to help users interact with their terminal using natural language comma…☆19Jun 9, 2024Updated last year
- Llama.cui is a small llama.cpp-based chat application for Node.js☆20Jul 10, 2025Updated 7 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆26Feb 11, 2025Updated last year
- Browser AI plugin that turns the content of a web article into a mind map with one click; very convenient.☆26Jul 4, 2024Updated last year
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆26Jul 26, 2025Updated 6 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆41Apr 4, 2025Updated 10 months ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- A complete introduction to building generative AI apps with Dify☆17May 25, 2025Updated 8 months ago
- A simple WeChat Official Account layout tool based on Dify☆16Jun 27, 2025Updated 7 months ago
- Text-to-Speech (TTS) engine for the Armenian language☆12Sep 29, 2024Updated last year
- A cross-platform desktop client supporting multiple LLM providers, integrated with AI search, developer tools, and third-party AI tool ac…☆53Aug 6, 2025Updated 6 months ago
- Golang web client for Ollama, fast and easy to use.☆32Jul 18, 2025Updated 6 months ago
- LCM Drawing app☆12Dec 1, 2023Updated 2 years ago
- Write the database metadata into the dify knowledge☆12Dec 30, 2025Updated last month
- ☆28Dec 4, 2025Updated 2 months ago
- A full-stack AI-powered business intelligence tool for non-experts, featuring serverless backend processing and a secure Streamlit fronte…☆25Jan 6, 2026Updated last month
- ☆11Aug 29, 2025Updated 5 months ago
- Workflow automation, but you just describe what you want and it happens.☆26Nov 22, 2025Updated 2 months ago
- A desktop GUI for Flux 1.1 Pro built using DelphiFMX For Python☆11Oct 5, 2024Updated last year
- 100 Production-Ready Claude Code Skills - The most comprehensive collection of AI skills for sales, business automation, content creation…☆35Oct 22, 2025Updated 3 months ago
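- Dify plugin marketplace app, ready to use out of the box, for quickly integrating with WeChat Official Accounts☆60Oct 5, 2025Updated 4 months ago
- (Work in progress) This repository is tutorial-oriented: it uses "adapting a model to Chinese" as a typical model-training scenario to guide readers through hands-on LLM fine-tuning.☆36Aug 5, 2024Updated last year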
- A framework for evaluating function calls made by LLMs☆40Jul 23, 2024Updated last year
- Dify knowledge base retrieval tool☆13Apr 3, 2025Updated 10 months ago
- ☆28Jun 27, 2025Updated 7 months ago
- LangReact is a configuration-driven development tool for Planning Agent applications; through configuration and plugins, it can quickly add planning capabilities to your GPT application.☆12Apr 23, 2024Updated last year
- Crawler for the Toutiao search engine and news detail pages (Selenium)☆15Mar 13, 2025Updated 11 months ago
- Kotlin library for Cortex.cpp, a local AI API platform used to run and customize LLMs.☆10Apr 2, 2025Updated 10 months ago