chenweiphd / DeepSeek-MoE-ResourceMap
☆131Updated 2 months ago
Alternatives and similar repositories for DeepSeek-MoE-ResourceMap
Users who are interested in DeepSeek-MoE-ResourceMap are comparing it to the repositories listed below
- LLM101n: Let's build a Storyteller, Chinese edition☆132Updated 9 months ago
- LLM notes covering model inference, transformer model structure, and LLM framework code analysis.☆767Updated this week
- Interpretation, extension, and reproduction of the DeepSeek series of work.☆643Updated last month
- ☆310Updated 5 months ago
- As the name suggests: a hand-built RAG☆122Updated last year
- vLLM Documentation in Simplified Chinese☆69Updated this week
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆251Updated this week
- A curated collection of high-quality full-stack LLM resources☆547Updated 5 months ago
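- unify-easy-llm (ULM) aims to be a simple, one-click large-model training tool that supports different hardware such as Nvidia GPU and Ascend NPU, as well as commonly used large models.☆55Updated 9 months ago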
- Theory and practice of large model/LLM inference and deployment☆259Updated 2 months ago
- Inference code for LLaMA models☆120Updated last year
- UltraScale Playbook, Chinese edition☆37Updated 2 months ago
- LLM Inference benchmark☆417Updated 9 months ago
- Awesome LLM pre-training resources, including data, frameworks, and methods.☆144Updated 2 weeks ago
- ☆108Updated 6 months ago
- GLM Series Edge Models☆139Updated 2 months ago
- ☆226Updated last year
- This is a user guide for the MiniCPM and MiniCPM-V series of small language models (SLMs) developed by ModelBest. “面壁小钢炮” focuses on achi…☆236Updated 6 months ago
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆682Updated 2 months ago
- Collect every awesome work about r1!☆363Updated 2 weeks ago
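- A project for training a large language model from scratch, including pre-training, fine-tuning, and direct preference optimization. The model has 1B parameters and supports Chinese and English.☆392Updated 2 months ago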
- Triton Documentation in Simplified Chinese☆69Updated last month
- Inferflow is an efficient and highly configurable inference engine for large language models (LLMs).☆242Updated last year
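- Uses free large-model APIs together with your private-domain data to generate SFT training data at no cost; supports the training data formats of tools such as llamafactory. Synthetic data.☆160Updated 5 months ago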
- ☆53Updated 2 months ago
- Learn large models through pictures☆300Updated 9 months ago
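- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to understanding, discussing, and implementing the techniques, principles, and applications related to large models.☆321Updated 9 months ago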
- Chinese edition of llm-numbers☆123Updated last year
- A visualization tool for deeper understanding and easier debugging of RLHF training.☆189Updated 2 months ago
- Notes on reproducing various LLM-related work from scratch☆192Updated 2 weeks ago