chenweiphd / DeepSeek-MoE-ResourceMapLinks
☆132Updated 3 months ago
Alternatives and similar repositories for DeepSeek-MoE-ResourceMap
Users that are interested in DeepSeek-MoE-ResourceMap are comparing it to the libraries listed below
Sorting:
- LLM101n: Let's build a Storyteller 中文版☆131Updated 9 months ago
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆55Updated 10 months ago
- DashInfer is a native LLM inference engine aiming to deliver industry-leading performance atop various hardware architectures, including …☆254Updated this week
- LLM notes, including model inference, transformer model structure, and llm framework code analysis notes.☆774Updated last week
- 一个手把手教你从零开始编写GPT并训练大语言模型的教程☆79Updated 4 months ago
- LLM全栈优质资源汇总☆566Updated 6 months ago
- DeepSeek 系列工作解读、扩展和复现。☆652Updated 2 months ago
- vLLM Documentation in Chinese Simplified / vLLM 中文文档☆75Updated 3 weeks ago
- 顾名思义:手搓的RAG☆123Updated last year
- 利用免费的大模型api来结合你的私域数据来生成sft训练数据(妥妥白嫖)支持llamafactory等工具的训练数据格式synthetic data☆164Updated 6 months ago
- Awesome LLM pre-training resources, including data, frameworks, and methods.☆170Updated last month
- 大模型/LLM推理和部署理论与实践☆266Updated 2 months ago
- UltraScale Playbook 中文版☆39Updated 2 months ago
- ☆228Updated last year
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆326Updated 10 months ago
- This repo is used for archiving my notes, codes and materials of cs learning.☆29Updated this week
- Triton Documentation in Chinese Simplified / Triton 中文文档☆71Updated last month
- Inference code for LLaMA models☆121Updated last year
- 🔥Your Daily Dose of AI Research from Hugging Face 🔥 Stay updated with the latest AI breakthroughs! This bot automatically collects and…☆51Updated this week
- 训练一个对中文支持更好的LLaVA模型,并开源训练代码和数据。☆59Updated 9 months ago
- Baichuan2代码的逐行解析版本,适合小白☆214Updated last year
- 一些 LLM 方面的从零复现笔记☆200Updated last month
- Converted the Jina Tokenizer regex pattern to python.☆26Updated 9 months ago
- ☆166Updated this week
- LLM Inference benchmark☆419Updated 10 months ago
- 看图学大模型☆303Updated 10 months ago
- An open-source solution for full parameter fine-tuning of DeepSeek-V3/R1 671B, including complete code and scripts from training to infer…☆689Updated 2 months ago
- GLM Series Edge Models☆141Updated 3 months ago
- Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"☆80Updated 9 months ago
- FlagScale is a large model toolkit based on open-sourced projects.☆281Updated this week