mindspore-lab / mindpet
☆45Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for mindpet
- ☆145Updated this week
- ☆74Updated 11 months ago
- ☆51Updated last year
- 怎么训练一个LLM分词器☆130Updated last year
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆269Updated 4 months ago
- Inference code for LLaMA models☆109Updated last year
- unify-easy-llm(ULM)旨在打造一个简易的一键式大模型训练工具,支持Nvidia GPU、Ascend NPU等不同硬件以及常用的大模型。☆40Updated 3 months ago
- FlagScale is a large model toolkit based on open-sourced projects.☆178Updated this week
- 中文大模型微调(LLM-SFT), 数学指令数据集MWP-Instruct, 支持模型(ChatGLM-6B, LLaMA, Bloom-7B, baichuan-7B), 支持(LoRA, QLoRA, DeepSpeed, UI, TensorboardX), 支持(微…☆169Updated 6 months ago
- Models and examples built with OneFlow☆96Updated last month
- A MoE impl for PyTorch, [ATC'23] SmartMoE☆57Updated last year
- ☆26Updated this week
- Baichuan2代码的逐行解析版本,适合小白☆208Updated last year
- 《ChatGPT原理与实战:大型语言模型的算法、技术和私有化》☆341Updated 11 months ago
- LLM Inference benchmark☆350Updated 4 months ago
- A streamlined and customizable framework for efficient large model evaluation and performance benchmarking☆263Updated this week
- 专注于Python/C++/CUDA、ML/DL/RL和NLP/KG/DS/LLM领域的技术分享。☆63Updated 4 months ago
- Transformer related optimization, including BERT, GPT☆39Updated last year
- ☆290Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMs☆124Updated 11 months ago
- 大模型/LLM推理和部署理论与实践☆84Updated this week
- ☆82Updated last year
- 中文书籍收录整理, Collection of Chinese Books☆173Updated 10 months ago
- 使用单个24G显卡,从0开始训练LLM☆49Updated last month
- FlagEval is an evaluation toolkit for AI large foundation models.☆302Updated 4 months ago
- ☆21Updated last year
- Best practice for training LLaMA models in Megatron-LM☆628Updated 10 months ago
- RTP-LLM: Alibaba's high-performance LLM inference engine for diverse applications.☆547Updated last month
- Transformer related optimization, including BERT, GPT☆17Updated last year
- A flexible and efficient training framework for large-scale alignment tasks☆211Updated this week