nlp-greyfoss / metagradLinks
一个用于学习的仿Pytorch纯Python实现的自动求导工具。
☆51Updated last year
Alternatives and similar repositories for metagrad
Users that are interested in metagrad are comparing it to the libraries listed below
Sorting:
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆159Updated last year
- Inference code for LLaMA models☆128Updated 2 years ago
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与 大模型相关的各种技术、原理和应用。☆355Updated last year
- personal chatgpt☆396Updated 11 months ago
- pytorch distribute tutorials☆160Updated 5 months ago
- ☆80Updated last week
- DeepSpeed Tutorial☆104Updated last year
- 使用单个24G显卡,从0开始训练LLM☆55Updated 5 months ago
- pytorch分布式训练☆72Updated 2 years ago
- 一些 LLM 方面的从零复现笔记☆238Updated 7 months ago
- MindSpore online courses: Step into LLM☆482Updated 3 weeks ago
- ☆51Updated 2 years ago
- Huggingface transformers的中文文档☆282Updated 2 years ago
- 对llama3进行全参微调、lora微调以及qlora微调。☆211Updated last year
- ☆115Updated last year
- 记录个人的学习历程。包括但不限于算法、机器学习、论文写作等。☆111Updated 9 months ago
- 怎么训练一个LLM分词器☆154Updated 2 years ago
- 大模型/LLM推理和部署理论与实践☆366Updated 5 months ago
- from MHA, MQA, GQA to MLA by 苏剑林, with code☆34Updated 9 months ago
- This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.☆482Updated 7 months ago
- ☆180Updated last week
- ☆47Updated last year
- 个人总结的大模型、自然语言处理NLP、多模态、计算机视觉CV等方向paper的阅读笔记;收集到或者使用到的一些NLP、CV等领域的优秀开源仓库;其他:如数据集、评测leaderboard等☆57Updated 2 weeks ago
- Train a 1B LLM with 1T tokens from scratch by personal☆766Updated 7 months ago
- NumPy-based Dynamic Deep Learning Framework☆84Updated 3 months ago
- From Llama to Deepseek, grpo/mtp implemented. With pt/sft/lora/qlora included☆30Updated 7 months ago
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆183Updated 2 years ago
- LLM Tokenizer with BPE algorithm☆45Updated last year
- 深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解。☆503Updated 6 months ago
- 关于Transformer模型的最简洁pytorch实现,包含详细注释☆227Updated 2 years ago