nlp-greyfoss / metagradLinks
一个用于学习的仿Pytorch纯Python实现的自动求导工具。
☆51Updated last year
Alternatives and similar repositories for metagrad
Users that are interested in metagrad are comparing it to the libraries listed below
Sorting:
- Inference code for LLaMA models☆121Updated last year
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆154Updated 8 months ago
- NumPy实现类PyTorch的动态计算图和神经网络框架(MLP, CNN, RNN, Transformer)☆81Updated 11 months ago
- ☆82Updated 8 months ago
- simplest online-softmax notebook for explain Flash Attention☆10Updated 5 months ago
- ☆72Updated last month
- DeepSpeed Tutorial☆97Updated 10 months ago
- ☆36Updated last year
- 使用单个24G显卡,从0开始训练LLM☆55Updated last month
- Implementation of Denoising Diffusion Probabilistic Model in MindSpore☆36Updated 2 years ago
- ☆52Updated last year
- from MHA, MQA, GQA to MLA by 苏剑林, with code☆22Updated 4 months ago
- personal chatgpt☆373Updated 6 months ago
- Triton Documentation in Chinese Simplified / Triton 中文文档☆71Updated 2 months ago
- 怎么训练一个LLM分词器☆150Updated last year
- ☆109Updated 7 months ago
- LLM101n: Let's build a Storyteller 中文版☆131Updated 10 months ago
- ☆34Updated 6 months ago
- ☆137Updated last month
- pytorch分布式训练☆66Updated last year
- 通义千问的DPO训练☆49Updated 9 months ago
- Pretrain、decay、SFT a CodeLLM from scratch 🧙♂️☆36Updated last year
- DeepSpeed教程 & 示例注释 & 学习笔记 (大模型高效训练)☆169Updated last year
- pytorch distribute tutorials☆138Updated last week
- 一些 LLM 方面的从零复现笔记☆203Updated last month
- llama,chatglm 等模型的微调☆89Updated 11 months ago
- 《深度学习入门2-自制框架》Building Deep Learning Framework☆42Updated last year
- ☆18Updated 3 years ago
- mindspore implementation of transformers☆67Updated 2 years ago
- 天池算法比赛《BetterMixture - 大模型数据混合挑战赛》的第一名top1解决方案☆31Updated 11 months ago