nlp-greyfoss / metagrad
A pure-Python automatic differentiation tool modeled on PyTorch, built for learning purposes.
☆51 Updated 11 months ago
Alternatives and similar repositories for metagrad:
Users who are interested in metagrad are comparing it to the libraries listed below.
- Inference code for LLaMA models ☆118 Updated last year
- Train an LLM from scratch on a single 24 GB GPU ☆50 Updated 5 months ago
- An implementation of Transformer, BERT, GPT, and diffusion models for learning purposes ☆152 Updated 5 months ago
- PyTorch distributed training ☆64 Updated last year
- A PyTorch-like dynamic computation graph and neural network framework implemented in NumPy (MLP, CNN, RNN, Transformer) ☆80 Updated 9 months ago
- ☆68 Updated last month
- From MHA, MQA, GQA to MLA by 苏剑林, with code ☆13 Updated last month
- How to train an LLM tokenizer ☆142 Updated last year
- ☆52 Updated last year
- A trainable LLaMa3 reimplemented in NumPy ☆34 Updated 8 months ago
- Notes on reproducing LLM-related work from scratch ☆177 Updated 6 months ago
- DeepSpeed Tutorial ☆95 Updated 7 months ago
- personal chatgpt ☆356 Updated 3 months ago
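- Welcome to the "LLM-travel" repository! Explore the mysteries of large language models (LLMs) 🚀. Dedicated to understanding, discussing, and implementing the techniques, principles, and applications related to large models. ☆305 Updated 8 months ago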
- ☆108 Updated this week
- ☆40 Updated 7 months ago
- LLM theoretical performance analysis tools, supporting params, FLOPs, memory, and latency analysis. ☆80 Updated 2 months ago
- ☆74 Updated 4 months ago
- ☆105 Updated 4 months ago
- RLHF experiments on a single A100 40G GPU. Supports PPO, GRPO, REINFORCE, RAFT, RLOO, ReMax, and reproducing DeepSeek R1-Zero. ☆50 Updated last month
- ☆50 Updated 6 months ago
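- Personal reading notes on papers covering large models, natural language processing (NLP), multimodal learning, computer vision (CV), and related areas; a collection of excellent open-source repositories in NLP, CV, and other fields that the author has found or used; plus other resources such as datasets and evaluation leaderboards. ☆41 Updated this week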
- Retriever-0.1B ☆86 Updated 9 months ago
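- unify-easy-llm (ULM) aims to be a simple, one-click training tool for large models, supporting different hardware such as Nvidia GPUs and Ascend NPUs as well as commonly used large models. ☆55 Updated 8 months ago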
- ☆66 Updated last year
- ☆33 Updated last year
- PyTorch distributed training tutorials ☆117 Updated last month
- Fine-tuning of models such as llama and chatglm ☆86 Updated 8 months ago
- The most concise PyTorch implementation of the Transformer model, with detailed comments ☆184 Updated last year