nlp-greyfoss / metagrad
一个用于学习的仿Pytorch纯Python实现的自动求导工具。
☆51Updated 9 months ago
Alternatives and similar repositories for metagrad:
Users that are interested in metagrad are comparing it to the libraries listed below
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆151Updated 4 months ago
- Inference code for LLaMA models☆113Updated last year
- ☆62Updated last month
- pytorch分布式训练☆63Updated last year
- pytorch distribute tutorials☆104Updated this week
- ☆52Updated last year
- ☆37Updated 4 months ago
- 使用单个24G显卡,从0开始训练LLM☆50Updated 3 months ago
- 怎么训练一个LLM分词器☆140Updated last year
- 欢迎来到 "LLM-travel" 仓库!探索大语言模型(LLM)的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。☆298Updated 7 months ago
- ☆39Updated 6 months ago
- Implementation of FlashAttention in PyTorch☆133Updated last month
- NumPy实现类PyTorch的动态计算图和神经网络框架(MLP, CNN, RNN, Transformer)☆79Updated 7 months ago
- 一个很小很小的RAG系统☆139Updated 2 months ago
- 从零到一实现一个 miniLLM~(动手学习LLM)☆61Updated 9 months ago
- The blog, read report and code example for AGI/LLM related knowledge.☆32Updated 3 weeks ago
- personal chatgpt☆337Updated 2 months ago
- ☆62Updated last week
- seq2seq_translation☆26Updated 3 years ago
- ☆104Updated 3 months ago
- ☆30Updated 2 months ago
- 对llama3进行全参微调、lora微调以及qlora微调。☆176Updated 4 months ago
- A Transformer Framework Based Translation Task☆144Updated this week
- ☆84Updated last year
- Hugging Face Transformers Course 笔记☆39Updated 2 years ago
- ☆64Updated last year
- DeepSpeed Tutorial☆95Updated 6 months ago
- ☆18Updated 2 years ago
- 个人总结的大模型、自然语言处理NLP、多模态、计算机视觉CV等方向paper的阅读笔记;收集到或者使用到的一些NLP、CV等领域的优秀开源仓库;其他:如数据集、评测leaderboard等☆35Updated last month
- A text classification example using ddp horovod and accelerate☆33Updated 3 years ago