nlp-greyfoss / metagradLinks

一个用于学习的仿Pytorch纯Python实现的自动求导工具。

☆51

Alternatives and similar repositories for metagrad

Users that are interested in metagrad are comparing it to the libraries listed below

Sorting:

firechecking / CleanTransformer
an implementation of transformer, bert, gpt, and diffusion models for learning purposes
☆155Updated 9 months ago
sunkx109 / llama
Inference code for LLaMA models
☆122Updated last year
chunhuizhang / personal_chatgpt
personal chatgpt
☆380Updated 7 months ago
Mxoder / LLM-from-scratch
一些 LLM 方面的从零复现笔记
☆210Updated 3 months ago
chunhuizhang / bert_t5_gpt
☆73Updated 2 months ago
mindspore-courses / step_into_llm
MindSpore online courses: Step into LLM
☆475Updated 3 weeks ago
Glanvery / LLM-Travel
欢迎来到 "LLM-travel" 仓库！探索大语言模型（LLM）的奥秘 🚀。致力于深入理解、探讨以及实现与大模型相关的各种技术、原理和应用。
☆331Updated last year
chunhuizhang / pytorch_distribute_tutorials
pytorch distribute tutorials
☆145Updated last month
hengjiUSTC / learn-llm
☆112Updated 8 months ago
akaihaoshuai / baby-llama2-chinese_cybertron
使用单个24G显卡，从0开始训练LLM
☆56Updated last month
OvJat / DeepSpeedTutorial
DeepSpeed Tutorial
☆100Updated 11 months ago
yanqiangmiffy / how-to-train-tokenizer
怎么训练一个LLM分词器
☆151Updated 2 years ago
cauyxy / bilivideos
☆52Updated 2 years ago
WeltXing / PyDyNet
NumPy实现类PyTorch的动态计算图和神经网络框架(MLP, CNN, RNN, Transformer)
☆82Updated last year
taishan1994 / Llama3.1-Finetuning
对llama3进行全参微调、lora微调以及qlora微调。
☆204Updated 10 months ago
liuzard / transformers_zh_docs
Huggingface transformers的中文文档
☆267Updated last year
preacher-1 / MLA_tutorial
from MHA, MQA, GQA to MLA by 苏剑林, with code
☆25Updated 5 months ago
jiahe7ay / MINI_LLM
This is a repository used by individuals to experiment and reproduce the pre-training process of LLM.
☆460Updated 3 months ago
taishan1994 / pytorch-distributed-NLP
pytorch分布式训练
☆67Updated 2 years ago
mindspore-lab / mindrlhf
☆36Updated 7 months ago
zhoucz97 / myLearning
记录个人的学习历程。包括但不限于算法、机器学习、论文写作等。
☆106Updated 5 months ago
we1k / Randeng-MLT-PromptCBLUE
CCKS2023-PromptCBLUE: Code implement of TianChi completition
☆19Updated last year
Pillars-Creation / ChatGLM-RLHF-LoRA-RM-PPO
ChatGLM-6B添加了RLHF的实现，以及部分核心代码的逐行讲解 ,实例部分是做了个新闻短标题的生成，以及指定context推荐的RLHF的实现
☆86Updated last year
suu990901 / LLaMA-MiLe-Loss
Code for a New Loss for Mitigating the Bias of Learning Difficulties in Generative Language Models
☆65Updated 5 months ago
datawhalechina / llm-deploy
大模型/LLM推理和部署理论与实践
☆304Updated 3 weeks ago
xueyongfu11 / awesome-deep-learning-resource
个人总结的大模型、自然语言处理NLP、多模态、计算机视觉CV等方向paper的阅读笔记；收集到或者使用到的一些NLP、CV等领域的优秀开源仓库；其他：如数据集、评测leaderboard等
☆51Updated last week
chaoswork / llm_illustrated
看图学大模型
☆316Updated last year
KMnO4-zx / TinyRAG
TinyRAG
☆317Updated last month
owenliang / qwen-dpo
通义千问的DPO训练
☆51Updated 10 months ago
stanleylsx / llms_tool
一个基于HuggingFace开发的大语言模型训练、测试工具。支持各模型的webui、终端预测，低参数量及全参数模型训练(预训练、SFT、RM、PPO、DPO)和融合、量化。
☆219Updated last year