wenjtop / transformerLinks
Transformer是谷歌在17年发表的Attention Is All You Need 中使用的模型,经过这些年的大量的工业使用和论文验证,在深度学习领域已经占据重要地位。Bert就是从Transformer中衍生出来的语言模型。我会以中文翻译英文为例,来解释Transformer输入到输出整个流程。
☆265Updated last year
Alternatives and similar repositories for transformer
Users that are interested in transformer are comparing it to the libraries listed below
Sorting:
- 关于Transformer模型的最简洁pytorch实现,包含详细注释☆202Updated last year
- ☆126Updated 3 weeks ago
- Demos for deep learning☆622Updated 6 months ago
- 《跟我一起深度学习》@月来客栈 出品☆222Updated 2 weeks ago
- personal chatgpt☆373Updated 6 months ago
- How to use wandb?☆655Updated last year
- Transformer的完整实现。详细构建Encoder、Decoder、Self-attention。以实际例子进行展示,有完整的输入、训练、预测过程。可用于学习理解self-attention和Transformer☆83Updated 2 months ago
- 算法岗笔试面试大全,励志做算法届的《五年高考,三年模拟》!☆507Updated 3 months ago
- ☆336Updated 4 months ago
- pytorch distribute tutorials☆138Updated last week
- To be the world's best PyTorch project template.☆508Updated 2 years ago
- Huggingface transformers的中文文档☆254Updated last year
- ☆178Updated 3 years ago
- Natural Language Processing Tutorial for Deep Learning Researchers☆1,128Updated 3 years ago
- LLM大模型(重点)以及搜广推等 AI 算法中手写的面试题,(非 LeetCode),比如 Self-Attention, AUC等,一般比 LeetCode 更考察一个人的综合能力,又更贴近业务和基础知识一点☆295Updated 5 months ago
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆154Updated 8 months ago
- 博客配套视频链接: https://space.bilibili.com/383551518?spm_id_from=333.1007.0.0 b 站直接看 配套 github 链接:https://github.com/nickchen121/Pre-trainin…☆433Updated 2 years ago
- ☆72Updated last month
- Learning LLM Implementaion and Theory for Practical Landing☆163Updated 5 months ago
- 大模型基础学习和面试八股文☆129Updated last year
- A Transformer Framework Based Translation Task☆151Updated 2 weeks ago
- 欢迎来到 LLM-Dojo,这里是一个开源大模型学习场所,使用简洁且易阅读的代码构建模型训练框架(支持各种主流模型如Qwen、Llama、GLM等等)、RLHF框架(DPO/CPO/KTO/PPO)等各种功能。👩🎓👨🎓☆781Updated this week
- Pytorch Lightning入门中文教程,转载请注明来源。(当初是写着玩的,建议看完MNIST这个例子再上手)☆218Updated 4 years ago
- Transformer Encoder PyTorch note☆112Updated 2 years ago
- modern AI for beginners☆141Updated 2 weeks ago
- Inference code for LLaMA models☆121Updated last year
- 深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解。☆474Updated 2 weeks ago
- 整理 pytorch 单机多 GPU 训练方法与原理☆837Updated 3 years ago
- ChineseNMT: Translate English to Chinese with PyTorch Implementation of Transformer☆479Updated 2 years ago
- ☆79Updated 10 months ago