wenjtop / transformer
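Transformer is the model introduced in Google's 2017 paper Attention Is All You Need. After years of heavy industrial use and validation in published papers, it holds an important place in deep learning. BERT is a language model derived from the Transformer. Using Chinese-to-English translation as an example, I walk through the entire Transformer pipeline from input to output.
☆250 Updated last year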
Alternatives and similar repositories for transformer:
Users that are interested in transformer are comparing it to the libraries listed below
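- The most concise PyTorch implementation of the Transformer model, with detailed comments ☆190 Updated last year
- A Transformer Framework Based Translation Task ☆150 Updated 2 months ago
- A complete Transformer implementation. Builds the Encoder, Decoder, and Self-attention in detail, and demonstrates them on a practical example with complete input, training, and prediction steps. Useful for learning and understanding self-attention and the Transformer ☆76 Updated 2 weeks ago
- Natural Language Processing Tutorial for Deep Learning Researchers ☆1,123 Updated 3 years ago
- 《跟我一起深度学习》 (Learn Deep Learning with Me), produced by @月来客栈 ☆214 Updated last week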
- ☆125 Updated last year
- personal chatgpt ☆361 Updated 4 months ago
- ☆312 Updated 2 months ago
- pytorch distribute tutorials ☆123 Updated this week
- ☆168 Updated 3 years ago
- To be the world's best PyTorch project template. ☆505 Updated 2 years ago
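- Hand-coding interview questions (not LeetCode) on LLMs (the main focus) and on AI algorithms for search, advertising, and recommendation, such as Self-Attention and AUC. These generally test a candidate's overall ability more than LeetCode does and sit closer to business practice and fundamentals ☆238 Updated 3 months ago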
- DeepSpeed Tutorial ☆95 Updated 8 months ago
- Inference code for LLaMA models ☆120 Updated last year
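- Companion videos for the blog: https://space.bilibili.com/383551518?spm_id_from=333.1007.0.0 (watch directly on Bilibili). Companion GitHub link: https://github.com/nickchen121/Pre-trainin… ☆415 Updated 2 years ago
- Systematic deep learning notes covering the mathematical foundations of deep learning, detailed explanations of basic neural network building blocks, model training and tuning strategies, and detailed explanations of model compression algorithms. ☆465 Updated last month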
- Demos for deep learning ☆576 Updated 4 months ago
- Chinese documentation for Huggingface transformers ☆231 Updated last year
- LLM fundamentals study notes and standard interview questions ☆108 Updated last year
- How to use wandb? ☆635 Updated last year
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes ☆153 Updated 6 months ago
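- A comprehensive collection of written-test and interview questions for algorithm positions, aspiring to be the 《五年高考,三年模拟》 (Five Years of Gaokao, Three Years of Mock Exams) of the algorithm world! ☆453 Updated last month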
- Transformer Encoder PyTorch note ☆107 Updated last year
- ☆70 Updated 2 months ago
- MindSpore online courses: Step into LLM ☆458 Updated 3 months ago
- An implementation of the BERT model and its related downstream tasks based on the PyTorch framework. @月来客栈 ☆593 Updated last month
- ☆68 Updated 8 months ago
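- The best way to learn deep learning is to write code as you go: only by working through it hands-on can you understand how the data is transformed and how the parameters are trained. This repository collects the Jupyter code from Bilibili so you can practice alongside the Bilibili videos. I hope it helps. ☆129 Updated 2 years ago
- A Transformer-based translation model, implemented in PyTorch, intended for learning the Transformer. ☆49 Updated 4 years ago
- A very, very small RAG system ☆207 Updated 4 months ago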