wenjtop / transformer
Transformer是谷歌在17年发表的Attention Is All You Need 中使用的模型,经过这些年的大量的工业使用和论文验证,在深度学习领域已经占据重要地位。Bert就是从Transformer中衍生出来的语言模型。我会以中文翻译英文为例,来解释Transformer输入到输出整个流程。
☆253Updated last year
Alternatives and similar repositories for transformer
Users that are interested in transformer are comparing it to the libraries listed below
Sorting:
- Natural Language Processing Tutorial for Deep Learning Researchers☆1,126Updated 3 years ago
- 关于Transformer模型的最简洁pytorch实现,包含详细注释☆194Updated last year
- ☆126Updated last year
- ☆173Updated 3 years ago
- 算法岗笔试面试大全,励志做算法届的《五年高考,三年模拟》!☆475Updated last month
- 《跟我一起深度学习》@月来客栈 出品☆218Updated 2 weeks ago
- LLM大模型(重点)以及搜广推等 AI 算法中手写的面试题,(非 LeetCode),比如 Self-Attention, AUC等,一般比 LeetCode 更考察一个人的综合能力,又更贴近业务和基础知识一点☆257Updated 4 months ago
- Demos for deep learning☆596Updated 5 months ago
- pytorch distribute tutorials☆131Updated this week
- A Transformer Framework Based Translation Task☆150Updated 2 months ago
- 自然语言处理学习笔记:机器学习及深度学习原理和示例,基于 Tensorflow 和 PyTorch 框架,Transformer、BERT、ALBERT等最新预训练模型及源代码详解,及基于预训练模型进行各种自然语言处理任务。模型部署☆406Updated 4 years ago
- An implementation of the BERT model and its related downstream tasks based on the PyTorch framework. @月来客栈☆597Updated 2 months ago
- Huggingface transformers的中文文档☆240Updated last year
- 博客配套视频链接: https://space.bilibili.com/383551518?spm_id_from=333.1007.0.0 b 站直接看 配套 github 链接:https://github.com/nickchen121/Pre-trainin…☆424Updated 2 years ago
- personal chatgpt☆363Updated 5 months ago
- 学习深度学习不如边写代码边学习,实际操作一遍才能理解数据的变换过程,参数的训练过程,这里整合了B站的jupter代码,可以结合着B站的视频边看边练,希望能对大家有帮助。☆130Updated 2 years ago
- MindSpore online courses: Step into LLM☆465Updated 4 months ago
- ☆70Updated 2 months ago
- Transformer的完整实现。详细构建Encoder、Decoder、Self-attention。以实际例子进行展示,有完整的输入、训练、预测过程。可用于学习理解self-attention和Transformer☆79Updated last month
- 深度学习系统笔记,包含深度学习数学基础知识、神经网络基础部件详解、深度学习炼丹策略、模型压缩算法详解。☆469Updated last week
- How to use wandb?☆642Updated last year
- pytorch复现transformer☆78Updated last year
- ☆81Updated last year
- an implementation of transformer, bert, gpt, and diffusion models for learning purposes☆154Updated 7 months ago
- DeepSpeed Tutorial☆97Updated 9 months ago
- modern AI for beginners☆132Updated last month
- Inference code for LLaMA models☆120Updated last year
- To be the world's best PyTorch project template.☆506Updated 2 years ago
- ☆322Updated 3 months ago
- 一个很小很小的RAG系统☆223Updated 2 weeks ago