brandokoch / attention-is-all-you-need-paper
Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.
☆237Updated last year
Alternatives and similar repositories for attention-is-all-you-need-paper
Users that are interested in attention-is-all-you-need-paper are comparing it to the libraries listed below
Sorting:
- this is where we share notebooks/projects used in your youtube channel☆148Updated 3 years ago
- Annotations of the interesting ML papers I read☆239Updated last week
- Representation Learning MSc course Summer Semester 2023☆79Updated last year
- PyTorch 101 series covering everything from the basic building blocks all the way to building custom architectures.☆262Updated 4 years ago
- All about the fundamental blocks of TF and JAX!☆274Updated 3 years ago
- ML Research paper summaries, annotated papers and implementation walkthroughs☆114Updated 3 years ago
- Interview Questions and Answers for Machine Learning Engineer role☆119Updated this week
- deep learning with pytorch lightning☆1Updated 6 months ago
- Source of the FSDL 2022 labs, which are at https://github.com/full-stack-deep-learning/fsdl-text-recognizer-2022-labs☆80Updated last year
- This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.☆172Updated 3 years ago
- Host repository for the "Reproducible Deep Learning" PhD course☆405Updated 3 years ago
- MinT: Minimal Transformer Library and Tutorials☆254Updated 2 years ago
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆173Updated 2 years ago
- Some notebooks for NLP☆204Updated last year
- An open-source AutoML Library based on PyTorch☆306Updated last month
- A curated list of awesome fastai projects/blog posts/tutorials/etc.☆170Updated 3 years ago
- ☆138Updated 2 years ago
- This is a collection of the code that accompanies the reports in The Gallery by Weights & Biases.☆339Updated 3 years ago
- Infographic about the inner computations of a transformer model, training and inference☆83Updated last year
- Software Architecture for ML engineers☆404Updated 2 years ago
- The "tl;dr" on a few notable transformer papers (pre-2022).☆190Updated 2 years ago
- A library to inspect and extract intermediate layers of PyTorch models.☆473Updated 3 years ago
- Serving PyTorch models with TorchServe☆101Updated 2 years ago
- Lightning Bits: Engineering for Researchers repo☆132Updated 2 years ago
- My repo for training neural nets using pytorch-lightning and hydra☆220Updated 2 months ago
- FasterAI: Prune and Distill your models with FastAI and PyTorch☆248Updated last month
- Notebooks for the Practicals at the Deep Learning Indaba 2022.☆175Updated last year
- Deploying PyTorch Model to Production with FastAPI in CUDA-supported Docker☆102Updated 3 years ago
- chaii: hindi and tamil question answering☆35Updated 3 years ago
- A project template where PyTorch Lightning, Pydantic, and more! being used for training MNIST as an example.☆26Updated 2 years ago