brandokoch / attention-is-all-you-need-paperView external linksLinks
Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.
☆243Apr 29, 2024Updated last year
Alternatives and similar repositories for attention-is-all-you-need-paper
Users that are interested in attention-is-all-you-need-paper are comparing it to the libraries listed below
Sorting:
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Apr 22, 2020Updated 5 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.☆12Jun 9, 2023Updated 2 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Sep 29, 2021Updated 4 years ago
- ☆14Feb 9, 2022Updated 4 years ago
- This Repository contains notebook for learning the Deep Learning Framework - PyTorch.☆15Sep 28, 2021Updated 4 years ago
- This is a tutorial to connect the fundamental mathematics to a practical implementation addressing the continual learning problem of arti…☆363Apr 17, 2023Updated 2 years ago
- A beginner friendly repository for getting started with adversarial machine learning in PyTorch☆26Apr 20, 2022Updated 3 years ago
- A curated resources on what's happening in multimodal learning. Features recent papers, books, related lectures, and other relevant resou…☆16Apr 28, 2023Updated 2 years ago
- A Gentle Principled Introduction to Deep Reinforcement Learning☆19Apr 4, 2025Updated 10 months ago
- ☆24Sep 2, 2022Updated 3 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆20Feb 22, 2021Updated 4 years ago
- Implementations of growing and pruning in neural networks☆22Jul 26, 2023Updated 2 years ago
- Agorithms and data structures in Python 🐍☆37Apr 2, 2022Updated 3 years ago
- 📄 Evidence Retrieval and Claim Verification for the FEVER shared task using Transformer Networks☆12Feb 21, 2020Updated 5 years ago
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML.☆21Oct 26, 2021Updated 4 years ago
- automatic analysis of scanned documents(only extracting handwritten digits)☆16Jul 24, 2021Updated 4 years ago
- Includes PyTorch -> Keras model porting code for DeiT models with fine-tuning and inference notebooks.☆41Apr 30, 2022Updated 3 years ago
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker contains.☆20Apr 28, 2021Updated 4 years ago
- Course repository for the Spring 2022 COMP790 course "Deep Learning" at UNC☆19Apr 13, 2022Updated 3 years ago
- Object recognition with Pepper using a deep learning model☆10Sep 16, 2021Updated 4 years ago
- My classnotes, experiments, reproducible notebooks from fast.ai Deep Learning Class (v2)☆36Jun 2, 2018Updated 7 years ago
- ☆44Aug 2, 2021Updated 4 years ago
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters …☆76Jan 8, 2026Updated last month
- Needles in Haystacks: On Classifying Tiny Objects in Large Images☆22Jun 28, 2019Updated 6 years ago
- Deep Convolutional Bidirectional LSTM for Complex Activity Recognition with Missing Data. Human Activity Recognition Challenge. Springer …☆23Apr 26, 2021Updated 4 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Mar 16, 2021Updated 4 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Aug 15, 2021Updated 4 years ago
- Running RL algorithms on the fish/shark aquarium environment to find unexpected biological insights.☆10Nov 30, 2021Updated 4 years ago
- Cookiecutter skeleton for minimal flask app☆10Jun 27, 2022Updated 3 years ago
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆14Apr 17, 2023Updated 2 years ago
- ⚛️ ⚗️ 👨🏫 A project based in Quantum Physics/Mechanics, Quantum Information Science, Quantum Theory, Quantum Chemistry and Quantum Comp…☆14Nov 7, 2020Updated 5 years ago
- Sharpened Cosine Distance implementation in PyTorch☆10Feb 1, 2022Updated 4 years ago
- ☆14Mar 9, 2023Updated 2 years ago
- A UI automation engine☆11Aug 14, 2025Updated 6 months ago
- TensorFlow/Keras port of Stable Diffusion☆324Sep 29, 2022Updated 3 years ago
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆1,079Dec 27, 2020Updated 5 years ago
- OpenVINO Edge AI Applications deployment on Google Colaboratory☆70Mar 22, 2022Updated 3 years ago
- Denoising networks for ray traced images☆13May 21, 2020Updated 5 years ago