Original transformer paper: Implementation of Vaswani, Ashish, et al. "Attention is all you need." Advances in neural information processing systems. 2017.
☆243Apr 29, 2024Updated last year
Alternatives and similar repositories for attention-is-all-you-need-paper
Users that are interested in attention-is-all-you-need-paper are comparing it to the libraries listed below
Sorting:
- Implements EvoNorms B0 and S0 as proposed in Evolving Normalization-Activation Layers.☆11Apr 22, 2020Updated 5 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Dec 21, 2021Updated 4 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.☆12Jun 9, 2023Updated 2 years ago
- This Repository contains notebook for learning the Deep Learning Framework - PyTorch.☆15Sep 28, 2021Updated 4 years ago
- ☆14Feb 9, 2022Updated 4 years ago
- This repository hosts code for converting the original MLP Mixer models (JAX) to TensorFlow.☆15Sep 29, 2021Updated 4 years ago
- A beginner friendly repository for getting started with adversarial machine learning in PyTorch☆26Apr 20, 2022Updated 3 years ago
- A curated resources on what's happening in multimodal learning. Features recent papers, books, related lectures, and other relevant resou…☆16Apr 28, 2023Updated 2 years ago
- A Gentle Principled Introduction to Deep Reinforcement Learning☆19Apr 4, 2025Updated 11 months ago
- ☆24Sep 2, 2022Updated 3 years ago
- Comparing PyTorch, JIT and ONNX for inference with Transformers☆20Feb 22, 2021Updated 5 years ago
- Agorithms and data structures in Python 🐍☆37Apr 2, 2022Updated 3 years ago
- Implementations of growing and pruning in neural networks☆22Jul 26, 2023Updated 2 years ago
- automatic analysis of scanned documents(only extracting handwritten digits)☆16Jul 24, 2021Updated 4 years ago
- This repository holds files and scripts for incorporating simple CI/CD practices for model training in ML.☆21Oct 26, 2021Updated 4 years ago
- Includes PyTorch -> Keras model porting code for DeiT models with fine-tuning and inference notebooks.☆41Apr 30, 2022Updated 3 years ago
- Contains code to demonstrate distributed training in TensorFlow 2 with AI Platform and custom Docker contains.☆20Apr 28, 2021Updated 4 years ago
- Course repository for the Spring 2022 COMP790 course "Deep Learning" at UNC☆19Apr 13, 2022Updated 3 years ago
- Object recognition with Pepper using a deep learning model☆10Sep 16, 2021Updated 4 years ago
- My classnotes, experiments, reproducible notebooks from fast.ai Deep Learning Class (v2)☆36Jun 2, 2018Updated 7 years ago
- Deep Convolutional Bidirectional LSTM for Complex Activity Recognition with Missing Data. Human Activity Recognition Challenge. Springer …☆23Apr 26, 2021Updated 4 years ago
- Code for gradient rollback, which explains predictions of neural matrix factorization models, as for example used for knowledge base comp…☆21Mar 16, 2021Updated 5 years ago
- Accurate word segmentation for hashtags and text, powered by Transformers and Beam Search. A scalable alternative to heuristic splitters …☆77Jan 8, 2026Updated 2 months ago
- A UI automation engine☆11Aug 14, 2025Updated 7 months ago
- Cookiecutter skeleton for minimal flask app☆10Jun 27, 2022Updated 3 years ago
- Running RL algorithms on the fish/shark aquarium environment to find unexpected biological insights.☆10Nov 30, 2021Updated 4 years ago
- Official TensorFlow code for the paper "DeepWay: a Deep Learning Waypoint Estimator for Global Path Generation".☆11Jun 24, 2022Updated 3 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Aug 15, 2021Updated 4 years ago
- Sharpened Cosine Distance implementation in PyTorch☆10Feb 1, 2022Updated 4 years ago
- ☆14Mar 9, 2023Updated 3 years ago
- Course repository for the Spring 2023 COMP664 course "Deep Learning" at UNC☆14Apr 17, 2023Updated 2 years ago
- TensorFlow/Keras port of Stable Diffusion☆324Sep 29, 2022Updated 3 years ago
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o…☆1,085Dec 27, 2020Updated 5 years ago
- OpenVINO Edge AI Applications deployment on Google Colaboratory☆70Mar 22, 2022Updated 3 years ago
- Project for my graduate neural networks course - combining RL with VAEs☆22Nov 10, 2019Updated 6 years ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis.☆13Apr 21, 2022Updated 3 years ago
- "자연어처리 알고리즘을 활용한 느린학습자 교육 컨텐츠 제작" 프로젝트 "애움길" 팀입니다. 데이터 수집(크롤링)/EDA/Preprocessing, 쉬운말 생성요약 AI 모델링(NLP - KoBERT, KoBART), 프로토타입 제작을 진행했습니다…☆13Mar 24, 2022Updated 3 years ago
- Illustration of Markov Decision Processes (MDPs)☆11May 22, 2020Updated 5 years ago
- A set of simple tutorial programs for quantum computing including a game, Fly Unicorn.☆15Oct 25, 2019Updated 6 years ago