feather-ai / transformers-tutorial

The code for the video tutorial series on building a Transformer from scratch: https://www.youtube.com/watch?v=XR4VDnJzB8o

☆19

Alternatives and similar repositories for transformers-tutorial

Users who are interested in transformers-tutorial are comparing it to the repositories listed below.

jiaweizzhao / ZerO-initialization
☆74 Updated 2 years ago
sayakpaul / MLP-Mixer-CIFAR10
Implements MLP-Mixer (https://arxiv.org/abs/2105.01601) with the CIFAR-10 dataset.
☆56 Updated 3 years ago
FrancescoSaverioZuppichini / Loading-huge-PyTorch-models-with-linear-memory-consumption
A short article showing how to load PyTorch models with linear memory consumption
☆34 Updated 2 years ago
lucidrains / tranception-pytorch
Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction
☆32 Updated 3 years ago
rasbt / cyclemoid-pytorch
Cyclemoid implementation for PyTorch
☆90 Updated 3 years ago
lucidrains / learning-to-expire-pytorch
An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain
☆34 Updated 4 years ago
teddykoker / learning-to-learn-jax
JAX implementation of Learning to learn by gradient descent by gradient descent
☆27 Updated 9 months ago
LukeWood / reef-net
☆24 Updated 2 years ago
lucidrains / deep-linear-network
A simple implementation of a deep linear Pytorch module
☆21 Updated 4 years ago
AakashKumarNain / diffusion_models
All about the fundamentals and working of Diffusion Models
☆158 Updated 2 years ago
lucidrains / kalman-filtering-attention
Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"
☆58 Updated last year
sayakpaul / cait-tf
Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.
☆12 Updated 2 years ago
facebookresearch / LAWT
Code for the papers "Linear Algebra with Transformers" (TMLR) and "What is my Math Transformer Doing?" (AI for Maths Workshop, NeurIPS 2022)
☆71 Updated 11 months ago
swyoon / pytorch-energy-based-model
Simple illustrative examples for energy-based models in PyTorch
☆64 Updated 5 years ago
sayakpaul / BiT-jax2tf
This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.
☆14 Updated 3 years ago
conceptofmind / vit-flax
Implementation of numerous Vision Transformers in Google's JAX and Flax.
☆22 Updated 2 years ago
r-isachenko / 2022-2023-DGM-AIMasters-course
Deep Generative Models course, AIMasters, 2022
☆46 Updated 2 years ago
elliotwaite / pytorch-hooks-tutorial
Examples of using PyTorch hooks, as covered in my YouTube tutorial video.
☆34 Updated last year
sgugger / hf_examples
NLP Examples using the 🤗 libraries
☆41 Updated 4 years ago
lucidrains / triangle-multiplicative-module
Implementation of the Triangle Multiplicative module, used in Alphafold2 as an efficient way to mix rows or columns of a 2d feature map, …
☆35 Updated 3 years ago
lucidrains / adjacent-attention-network
Graph neural network message passing reframed as a Transformer with local attention
☆69 Updated 2 years ago
lucidrains / all-normalization-transformer
A simple Transformer where the softmax has been replaced with normalization
☆20 Updated 4 years ago
lucidrains / tableformer-pytorch
Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch
☆39 Updated 3 years ago
sayakpaul / Handwriting-Recognizer-in-Keras
This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.
☆13 Updated 3 years ago
jfpuget / NVIDIA-PCQM4Mv2
Code of the NVIDIA winning solution to the 2nd OGB-LSC at the NeurIPS 2022 challenge with dataset PCQM4Mv2
☆17 Updated 2 years ago
microsoft / ResiDual
ResiDual: Transformer with Dual Residual Connections, https://arxiv.org/abs/2304.14802
☆94 Updated last year
lucidrains / frame-averaging-pytorch
Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network
☆50 Updated 11 months ago
sayakpaul / ViT-jax2tf
This repository hosts code for converting the original Vision Transformer models (JAX) to TensorFlow.
☆33 Updated 3 years ago
dlvu / vugrad
Yet another mini autodiff system for educational purposes
☆30 Updated 8 months ago
vvvm23 / sundae
Unofficial PyTorch implementation of "Step-unrolled Denoising Autoencoders for Text Generation"
☆24 Updated 2 years ago