feather-ai / transformers-tutorial
The code for the video tutorial series on building a Transformer from scratch: https://www.youtube.com/watch?v=XR4VDnJzB8o
☆18Updated 2 years ago
Alternatives and similar repositories for transformers-tutorial:
Users that are interested in transformers-tutorial are comparing it to the libraries listed below
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- An implementation of Transformer with Expire-Span, a circuit for learning which memories to retain☆33Updated 4 years ago
- notebooks of cool EBM visualizations☆15Updated 4 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 2 years ago
- All my experiments with the various transformers and various transformer frameworks available☆14Updated 3 years ago
- A simple implementation of a deep linear Pytorch module☆19Updated 4 years ago
- HomebrewNLP in JAX flavour for maintable TPU-Training☆49Updated last year
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆48Updated 3 years ago
- Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction☆31Updated 2 years ago
- Contains Colab Notebooks show cool use-cases of different GCP ML APIs.☆10Updated 4 years ago
- ☆24Updated 2 years ago
- A simple Transformer where the softmax has been replaced with normalization☆19Updated 4 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- A JAX nn library☆21Updated 2 months ago
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data"☆17Updated 7 months ago
- NLP Examples using the 🤗 libraries☆41Updated 4 years ago
- Cyclemoid implementation for PyTorch☆89Updated 3 years ago
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 2 years ago
- Unofficial PyTorch implementation of the Involution layer from CVPR 2021☆45Updated 3 years ago
- Utilities for PyTorch distributed☆24Updated 2 months ago
- A GPT, made only of MLPs, in Jax☆57Updated 3 years ago
- Unofficial PyTorch implementation of "Step-unrolled Denoising Autoencoders for Text Generation"☆24Updated 2 years ago
- JAX implementation of Learning to learn by gradient descent by gradient descent☆27Updated 6 months ago
- Generative Deep Learning Sessions led by Anugraha Sinha (Machine Learning Tokyo)☆24Updated 4 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning.☆12Updated last year
- Another attempt at a long-context / efficient transformer by me☆37Updated 3 years ago
- Code for running the experiments in Deep Subjecthood: Higher Order Grammatical Features in Multilingual BERT☆17Updated last year
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆49Updated 3 years ago
- This project shows how to build a simple handwriting recognizer in Keras with the IAM dataset.☆13Updated 3 years ago