BobMcDear / vit-pytorch
PyTorch implementation of the vision transformer
☆18 Updated 2 years ago
Alternatives and similar repositories for vit-pytorch:
Users who are interested in vit-pytorch are comparing it to the libraries listed below.
- PyTorch implementation of SimSiam ☆8 Updated 2 years ago
- Neural network from scratch in CUDA/C++ ☆78 Updated 2 months ago
- PyTorch implementation of EfficientNet ☆10 Updated 2 years ago
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax. ☆19 Updated last year
- PyTorch implementation of popular attention mechanisms in vision ☆15 Updated last year
- Parallel Associative Scan for Language Models ☆18 Updated last year
- 11-785 Introduction to Deep Learning (IDeeL) website with logistics and select course materials ☆41 Updated this week
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 Updated 3 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction" ☆57 Updated last year
- NeurIPS2021: Vision Transformer Paper Collection ☆8 Updated 3 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k. ☆22 Updated 2 years ago
- Implementation of CaiT models in TensorFlow and ImageNet-1k checkpoints. Includes code for inference and fine-tuning. ☆12 Updated last year
- Source-to-Source Debuggable Derivatives in Pure Python ☆15 Updated last year
- Explorations into the recently proposed Taylor Series Linear Attention ☆95 Updated 7 months ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆15 Updated 3 years ago
- Nadir: Cutting-edge PyTorch optimizers for simplicity & composability! 🔥🚀💻 ☆14 Updated 9 months ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention" ☆15 Updated 5 months ago
- This project shows how to derive the total number of training tokens from a large text dataset from 🤗 datasets with Apache Beam and Data… ☆24 Updated 2 years ago
- This is a simple torch implementation of the high performance Multi-Query Attention ☆16 Updated last year
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆49 Updated last year
- Deep Learning Experiment Code. ☆19 Updated 8 months ago
- Shows how to do parameter ensembling using differential evolution. ☆10 Updated 3 years ago
- A curated list of reinforcement learning in NLP. :-) ☆20 Updated 3 years ago
- Local Attention - Flax module for Jax ☆20 Updated 3 years ago
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nyström Method (NeurIPS 2021) ☆60 Updated 2 years ago
- Sequence models in Numpy ☆25 Updated 4 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 Updated last year
- ☆51 Updated 9 months ago
- Explorations with Geoffrey Hinton's Forward-Forward algorithm ☆33 Updated last year
- ☆18 Updated 2 years ago