BobMcDear / vit-pytorch
PyTorch implementation of the vision transformer
☆18Updated 2 years ago
Alternatives and similar repositories for vit-pytorch:
Users that are interested in vit-pytorch are comparing it to the libraries listed below
- PyTorch implementation of EfficientNet☆10Updated 2 years ago
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.☆19Updated last year
- PyTorch implementation of SimSiam☆8Updated 2 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 3 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆57Updated last year
- Deep Learning Experiment Code.☆19Updated 8 months ago
- ☆28Updated last year
- A multi-label text classifier to predict the subject areas of arXiv papers from their abstract bodies.☆17Updated 3 years ago
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 2 years ago
- Code for the PAPA paper☆27Updated 2 years ago
- NeurIPS2021: Vision Transformer Paper Collection☆8Updated 3 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆37Updated 3 years ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- Factorized Neural Layers☆27Updated last year
- ☆22Updated last year
- Skyformer: Remodel Self-Attention with Gaussian Kernel and Nyström Method (NeurIPS 2021)☆60Updated 3 years ago
- This is a simple torch implementation of the high performance Multi-Query Attention☆16Updated last year
- Utilities for Training Very Large Models☆58Updated 7 months ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆17Updated 6 months ago
- several types of attention modules written in PyTorch for learning purposes☆50Updated 6 months ago
- ☆51Updated 10 months ago
- PyTorch implementation of FNet: Mixing Tokens with Fourier transforms☆27Updated 3 years ago
- AdamW optimizer for bfloat16 models in pytorch 🔥.☆32Updated 10 months ago
- Another attempt at a long-context / efficient transformer by me☆37Updated 3 years ago
- Implementation of Spectral State Space Models☆16Updated last year
- Evaluating major LLMs on the Abstraction and Reasoning Corpus☆16Updated last year
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree of classifier calibration.☆13Updated last year
- Includes additional materials for the following keras.io blog post.☆12Updated 3 years ago
- LLM training in simple, raw C/CUDA☆14Updated 4 months ago