subramen / minGPT-ddp
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
☆20Updated 2 years ago
Related projects: ⓘ
- Transformers w/o Attention, based fully on MLPs☆85Updated 5 months ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆66Updated last year
- Recent Advances on Efficient Vision Transformers☆46Updated last year
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 2 years ago
- code for NASViT☆66Updated 2 years ago
- VIT inference in triton because, why not?☆16Updated 3 months ago
- ☆48Updated 11 months ago
- A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API.☆118Updated 9 months ago
- ☆30Updated 3 months ago
- Megatron's multi-modal data loader☆42Updated this week
- implement minimal pytorch from scratch☆18Updated 3 years ago
- Examples for the WebDataset PyTorch Dataset Library☆47Updated 3 years ago
- Pytorch reimplementation of the Mixer (MLP-Mixer: An all-MLP Architecture for Vision)☆32Updated 3 years ago
- [CVPR'23] SparseViT: Revisiting Activation Sparsity for Efficient High-Resolution Vision Transformer☆59Updated 4 months ago
- [NeurIPS 2022] “M³ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design”, Hanxue …☆90Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆92Updated last week
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆69Updated 2 years ago
- Official implementation of "Active Image Indexing"☆58Updated last year
- [ECCV 2024] Official PyTorch implementation of RoPE-ViT "Rotary Position Embedding for Vision Transformer"☆157Updated last month
- Implementation of Hire-MLP: Vision MLP via Hierarchical Rearrangement and An Image Patch is a Wave: Phase-Aware Vision MLP.☆33Updated last year
- In progress.☆64Updated 5 months ago
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆45Updated 9 months ago
- A compilation of network architectures for vision and others without usage of self-attention mechanism☆77Updated last year
- ☆27Updated last year
- A block oriented training approach for inference time optimization.☆26Updated last month
- Code for ViTAS_Vision Transformer Architecture Search☆51Updated 3 years ago
- DeltaCNN End-to-End CNN Inference of Sparse Frame Differences in Videos☆60Updated last year
- ☆164Updated 8 months ago
- A simple minimal implementation of Reversible Vision Transformers☆114Updated 6 months ago
- This is the official PyTorch implementation for "Mesa: A Memory-saving Training Framework for Transformers".☆118Updated 2 years ago