BorealisAI / efficient-vit-training
PyTorch code of "Training a Vision Transformer from scratch in less than 24 hours with 1 GPU" (HiTY workshop at Neurips 2022)
☆20Updated last year
Alternatives and similar repositories for efficient-vit-training
Users that are interested in efficient-vit-training are comparing it to the libraries listed below
Sorting:
- Official PyTorch implementation of "LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging" (ICML 2024)☆29Updated 9 months ago
- Pytorch Implementation of the paper: "Learning to (Learn at Test Time): RNNs with Expressive Hidden States"☆24Updated 3 weeks ago
- ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2☆64Updated 6 months ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆33Updated last year
- Implementation of ViTaR: ViTAR: Vision Transformer with Any Resolution in PyTorch☆35Updated 6 months ago
- Implementation of the proposed DeepCrossAttention by Heddes et al at Google research, in Pytorch☆84Updated 2 months ago
- an implementation of FAdam (Fisher Adam) in PyTorch☆43Updated 11 months ago
- A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.☆35Updated 10 months ago
- Implementation of a modular, high-performance, and simplistic mamba for high-speed applications☆34Updated 6 months ago
- This repository contains the pytorch code for our work IEEE ISBI 2024 paper "ConvLoRA and AdaBN Based Domain Adaptation via Self-Training…☆74Updated 7 months ago
- Official Pytorch implementation for "IFORMER: INTEGRATING CONVNET AND TRANSFORMER FOR MOBILE APPLICATION" [ICLR 2025]☆42Updated last month
- Official code repository for paper: "ExPLoRA: Parameter-Efficient Extended Pre-training to Adapt Vision Transformers under Domain Shifts"☆31Updated 7 months ago
- [ECCV 2024] Isomorphic Pruning for Vision Models☆68Updated 9 months ago
- Official implementation of RMoE (Layerwise Recurrent Router for Mixture-of-Experts)☆20Updated 9 months ago
- The official implementation of the paper "Reducing Fine-Tuning Memory Overhead by Approximate and Memory-Sharing Backpropagation"☆20Updated 5 months ago
- Minimal Implementation of Visual Autoregressive Modelling (VAR)☆33Updated last month
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆20Updated 5 months ago
- Implementation of a Light Recurrent Unit in Pytorch☆46Updated 7 months ago
- ☆48Updated last year
- Scaling Multi-modal Instruction Fine-tuning with Tens of Thousands Vision Task Types☆18Updated last month
- An efficient pytorch implementation of selective scan in one file, works with both cpu and gpu, with corresponding mathematical derivatio…☆87Updated last year
- ☆36Updated 9 months ago
- OLA-VLM: Elevating Visual Perception in Multimodal LLMs with Auxiliary Embedding Distillation, arXiv 2024☆60Updated 2 months ago
- More dimensions = More fun☆22Updated 9 months ago
- Triton implement of bi-directional (non-causal) linear attention☆47Updated 3 months ago
- FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation☆48Updated 10 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆21Updated 4 months ago
- [ICCV2023] TinyCLIP: CLIP Distillation via Affinity Mimicking and Weight Inheritance☆92Updated 10 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆54Updated 8 months ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆56Updated last year