Sharath-girish / LilNetXLinks
Official PyTorch implementation of LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification
☆46Updated 3 years ago
Alternatives and similar repositories for LilNetX
Users that are interested in LilNetX are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆72Updated 2 years ago
- ☆25Updated 3 years ago
- PyTorch implementation of the paper The Lottery Ticket Hypothesis for Object Recognition☆23Updated 4 years ago
- Implementation of Kronecker Attention in Pytorch☆19Updated 4 years ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated 2 years ago
- ☆41Updated 4 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Updated 3 years ago
- Cyclic Differentiable Architecture Search☆36Updated 3 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- [ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…☆53Updated last year
- ☆32Updated 2 weeks ago
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023.☆23Updated last year
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated last year
- Official repository for MaGNET, ICLR 2022☆24Updated 2 years ago
- Github code for the paper Maximum Class Separation as Inductive Bias in One Matrix. Arxiv link: https://arxiv.org/abs/2206.08704☆29Updated 2 years ago
- Contains my experiments with the `big_vision` repo to train ViTs on ImageNet-1k.☆22Updated 2 years ago
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆40Updated 2 years ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆80Updated last year
- ☆51Updated last year
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- Repo for the paper "Extrapolating from a Single Image to a Thousand Classes using Distillation"☆36Updated 11 months ago
- A GPT, made only of MLPs, in Jax☆58Updated 4 years ago
- Official implementation for Wavelet Feature Maps Compression for Image-to-Image CNNs, NeurIPS 2022.☆34Updated 2 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆116Updated 2 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets'☆38Updated 3 years ago
- Simple CIFAR-10 classification with ConvMixer☆45Updated 3 years ago
- Implementation of "Structured Multi-Hashing for Model Compression" (CVPR 2020)☆11Updated 4 years ago
- Implementing DropPath/StochasticDepth in PyTorch☆16Updated 3 years ago
- Tensorflow 2 implementations of the C-SimCLR and C-BYOL self-supervised visual representation methods from "Compressive Visual Representa…☆37Updated 3 years ago