CerebrasResearch / RevBiFPN
RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network
☆13Updated 2 years ago
Alternatives and similar repositories for RevBiFPN:
Users that are interested in RevBiFPN are comparing it to the libraries listed below
- A implement of run-length encoding for Pytorch tensor using CUDA☆11Updated 3 years ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆49Updated 4 years ago
- Customized matrix multiplication kernels☆54Updated 3 years ago
- ☆41Updated 4 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆71Updated 2 years ago
- Transformers w/o Attention, based fully on MLPs☆93Updated 11 months ago
- [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…☆30Updated last year
- A library for unit scaling in PyTorch☆125Updated 4 months ago
- Butterfly matrix multiplication in PyTorch☆168Updated last year
- A research library for pytorch-based neural network pruning, compression, and more.☆160Updated 2 years ago
- ☆57Updated 2 years ago
- PyTorch implementation of HashedNets☆36Updated last year
- [NeurIPS 2020] ShiftAddNet: A Hardware-Inspired Deep Network☆71Updated 4 years ago
- Code repository for the ICLR 2022 paper "FlexConv: Continuous Kernel Convolutions With Differentiable Kernel Sizes" https://openreview.ne…☆115Updated 2 years ago
- ☆18Updated 2 years ago
- ☆157Updated last year
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆125Updated 3 years ago
- Recent Advances on Efficient Vision Transformers☆50Updated 2 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆71Updated 2 years ago
- ☆43Updated last year
- MONeT framework for reducing memory consumption of DNN training☆173Updated 3 years ago
- Kervolution Library in PyTorch (CVPR 2019 Oral)☆39Updated 4 years ago
- AlphaNet Improved Training of Supernet with Alpha-Divergence☆98Updated 3 years ago
- ☆40Updated 3 years ago
- Nested Hierarchical Transformer https://arxiv.org/pdf/2105.12723.pdf☆196Updated 8 months ago
- Official implementation for "SimA: Simple Softmax-free Attention for Vision Transformers"☆43Updated 11 months ago
- ☆31Updated 9 months ago
- A simple minimal implementation of Reversible Vision Transformers☆123Updated last year
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated last year
- PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling)☆20Updated 5 years ago