CerebrasResearch / RevBiFPN
RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network
☆14Updated 2 years ago
Alternatives and similar repositories for RevBiFPN
Users who are interested in RevBiFPN are comparing it to the libraries listed below
- An implementation of run-length encoding for PyTorch tensors using CUDA☆11Updated 4 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆72Updated 2 years ago
- ☆43Updated last year
- A research library for pytorch-based neural network pruning, compression, and more.☆161Updated 2 years ago
- Code accompanying the NeurIPS 2020 paper: WoodFisher (Singh & Alistarh, 2020)☆50Updated 4 years ago
- ☆31Updated 10 months ago
- Transformers w/o Attention, based fully on MLPs☆93Updated last year
- PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling)☆20Updated 6 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆71Updated 2 years ago
- Customized matrix multiplication kernels☆54Updated 3 years ago
- Implementation of fused cosine similarity attention in the same style as Flash Attention☆213Updated 2 years ago
- DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos☆59Updated 2 years ago
- Dynamic Neural Architecture Search Toolkit☆30Updated 5 months ago
- [ICLR 2021 Spotlight] "CPT: Efficient Deep Neural Network Training via Cyclic Precision" by Yonggan Fu, Han Guo, Meng Li, Xin Yang, Yinin…☆30Updated last year
- Using ideas from product quantization for state-of-the-art neural network compression.☆145Updated 3 years ago
- Pruning is all you need (hopefully)☆11Updated 2 years ago
- Artifact for IPDPS'21: DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions.☆13Updated 4 years ago
- Recent Advances on Efficient Vision Transformers☆50Updated 2 years ago
- Soft Threshold Weight Reparameterization for Learnable Sparsity☆89Updated 2 years ago
- FakeQuantize with Learned Step Size (LSQ+) as Observer in PyTorch☆34Updated 3 years ago
- DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023)☆30Updated 2 years ago
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆20Updated 2 years ago
- Code repo for the paper "BiT: Robustly Binarized Multi-distilled Transformer"☆106Updated last year
- Butterfly matrix multiplication in PyTorch☆170Updated last year
- ☆22Updated 6 years ago
- Lightweight torch implementation of RigL, a sparse-to-sparse optimizer.☆56Updated 3 years ago
- ☆69Updated 5 years ago
- Identify a binary weight or binary weight and activation subnetwork within a randomly initialized network by only pruning and binarizing …☆52Updated 3 years ago
- Compression schema for gradients of activations in backward pass☆44Updated last year
- [CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch Normalization", Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu…☆57Updated 3 years ago