wiedersehne / ParamixerLinks
Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention (CVPR 2022)
☆20Updated 3 years ago
Alternatives and similar repositories for Paramixer
Users that are interested in Paramixer are comparing it to the libraries listed below
Sorting:
- Piecewise Linear Functions (PWL) implementation in PyTorch☆57Updated 3 years ago
- Architecture embeddings independent from the parametrization of the search space☆15Updated 4 years ago
- DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023)☆32Updated 2 years ago
- ☆41Updated 4 years ago
- This repository implements the paper "Effective Training of Convolutional Neural Networks with Low-bitwidth Weights and Activations"☆20Updated 4 years ago
- [NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"☆73Updated 3 years ago
- Paper and Code for "Curriculum Learning by Optimizing Learning Dynamics" (AISTATS 2021)☆19Updated 4 years ago
- Official code for NeurIPS paper "Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable Approach".☆16Updated 3 years ago
- Drop-in replacement for any ResNet with a significantly reduced memory footprint and better representation capabilities☆208Updated last year
- Official implementation for Wavelet Feature Maps Compression for Image-to-Image CNNs, NeurIPS 2022.☆37Updated 3 years ago
- Code for the ICML 2021 and ICLR 2022 papers: Skew Orthogonal Convolutions, Improved deterministic l2 robustness on CIFAR-10 and CIFAR-100☆18Updated 3 years ago
- Good Subnetworks Provably Exist: Pruning via Greedy Forward Selection☆22Updated 5 years ago
- Official Implementation of Convolutional Normalization: Improving Robustness and Training for Deep Neural Networks☆30Updated 3 years ago
- Codebase for the paper "A Gradient Flow Framework for Analyzing Network Pruning"☆20Updated 5 years ago
- (ECCV 2022) BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks☆51Updated 3 years ago
- ☆13Updated 5 years ago
- PyTorch implementation of "Deep Transferring Quantization" (ECCV2020)☆18Updated 3 years ago
- ☆20Updated 2 years ago
- ☆33Updated 3 years ago
- Codes for Understanding Architectures Learnt by Cell-based Neural Architecture Search☆28Updated 5 years ago
- Official repository for the paper "Masksembles for Uncertainty Estimation" (CVPR 2021).☆103Updated 2 months ago
- Code base for SRSGD.☆28Updated 5 years ago
- ☆42Updated 2 years ago
- ☆35Updated 3 years ago
- Code for Sanity-Checking Pruning Methods: Random Tickets can Win the Jackpot☆42Updated 5 years ago
- Implementation of Kronecker Attention in Pytorch☆19Updated 5 years ago
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch)☆34Updated 2 years ago
- Code for BlockSwap (ICLR 2020).☆33Updated 4 years ago
- Spectral Tensor Train Parameterization of Deep Learning Layers☆17Updated 4 years ago
- Differentiable Optimizers with Perturbations in Pytorch☆69Updated 4 years ago