gbup-group / EAN-efficient-attention-networkView external linksLinks
The implementation of paper ''Efficient Attention Network: Accelerate Attention by Searching Where to Plug''.
☆20Jun 16, 2023Updated 2 years ago
Alternatives and similar repositories for EAN-efficient-attention-network
Users that are interested in EAN-efficient-attention-network are comparing it to the libraries listed below
Sorting:
- The official implementation of paper "Drop-Activation: Implicit Parameter Reduction and Harmonious Regularization".☆10May 30, 2019Updated 6 years ago
- The official implementation of paper "Instance Enhancement Batch Normalization: an Adaptive Regulator of Batch Noise".☆41Jun 16, 2023Updated 2 years ago
- ☆20Oct 7, 2025Updated 4 months ago
- Implementation for <Regularizing Neural Networks via Minimizing Hyperspherical Energy> in CVPR'20.☆24Jun 23, 2020Updated 5 years ago
- Spell and pronounce words with a neural network☆10Feb 13, 2017Updated 9 years ago
- Hamiltonian neural network implementation for Henon Heiles dynamical system learning mix of order and chaos☆11Dec 2, 2023Updated 2 years ago
- Official PyTorch implementation of the paper : ProbAct: A Probabilistic Activation Function for Deep Neural Networks.☆13Jun 10, 2019Updated 6 years ago
- WWW'24, Mirror Gradient (MG) makes multimodal recommendation models approach flat local minima easier compared to models with normal trai…☆17Nov 1, 2024Updated last year
- Implementation of the Heterogeneous Knowledge Distillation using Information Flow Modeling method☆25May 25, 2020Updated 5 years ago
- The official implementation of two AI-enhanced numerical solvers: NeurVec (Sci. Rep.) and AttNS (ICML'24)☆27May 21, 2024Updated last year
- The final project of Advance Machine Learning course in Tsinghua University. This project aims to make a color transfer of animes charact…☆31Dec 21, 2020Updated 5 years ago
- This repository provides code source used in the paper: A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth Trade-Off☆13May 30, 2019Updated 6 years ago
- structured sparsity regularization☆14Oct 12, 2019Updated 6 years ago
- Data-free knowledge distillation using Gaussian noise (NeurIPS paper)☆15Mar 24, 2023Updated 2 years ago
- Self-Distillation with weighted ground-truth targets; ResNet and Kernel Ridge Regression☆19Oct 12, 2021Updated 4 years ago
- pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsup…☆18Mar 23, 2020Updated 5 years ago
- Codes of Centripetal SGD☆64Sep 8, 2022Updated 3 years ago
- A Python module for estimating divergence between two sets of samples.☆18Jul 6, 2023Updated 2 years ago
- This is the official repository for Batch Level Distillation (BLD)☆15Jan 25, 2021Updated 5 years ago
- Sparse Switchable Normalization with sparse activation function SparestMax☆65Aug 12, 2019Updated 6 years ago
- ☆19Jan 13, 2021Updated 5 years ago
- Mathematical consequences of orthogonal weights initialization and regularization in deep learning. Experiments with gain-adjusted orthog…☆17Sep 21, 2019Updated 6 years ago
- ☆20Nov 20, 2020Updated 5 years ago
- Code for "ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models" (ICLR 2024)☆20Feb 16, 2024Updated 2 years ago
- PyTorch implementation of the paper The Lottery Ticket Hypothesis for Object Recognition☆23Apr 22, 2021Updated 4 years ago
- The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…☆20Sep 18, 2023Updated 2 years ago
- ☆25May 20, 2020Updated 5 years ago
- Official PyTorch implementation for our ICCV 2019 paper - Fooling Network Interpretation in Image Classification☆24Nov 21, 2019Updated 6 years ago
- ☆21Aug 10, 2022Updated 3 years ago
- [NeurIPS'20] GradAug: A New Regularization Method for Deep Neural Networks☆94Dec 16, 2020Updated 5 years ago
- ☆24Feb 18, 2021Updated 4 years ago
- PyTorch and Torch implementation for our accepted CVPR 2020 paper (Oral): Controllable Orthogonalization in Training DNNs☆24Jan 19, 2021Updated 5 years ago
- Interpolation between Residual and Non-Residual Networks, ICML 2020. https://arxiv.org/abs/2006.05749☆26Aug 16, 2020Updated 5 years ago
- Data-Driven Neuron Allocation for Scale Aggregation Networks☆53Apr 29, 2019Updated 6 years ago
- ☆23Oct 27, 2019Updated 6 years ago
- Code for Active Mixup in 2020 CVPR☆23Jan 11, 2022Updated 4 years ago
- ☆112May 17, 2021Updated 4 years ago
- Code for the paper "Understanding Generalization through Visualizations"☆65Jan 15, 2021Updated 5 years ago
- Fine-grained ImageNet annotations☆30May 25, 2020Updated 5 years ago