wjxts / RegularizedBN
☆21 · Updated last year
Related projects
Alternatives and complementary repositories for RegularizedBN
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 · Updated last year
- [ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal… ☆44 · Updated last year
- Curse-of-memory phenomenon of RNNs in sequence modelling ☆19 · Updated this week
- Mixture of Attention Heads ☆39 · Updated 2 years ago
- [ICLR 2021] "Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning" by Tianlong Chen*, Zhenyu Zhang*, Sijia Liu, S… ☆23 · Updated 2 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆52 · Updated last month
- Code for T-MARS data filtering ☆35 · Updated last year
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning" (NeurIPS 2022 Outstanding Paper Award) ☆53 · Updated last year
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023. ☆32 · Updated last year
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts" ☆47 · Updated last year
- HGRN2: Gated Linear RNNs with State Expansion ☆49 · Updated 3 months ago
- EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023) ☆22 · Updated last year
- Gradient-based Hyperparameter Optimization Over Long Horizons ☆12 · Updated 3 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers ☆25 · Updated 2 years ago
- Structured Pruning Adapters in PyTorch