miaozhang0525 / iDARTSLinks
codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients
☆10Updated 4 years ago
Alternatives and similar repositories for iDARTS
Users that are interested in iDARTS are comparing it to the libraries listed below
Sorting:
- Benchmarking Attention Mechanism in Vision Transformers.☆19Updated 3 years ago
- [ICML 2021] "Efficient Lottery Ticket Finding: Less Data is More" by Zhenyu Zhang*, Xuxi Chen*, Tianlong Chen*, Zhangyang Wang☆26Updated 4 years ago
- Experiments from "The Generalization-Stability Tradeoff in Neural Network Pruning": https://arxiv.org/abs/1906.03728.☆14Updated 5 years ago
- Automated neural architecture search algorithms implemented in PyTorch and Autogluon toolkit.☆12Updated 5 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆25Updated 10 months ago
- ☆57Updated 4 years ago
- Paper List for In-context Learning 🌷☆20Updated 3 years ago
- Data-free knowledge distillation using Gaussian noise (NeurIPS paper)☆15Updated 2 years ago
- [CVPR 2021] "The Lottery Tickets Hypothesis for Supervised and Self-supervised Pre-training in Computer Vision Models" Tianlong Chen, Jon…☆68Updated 3 years ago
- ☆38Updated last year
- ☆13Updated 4 years ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆41Updated 4 months ago
- Paper and Code for "Curriculum Learning by Optimizing Learning Dynamics" (AISTATS 2021)☆19Updated 4 years ago
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch)☆34Updated 2 years ago
- [ICLR 2022] "As-ViT: Auto-scaling Vision Transformers without Training" by Wuyang Chen, Wei Huang, Xianzhi Du, Xiaodan Song, Zhangyang Wa…☆76Updated 3 years ago
- ☆27Updated 3 years ago
- Code for ViTAS_Vision Transformer Architecture Search☆51Updated 4 years ago
- [BMVC '21] DU-DARTS: Decreasing the Uncertainty of Differentiable Architecture Search☆13Updated 4 years ago
- Codes for DATA: Differentiable ArchiTecture Approximation.☆11Updated 4 years ago
- Implementation for <Orthogonal Over-Parameterized Training> in CVPR'21.☆22Updated 4 years ago
- BESA is a differentiable weight pruning technique for large language models.☆17Updated last year
- [NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective☆37Updated 3 years ago
- [CVPR 2021] Contrastive Neural Architecture Search with Neural Architecture Comparators☆40Updated 3 years ago
- Official implementation for paper "Relational Surrogate Loss Learning", ICLR 2022☆37Updated 3 years ago
- [CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model☆18Updated last year
- [NeurIPS 2024] Search for Efficient LLMs☆16Updated last year
- [ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Che…☆82Updated 4 years ago
- codes for Neural Architecture Ranker and detailed cell information datasets based on NAS-Bench series☆12Updated 3 years ago
- Codes for Understanding Architectures Learnt by Cell-based Neural Architecture Search☆28Updated 5 years ago