aaronserianni / training-free-nasLinks

[ACL'22] Training-free Neural Architecture Search for RNNs and Transformers

☆14

Alternatives and similar repositories for training-free-nas

Users that are interested in training-free-nas are comparing it to the libraries listed below

Sorting:

NVlabs / SMCP
☆21Updated 2 years ago
MingSun-Tse / Why-the-State-of-Pruning-so-Confusing
[Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…
☆40Updated 2 years ago
pprp / CVPR2022-NAS-competition-Track1-3th-solution
Implementation of PGONAS for CVPR22W and RD-NAS for ICASSP23
☆22Updated 2 years ago
OSVAI / NORM
The official project website of "NORM: Knowledge Distillation via N-to-One Representation Matching" (The paper of NORM is published in IC…
☆20Updated last year
great8nctu / FOX-NAS
FOX-NAS: Fast, On-device and Explainable NeuralArchitecture Search
☆11Updated 3 years ago
CownowAn / DaSS
Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)
☆24Updated last year
MingSun-Tse / TPP
[ICLR'23] Trainability Preserving Neural Pruning (PyTorch)
☆33Updated 2 years ago
shawnricecake / search-llm
[NeurIPS 2024] Search for Efficient LLMs
☆14Updated 6 months ago
lliai / EMQ-series
[ICCV-2023] EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization
☆26Updated last year
OpenGVLab / LLMPrune-BESA
BESA is a differentiable weight pruning technique for large language models.
☆17Updated last year
ZiweiWangTHU / GMPQ
This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…
☆25Updated 3 years ago
cogsys-tuebingen / uninas
A highly modular PyTorch framework with a focus on Neural Architecture Search (NAS).
☆23Updated 3 years ago
megvii-model / RLNAS
☆20Updated 2 years ago
ziplab / EcoFormer
[NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"
☆72Updated 2 years ago
VITA-Group / SFW-Once-for-All-Pruning
[ICLR 2022] "Learning Pruning-Friendly Networks via Frank-Wolfe: One-Shot, Any-Sparsity, and No Retraining" by Lu Miao*, Xiaolong Luo*, T…
☆30Updated 3 years ago
bychen515 / GLiT
☆24Updated 3 years ago
VITA-Group / WeakNAS
[NeurIPS 2021] “Stronger NAS with Weaker Predictors“, Junru Wu, Xiyang Dai, Dongdong Chen, Yinpeng Chen, Mengchen Liu, Ye Yu, Zhangyang W…
☆27Updated 2 years ago
bestfleer / RepNAS
Code for RepNAS
☆13Updated 3 years ago
VITA-Group / UVC
[ICLR 2022] "Unified Vision Transformer Compression" by Shixing Yu*, Tianlong Chen*, Jiayi Shen, Huan Yuan, Jianchao Tan, Sen Yang, Ji Li…
☆53Updated last year
lliai / DisWOT-CVPR2023
☆26Updated last year
Model-Compression / Lossless_Compression
We propose a lossless compression algorithm based on the NTK matrix for DNN. The compressed network yields asymptotically the same NTK a…
☆24Updated last year
HuangOwen / QAT-ACS
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆33Updated 10 months ago
lliai / Auto-Prox-AAAI24
Auto-Prox-AAAI24
☆13Updated last year
selkerdawy / FTWT
Fire Together Wire Together: A Dynamic Pruning Approach with Self-Supervised Mask Prediction
☆11Updated 3 years ago
megvii-research / Arch-Net
Arch-Net: Model Distillation for Architecture Agnostic Model Deployment
☆23Updated 3 years ago
VascoLopes / EPE-NAS
☆26Updated 3 years ago
cmd2001 / KVTuner
KVTuner: Sensitivity-Aware Layer-wise Mixed Precision KV Cache Quantization for Efficient and Nearly Lossless LLM Inference
☆15Updated 2 months ago
MAC-AutoML / OMPQ
☆25Updated 3 years ago
ziplab / QLLM
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆27Updated last year
chenjoya / dropit
DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training (ICLR 2023)
☆31Updated 2 years ago