automl / HW-GPT-Bench
HW-GPT-Bench: Hardware-Aware Architecture Benchmark for Language Models
☆18Updated last month
Alternatives and similar repositories for HW-GPT-Bench:
Users that are interested in HW-GPT-Bench are comparing it to the libraries listed below
- The first collection of surrogate benchmarks for Joint Architecture and Hyperparameter Search.☆15Updated last year
- ☆13Updated 2 years ago
- ☆76Updated 9 months ago
- ☆26Updated 7 months ago
- Smooth Variational Graph Embeddings for Efficient Neural Architecture Search☆15Updated last year
- ☆15Updated 3 months ago
- [ICLR '21] Interpretable Neural Architecture Search using Bayesian Optimisation with Weisfiler-Lehman Kernel (NAS-BOWL)☆24Updated 3 years ago
- ☆23Updated last year
- ☆35Updated 3 years ago
- The official implementation of PFNs4BO: In-Context Learning for Bayesian Optimization☆24Updated 10 months ago
- [ICML2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, ying Ding, and Zhangyang Wang☆27Updated 2 years ago
- This repository contains the publishable code for CVPR 2021 paper TransNAS-Bench-101: Improving Transferrability and Generalizability of …☆22Updated last year
- Code for the Population-Based Bandits Algorithm, presented at NeurIPS 2020.☆20Updated 3 years ago
- ☆14Updated 3 years ago
- Official repository for "Construction of Hierarchical Neural Architecture Search Spaces based on Context-free Grammars" (NeurIPS 2023)☆16Updated last year
- [ICLR 2023] 'Revisiting Pruning At Initialization Through The Lens of Ramanujan Graph" by Duc Hoang, Shiwei Liu, Radu Marculescu, Atlas W…☆12Updated last year
- [NeurIPS‘2021] "MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge", Geng Yuan, Xiaolong Ma, Yanzhi Wang et al…☆18Updated 2 years ago
- "Understanding and Accelerating Neural Architecture Search with Training-Free and Theory-Grounded Metrics" by Wuyang Chen, Xinyu Gong, Yu…☆26Updated last year
- Encodings for neural architecture search☆29Updated 3 years ago
- [ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, De…☆46Updated last year
- Implementation of Continuous Sparsification, a method for pruning and ticket search in deep networks☆33Updated 2 years ago
- ☆18Updated 5 years ago
- Code for our ICLR'2021 paper "DrNAS: Dirichlet Neural Architecture Search"☆44Updated 3 years ago
- Lightweight torch implementation of rigl, a sparse-to-sparse optimizer.☆56Updated 3 years ago
- [ICML 2021 Oral] "CATE: Computation-aware Neural Architecture Encoding with Transformers" by Shen Yan, Kaiqiang Song, Fei Liu, Mi Zhang☆19Updated 3 years ago
- Comparison of method "Pruning at initialization prior to training" (Synflow/SNIP/GraSP) in PyTorch☆14Updated 8 months ago
- Generic Neural Architecture Search via Regression (NeurIPS'21 Spotlight)☆36Updated 2 years ago
- Introducing diverse tasks for NAS☆47Updated 2 years ago
- [ICLR 2023] Deep Ranking Ensembles for Hyperparameter Optimization☆13Updated 10 months ago
- Code to reproduce experiments from 'Does Knowledge Distillation Really Work' a paper which appeared in the 2021 NeurIPS proceedings.☆33Updated last year