mit-han-lab/hardware-aware-transformers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mit-han-lab/hardware-aware-transformers)

mit-han-lab / hardware-aware-transformers

[ACL'20] HAT: Hardware-Aware Transformers for Efficient Natural Language Processing

☆336

Alternatives and similar repositories for hardware-aware-transformers

Users that are interested in hardware-aware-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mit-han-lab / lite-transformer
View on GitHub
[ICLR 2020] Lite Transformer with Long-Short Range Attention
☆609Jul 11, 2024Updated 2 years ago
mit-han-lab / once-for-all
View on GitHub
[ICLR 2020] Once for All: Train One Network and Specialize it for Efficient Deployment
☆1,953Dec 14, 2023Updated 2 years ago
mit-han-lab / neurips-micronet
View on GitHub
[JMLR'20] NeurIPS 2019 MicroNet Challenge Efficient Language Modeling, Champion
☆41Feb 26, 2021Updated 5 years ago
mit-han-lab / apq
View on GitHub
[CVPR 2020] APQ: Joint Search for Network Architecture, Pruning and Quantization Policy
☆160Jun 16, 2020Updated 6 years ago
mit-han-lab / proxylessnas
View on GitHub
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
☆1,446Aug 30, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zhijian-liu / torchprofile
View on GitHub
Count the MACs / FLOPs of PyTorch models
☆643Mar 11, 2026Updated 4 months ago
mit-han-lab / haq
View on GitHub
[CVPR 2019, Oral] HAQ: Hardware-Aware Automated Quantization with Mixed Precision
☆408Feb 26, 2021Updated 5 years ago
GATECH-EIC / HW-NAS-Bench
View on GitHub
[ICLR 2021] HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark
☆118Apr 18, 2023Updated 3 years ago
facebookresearch / AlphaNet
View on GitHub
AlphaNet Improved Training of Supernet with Alpha-Divergence
☆99Aug 12, 2021Updated 4 years ago
changlin31 / DS-Net
View on GitHub
(CVPR 2021, Oral) Dynamic Slimmable Network
☆231Dec 31, 2021Updated 4 years ago
facebookresearch / mobile-vision
View on GitHub
Mobile vision models and code
☆921Updated this week
JiahuiYu / slimmable_networks
View on GitHub
Slimmable Networks, AutoSlim, and Beyond, ICLR 2019, and ICCV 2019
☆929Mar 9, 2023Updated 3 years ago
megvii-model / SinglePathOneShot
View on GitHub
☆267Oct 30, 2019Updated 6 years ago
xiusu / ViTAS
View on GitHub
Code for ViTAS_Vision Transformer Architecture Search
☆50Jul 22, 2021Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
mit-han-lab / amc
View on GitHub
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
☆449Nov 22, 2023Updated 2 years ago
enyac-group / single-path-nas
View on GitHub
Single-Path NAS: Designing Hardware-Efficient ConvNets in less than 4 Hours
☆394Dec 14, 2020Updated 5 years ago
google-research / nasbench
View on GitHub
NASBench: A Neural Architecture Search Dataset and Benchmark
☆720May 1, 2023Updated 3 years ago
sacmehta / delight
View on GitHub
DeLighT: Very Deep and Light-Weight Transformers
☆469Oct 16, 2020Updated 5 years ago
facebookresearch / NASViT
View on GitHub
code for NASViT
☆67Apr 25, 2022Updated 4 years ago
mit-han-lab / amc-models
View on GitHub
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
☆168Feb 26, 2021Updated 5 years ago
xiaomi-automl / FairDARTS
View on GitHub
Fair DARTS: Eliminating Unfair Advantages in Differentiable Architecture Search
☆177Oct 3, 2023Updated 2 years ago
1adrianb / binary-nas
View on GitHub
☆35Mar 4, 2020Updated 6 years ago
zhuhanqing / Lightening-Transformer-AE
View on GitHub
Artifact evaluation for HPCA'24 paper Lightening-Transformer: A Dynamically-operated Optically-interconnected Photonic Transformer Accele…
☆11Mar 3, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
mit-han-lab / gan-compression
View on GitHub
[CVPR 2020] GAN Compression: Efficient Architectures for Interactive Conditional GANs
☆1,116Jun 5, 2024Updated 2 years ago
meijieru / AtomNAS
View on GitHub
[ICLR 2020]: 'AtomNAS: Fine-Grained End-to-End Neural Architecture Search'
☆220Jun 8, 2020Updated 6 years ago
gatech-sysml / CompOFA
View on GitHub
[ICLR 2021] CompOFA: Compound Once-For-All Networks For Faster Multi-Platform Deployment
☆25Jan 5, 2023Updated 3 years ago
mit-han-lab / inter-operator-scheduler
View on GitHub
[MLSys 2021] IOS: Inter-Operator Scheduler for CNN Acceleration
☆201Apr 27, 2022Updated 4 years ago
JaminFong / DenseNAS
View on GitHub
Densely Connected Search Space for More Flexible Neural Architecture Search (CVPR2020)
☆292Jul 4, 2020Updated 6 years ago
CanyonWind / Single-Path-One-Shot-NAS-MXNet
View on GitHub
Single Path One-Shot NAS MXNet implementation with full training and searching pipeline. Support both Block and Channel Selection. Search…
☆150Mar 27, 2020Updated 6 years ago
changlin31 / DNA
View on GitHub
(CVPR 2020) Block-wisely Supervised Neural Architecture Search with Knowledge Distillation
☆234Sep 23, 2021Updated 4 years ago
hsharma35 / bitfusion
View on GitHub
Simulator for BitFusion
☆103Aug 6, 2020Updated 5 years ago
alexa / bort
View on GitHub
Repository for the paper "Optimal Subarchitecture Extraction for BERT"
☆470Jun 22, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
pku-liang / FlexTensor
View on GitHub
Automatic Schedule Exploration and Optimization Framework for Tensor Computations
☆184Apr 25, 2022Updated 4 years ago
yukang2017 / NAS-quantization
View on GitHub
The code for Joint Neural Architecture Search and Quantization
☆14Apr 10, 2019Updated 7 years ago
mit-han-lab / tinyml
View on GitHub
☆1,191Nov 29, 2023Updated 2 years ago
haolibai / APS-channel-search
View on GitHub
Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020
☆21Nov 15, 2020Updated 5 years ago
D-X-Y / AutoDL-Projects
View on GitHub
Automated deep learning algorithms implemented in PyTorch.
☆1,581Apr 24, 2022Updated 4 years ago
csyhhu / MetaQuant
View on GitHub
Codes for Accepted Paper : "MetaQuant: Learning to Quantize by Learning to Penetrate Non-differentiable Quantization" in NeurIPS 2019
☆54May 8, 2020Updated 6 years ago
kcyu2014 / multi-model-forgetting
View on GitHub
ICML2019 Accepted Paper. Overcoming Multi-Model Forgetting
☆14Jun 5, 2019Updated 7 years ago