AlirezaAzadbakht / kernel-sharing
Drastically Reducing the Number of Trainable Parameters in Deep CNNs by Inter-layer Kernel-sharing
☆12Updated last year
Related projects ⓘ
Alternatives and complementary repositories for kernel-sharing
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆12Updated 3 years ago
- ☆47Updated 2 years ago
- ☆13Updated last year
- Loop Nest - Linear algebra compiler and code generator.☆22Updated 2 years ago
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.☆17Updated last year
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆20Updated last week
- Benchmarking PyTorch 2.0 different models☆21Updated last year
- Official PyTorch implementation of LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification☆46Updated 2 years ago
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa…☆22Updated 3 years ago
- High-performance tokenized language data-loader for Python C++ extension☆12Updated 4 months ago
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA☆16Updated 2 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 2 years ago
- Open Source Projects from Pallas Lab☆20Updated 3 years ago
- ⛰️ RockyML - A High-Performance Scientific Computing Framework for Non-smooth Machine Learning Problems☆19Updated last year
- [ECCV 2022] SuperTickets: Drawing Task-Agnostic Lottery Tickets from Supernets via Jointly Architecture Searching and Parameter Pruning☆19Updated 2 years ago
- ACL 2023☆38Updated last year
- Curated content for DNN approximation, acceleration ... with a focus on hardware accelerator and deployment☆23Updated 6 months ago
- ☆12Updated 5 months ago
- Energy Consumption-Aware Tabular Benchmark For Neural Architecture Search☆10Updated 10 months ago
- A 8-/16-/32-/64-bit floating point number family☆16Updated 2 years ago
- Describe the format of image/text datasets☆11Updated 2 years ago
- ☆15Updated 10 months ago
- Library for the Test-based Calibration Error (TCE) metric to quantify the degree to classifier calibration.☆13Updated last year
- Official pytorch code for "APP: Anytime Progressive Pruning" (DyNN @ ICML, 2022; CLL @ ACML, 2022, SNN @ ICML, 2022 and SlowDNN 2023)☆17Updated 2 years ago
- Flexible simulator for mixed precision and format simulation of LLMs and vision transformers.☆43Updated last year
- A collection of research papers on efficient training of DNNs☆68Updated 2 years ago