AlirezaAzadbakht / kernel-sharing
Drastically Reducing the Number of Trainable Parameters in Deep CNNs by Inter-layer Kernel-sharing
☆12 Updated last year
Alternatives and similar repositories for kernel-sharing:
Users who are interested in kernel-sharing are comparing it to the libraries listed below.
- High-performance tokenized language data-loader for Python C++ extension ☆12 Updated 7 months ago
- Benchmarking PyTorch 2.0 different models ☆21 Updated last year
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa… ☆22 Updated 3 years ago
- ☆48 Updated 2 years ago
- Code for the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform. ☆12 Updated 3 years ago
- A library to train and deploy quantised Deep Neural Networks ☆21 Updated 2 months ago
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA ☆16 Updated 2 years ago
- [ICML 2022] "Coarsening the Granularity: Towards Structurally Sparse Lottery Tickets" by Tianlong Chen, Xuxi Chen, Xiaolong Ma, Yanzhi Wa… ☆32 Updated last year
- Deep learning for spiking neural networks ☆69 Updated 9 months ago
- Official PyTorch implementation of LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification ☆46 Updated 2 years ago
- ☆28 Updated 7 months ago
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to… ☆23 Updated this week
- Parameterizing Mixing Links in Sparse Factors Works Better than Dot-Product Self-Attention (CVPR 2022) ☆20 Updated 2 years ago
- E2E AutoML Model Compression Package ☆46 Updated 3 weeks ago
- ⛰️ RockyML - A High-Performance Scientific Computing Framework for Non-smooth Machine Learning Problems ☆19 Updated last year
- A compressed alternative to matrix multiplication using state-of-the-art compression ROBE-Z ☆9 Updated last year
- ☆12 Updated 4 months ago
- Loop Nest - Linear algebra compiler and code generator. ☆22 Updated 2 years ago
- The official repository of Quamba ☆23 Updated 3 months ago
- Interpretability analysis of language model outlier and attempts to distill the model ☆13 Updated last year
- A collection of research papers on efficient training of DNNs ☆70 Updated 2 years ago
- Fork of upstream onnxruntime focused on supporting risc-v accelerators ☆83 Updated last year
- Official implementation for the paper "Understanding Hyperdimensional Computing for Parallel Single-Pass Learning" ☆18 Updated last year
- CS4362 - Hardware Description Languages. Implemented SNN on an FPGA for real-time image processing using VHDL ☆11 Updated last year
- [FPGA'21] CoDeNet is an efficient object detection model on PyTorch, with SOTA performance on VOC and COCO based on CenterNet and Co-Desi… ☆25 Updated 2 years ago
- Open Source Projects from Pallas Lab ☆20 Updated 3 years ago
- Design, train and compile neural networks optimized specifically for FPGAs. ☆17 Updated this week
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. ☆13 Updated 2 years ago
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format. ☆14 Updated 3 years ago