AlirezaAzadbakht / kernel-sharing
Drastically Reducing the Number of Trainable Parameters in Deep CNNs by Inter-layer Kernel-sharing
☆13Updated 2 years ago
Alternatives and similar repositories for kernel-sharing:
Users that are interested in kernel-sharing are comparing it to the libraries listed below
- ☆49Updated 2 years ago
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa…☆24Updated 3 years ago
- ⛰️ RockyML - A High-Performance Scientific Computing Framework for Non-smooth Machine Learning Problems☆19Updated 2 years ago
- Code for High-Capacity Expert Binary Networks (ICLR 2021).☆27Updated 3 years ago
- Another implementation of the paper "Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs" in…☆13Updated 3 years ago
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆12Updated 3 years ago
- Benchmarking PyTorch 2.0 different models☆21Updated 2 years ago
- Official PyTorch implementation of LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification☆46Updated 2 years ago
- ☆10Updated last year
- JAX Scalify: end-to-end scaled arithmetics☆16Updated 5 months ago
- Loop Nest - Linear algebra compiler and code generator.☆22Updated 2 years ago
- Code used in the paper “Nonideality-Aware Training for Accurate and Robust Low-Power Memristive Neural Networks”☆12Updated last year
- Open Source Projects from Pallas Lab☆20Updated 3 years ago
- 🧠 Benchmark facility to train networks on different datasets for PyTorch/Brevitas☆25Updated 2 years ago
- Utilities for working with videos☆13Updated 3 years ago
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.☆19Updated last year
- GoldFinch and other hybrid transformer components☆10Updated 3 weeks ago
- 🐆 A compiler from AI model to RTL (Verilog) accelerator in FPGA hardware with auto design space exploration for *AdderNet*☆19Updated 11 months ago
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)☆21Updated last year
- Quantization-aware training with spiking neural networks☆40Updated 3 years ago
- Designs, infrastructure, and experiments around Race Logic☆25Updated 4 years ago
- CHAI is a library for dynamic pruning of attention heads for efficient LLM inference.☆13Updated 4 months ago
- Official implementation for Wavelet Feature Maps Compression for Image-to-Image CNNs, NeurIPS 2022.☆33Updated 2 years ago
- Model zoo for the Quantized ONNX (QONNX) model format☆12Updated 2 months ago
- Converting a deep neural network to integer-only inference in native C via uniform quantization and the fixed-point representation.☆23Updated 3 years ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆16Updated last year
- [TCAD 2021] Block Convolution: Towards Memory-Efficient Inference of Large-Scale CNNs on FPGA☆16Updated 2 years ago
- Recursive Leasting Squares (RLS) with Neural Network for fast learning☆53Updated last year
- Fully opensource spiking neural network accelerator☆141Updated 2 years ago
- Directed masked autoencoders☆14Updated 2 years ago