AlirezaAzadbakht / kernel-sharing
Drastically Reducing the Number of Trainable Parameters in Deep CNNs by Inter-layer Kernel-sharing
☆13 Updated 2 years ago
Alternatives and similar repositories for kernel-sharing
Users who are interested in kernel-sharing are comparing it to the libraries listed below.
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform. ☆13 Updated 4 years ago
- High-performance tokenized language data-loader for Python C++ extension ☆13 Updated last year
- Benchmarking PyTorch 2.0 different models ☆20 Updated 2 years ago
- Implementation of a holodeck, written in Pytorch ☆18 Updated 2 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch ☆36 Updated 4 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012 ☆49 Updated 3 years ago
- Repository for CPU Kernel Generation for LLM Inference ☆26 Updated 2 years ago
- JAX Scalify: end-to-end scaled arithmetics ☆16 Updated last year
- Implementation of Hyena Hierarchy in JAX ☆10 Updated 2 years ago
- Visualising Losses in Deep Neural Networks ☆16 Updated last year
- "PyTorch in Rust" ☆17 Updated last year
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to… ☆25 Updated last week
- A simple program to calculate and visualize the FLOPs and Parameters of Pytorch models, with handy CLI and easy-to-use Python API. ☆131 Updated 11 months ago
- ☆22 Updated 10 months ago
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa… ☆24 Updated 4 years ago
- ☆54 Updated 3 years ago
- RWKV model implementation ☆38 Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant … ☆14 Updated last year
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro… ☆47 Updated 2 months ago
- Training hybrid models for dummies. ☆27 Updated 2 weeks ago
- CHAI is a library for dynamic pruning of attention heads for efficient LLM inference. ☆22 Updated 10 months ago
- A minimal TPU compatible Jax implementation of NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis. ☆13 Updated 3 years ago
- HyPe: Better Pre-trained Language Model Fine-tuning with Hidden Representation Perturbation [ACL 2023] ☆14 Updated 2 years ago
- This is the official repo for Gradient Agreement Filtering (GAF). ☆24 Updated 9 months ago
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)" ☆21 Updated 2 years ago
- Official PyTorch implementation of LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification ☆47 Updated 3 years ago
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022]. ☆17 Updated 3 years ago
- Official Code Repository for the paper "Key-value memory in the brain" ☆29 Updated 8 months ago
- Rust bindings for CTranslate2 ☆14 Updated 2 years ago