AlirezaAzadbakht / kernel-sharingLinks
Drastically Reducing the Number of Trainable Parameters in Deep CNNs by Inter-layer Kernel-sharing
☆14Updated 2 years ago
Alternatives and similar repositories for kernel-sharing
Users that are interested in kernel-sharing are comparing it to the libraries listed below
Sorting:
- High-performance tokenized language data-loader for Python C++ extension☆14Updated last year
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆13Updated 4 years ago
- Implementation of a holodeck, written in Pytorch☆18Updated 2 years ago
- JAX Scalify: end-to-end scaled arithmetics☆18Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Implementation of Hyena Hierarchy in JAX☆10Updated 2 years ago
- Implementation of Metaformer, but in an autoregressive manner☆26Updated 3 years ago
- Benchmarking PyTorch 2.0 different models☆20Updated 2 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- Visualising Losses in Deep Neural Networks☆16Updated last year
- A dashboard for exploring timm learning rate schedulers☆19Updated last year
- Toy genetic algorithm in Pytorch☆55Updated 9 months ago
- ☆29Updated last year
- ☆13Updated last month
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆55Updated 10 months ago
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆29Updated 5 years ago
- Official Code Repository for the paper "Key-value memory in the brain"☆31Updated 11 months ago
- Implementation of a Light Recurrent Unit in Pytorch☆49Updated last year
- In this repository, we explore model compression for transformer architectures via quantization. We specifically explore quantization awa…☆24Updated 4 years ago
- ☆24Updated last year
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…☆40Updated 2 years ago
- Utilities for Training Very Large Models☆58Updated last year
- A JAX nn library☆22Updated 4 months ago
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Updated last year
- This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.☆14Updated 4 years ago
- A GPT, made only of MLPs, in Jax☆59Updated 4 years ago
- Trying to deconstruct RWKV in understandable terms☆14Updated 2 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆36Updated 4 years ago
- ☆12Updated last year
- Flax Image Models - State-of-the-art pre-trained vision backbones for Flax.☆23Updated 7 months ago