AlirezaAzadbakht / kernel-sharingLinks
Drastically Reducing the Number of Trainable Parameters in Deep CNNs by Inter-layer Kernel-sharing
☆13Updated 2 years ago
Alternatives and similar repositories for kernel-sharing
Users that are interested in kernel-sharing are comparing it to the libraries listed below
Sorting:
- ☆23Updated 9 months ago
- JAX Scalify: end-to-end scaled arithmetics☆16Updated 10 months ago
- Codes of the paper Deformable Butterfly: A Highly Structured and Sparse Linear Transform.☆13Updated 3 years ago
- High-performance tokenized language data-loader for Python C++ extension☆13Updated last year
- Visualising Losses in Deep Neural Networks☆16Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Updated last year
- Implementation of a holodeck, written in Pytorch☆18Updated last year
- ☆11Updated last year
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆53Updated 5 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Fork of Flame repo for training of some new stuff in development☆17Updated last week
- Training hybrid models for dummies.☆25Updated 8 months ago
- GoldFinch and other hybrid transformer components☆11Updated 2 months ago
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…☆46Updated last week
- Implementation of Hyena Hierarchy in JAX☆10Updated 2 years ago
- Rust bindings for CTranslate2☆14Updated 2 years ago
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022].☆17Updated 3 years ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Updated last year
- Implementation of Strassen attention, from Kozachinskiy et al. of National Center of AI in Chile☆41Updated 2 months ago
- CHAI is a library for dynamic pruning of attention heads for efficient LLM inference.☆19Updated 9 months ago
- Recursive Leasting Squares (RLS) with Neural Network for fast learning☆56Updated last year
- recipe for training fully-featured self supervised image jepa models☆10Updated 3 months ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆24Updated last year
- Code for the note "NF4 Isn't Information Theoretically Optimal (and that's Good)☆20Updated 2 years ago
- JAX implementations of RWKV☆19Updated last year
- Toy genetic algorithm in Pytorch☆53Updated 4 months ago
- RWKV model implementation☆38Updated 2 years ago
- Implementation of a Light Recurrent Unit in Pytorch☆48Updated 11 months ago
- How to use the Flax Linen API to build a convolutional neural network model and train it for image classification (using TensorFlow Datas…☆24Updated 2 years ago
- A dashboard for exploring timm learning rate schedulers☆19Updated 9 months ago