Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in All Domains" (Heinsen, 2019), for composing deep neural networks.
☆172Apr 13, 2023Updated 3 years ago
Alternatives and similar repositories for heinsen_routing
Users that are interested in heinsen_routing are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- BinaryVice is better than term_to_binary/1 at serializing structured Erlang data.☆25Oct 12, 2009Updated 16 years ago
- Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)☆25Jun 6, 2024Updated last year
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)☆98Dec 5, 2024Updated last year
- Implementation of deep implicit attention in PyTorch☆66Aug 2, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing☆49Jan 27, 2022Updated 4 years ago
- Hidden Engrams: Long Term Memory for Transformer Model Inference☆35Jun 26, 2021Updated 4 years ago
- OCaml binding to LXC with idiomatic (and opionated) OCaml API design☆13Oct 25, 2019Updated 6 years ago
- Libhydrogen bindings for Erlang☆20Feb 10, 2019Updated 7 years ago
- A Building blocks for elixir CQRS segregated applications☆15Sep 25, 2019Updated 6 years ago
- ☆53Aug 21, 2025Updated 8 months ago
- Implementation of GateLoop Transformer in Pytorch and Jax☆92Jun 18, 2024Updated last year
- A *tuned* minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training☆120Aug 9, 2021Updated 4 years ago
- Mechanistic Interpretability for Transformer Models☆53Jun 1, 2022Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Plug-and-play IP rate limiter in C☆26Sep 1, 2025Updated 8 months ago
- A Haskell derived programming language for systems development.☆13Sep 18, 2018Updated 7 years ago
- softpool implementation(Refining activation downsampling with SoftPool) This is an unofficial implementation. https://arxiv.org/pdf/2101.…☆15Jan 20, 2021Updated 5 years ago
- Erlang LRU cache☆12Mar 8, 2019Updated 7 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- A NIF module for Erlang to Mozilla's Spidermonkey Javascript runtime.☆13Apr 14, 2026Updated 3 weeks ago
- A Skew Binomial Heap for Erlang.☆15Jun 22, 2011Updated 14 years ago
- Optimizable stack of images at different resolutions, a useful representation of images for deep learning tasks. Docs: https://johnowhita…☆11Sep 8, 2022Updated 3 years ago
- Official Pytorch and JAX implementation of "Efficient-VDVAE: Less is more"☆200Aug 15, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …☆644Jul 17, 2023Updated 2 years ago
- Fun with wgpu: Simulating slime mold☆24Aug 22, 2024Updated last year
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆12Nov 25, 2021Updated 4 years ago
- This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…☆65May 14, 2023Updated 2 years ago
- PyTorch Capsule Layer☆31Mar 10, 2020Updated 6 years ago
- ☆50Mar 14, 2024Updated 2 years ago
- A PyTorch implementation of Parameter-sharing Capsule Network based on the paper "Evaluating Generalization Ability of Convolutional Neur…☆16Jan 15, 2022Updated 4 years ago
- Statistical discontinuous constituent parsing☆11Feb 15, 2018Updated 8 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Jan 16, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- JSON encode/decode library written in Erlang☆17Apr 12, 2025Updated last year
- Implementation of Mega, the Single-head Attention with Multi-headed EMA architecture that currently holds SOTA on Long Range Arena☆207Aug 26, 2023Updated 2 years ago
- The GraphBuilder library provides functions to construct large scale graphs. It is implemented on Apache Hadoop.☆100Oct 9, 2014Updated 11 years ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- ☆11Apr 14, 2022Updated 4 years ago
- GPU Accelerated, Distributed, Actor Model Language (WIP)☆30Jun 21, 2023Updated 2 years ago
- Official Pytorch code for (AAAI 2020) paper "Capsule Routing via Variational Bayes", https://arxiv.org/pdf/1905.11455.pdf☆101Jul 15, 2021Updated 4 years ago