glassroom / heinsen_routingLinks

Reference implementation of "An Algorithm for Routing Vectors in Sequences" (Heinsen, 2022) and "An Algorithm for Routing Capsules in All Domains" (Heinsen, 2019), for composing deep neural networks.

☆172

Alternatives and similar repositories for heinsen_routing

Users that are interested in heinsen_routing are comparing it to the libraries listed below

Sorting:

IDSIA / modern-srwm
Official repository for the paper "A Modern Self-Referential Weight Matrix That Learns to Modify Itself" (ICML 2022 & NeurIPS 2021 Deep R…
☆172Updated 5 months ago
glassroom / heinsen_sequence
Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)
☆97Updated 11 months ago
Futrell / ziplm
☆254Updated 2 years ago
tech-srl / RASP
An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"
☆322Updated last year
lucidrains / memorizing-transformers-pytorch
Implementation of Memorizing Transformers (ICLR 2022), attention net augmented with indexing and retrieval of memories using approximate …
☆637Updated 2 years ago
kir-gadjello / zipslicer
A library for incremental loading of large PyTorch checkpoints
☆56Updated 2 years ago
kingoflolz / swarm-jax
Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes
☆242Updated 2 years ago
geov-ai / geov
The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121Updated 2 years ago
google-research / jestimator
Amos optimizer with JEstimator lib.
☆82Updated last year
FlorianDietz / comgra
A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different…
☆293Updated 11 months ago
jxiw / BiGS
Official Repository of Pretraining Without Attention (BiGS), BiGS is the first model to achieve BERT-level transfer learning on the GLUE …
☆115Updated last year
RobertRiachi / nanoPALM
☆144Updated 2 years ago
cair / tmu
Implements the Tsetlin Machine, Coalesced Tsetlin Machine, Convolutional Tsetlin Machine, Regression Tsetlin Machine, and Weighted Tsetli…
☆158Updated 3 months ago
HazyResearch / H3
Language Modeling with the H3 State Space Model
☆519Updated 2 years ago
HomebrewML / HomebrewNLP-torch
A case study of efficient training of large language models using commodity hardware.
☆68Updated 3 years ago
BlinkDL / SmallInitEmb
LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence
☆61Updated 3 years ago
cyrilou242 / ftcc
Fast Text Classification with Compressors dictionary
☆150Updated 2 years ago
neurallambda / awesome-reasoning
a curated list of data for reasoning ai
☆140Updated last year
pbelcak / fastfeedforward
A repository for log-time feedforward networks
☆223Updated last year
lucidrains / PaLM-jax
Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework)
☆189Updated 3 years ago
jxbz / agd
Automatic gradient descent
☆215Updated 2 years ago
srush / raspy
An interactive exploration of Transformer programming.
☆270Updated 2 years ago
OswaldHe / HMT-pytorch
[NAACL 2025] Official Implementation of "HMT: Hierarchical Memory Transformer for Long Context Language Processing"
☆76Updated 5 months ago
codekansas / rwkv
RWKV model implementation
☆38Updated 2 years ago
eth-sri / language-model-arithmetic
Controlled Text Generation via Language Model Arithmetic
☆223Updated last year
lucidrains / simple-hierarchical-transformer
Experiments around a simple idea for inducing multiple hierarchical predictive model within a GPT
☆224Updated last year
simran-arora / focus
This repo contains code for the paper: "Can Foundation Models Help Us Achieve Perfect Secrecy?"
☆24Updated 2 years ago
EleutherAI / improved-t5
Experiments for efforts to train a new and improved t5
☆76Updated last year
jmward01 / lmplay
A playground to make it easy to try crazy things
☆33Updated this week
jeffbinder / promptarray
Text generator prompting with Boolean operators
☆181Updated last week