glassroom / heinsen_sequenceLinks

Code implementing "Efficient Parallelization of a Ubiquitious Sequential Computation" (Heinsen, 2023)

☆94

Alternatives and similar repositories for heinsen_sequence

Users that are interested in heinsen_sequence are comparing it to the libraries listed below

Sorting:

johnryan465 / pscan
☆40Updated last year
dvruette / barrel-rec-pytorch
☆53Updated last year
shikaiqiu / compute-better-spent
☆53Updated 9 months ago
lixilinx / psgd_torch
Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition…
☆179Updated last month
proger / accelerated-scan
Accelerated First Order Parallel Associative Scan
☆182Updated 10 months ago
vvvm23 / mamba-jax
Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX
☆84Updated last year
modula-systems / modula
🧱 Modula software package
☆204Updated 3 months ago
MatX-inc / seqax
seqax = sequence modeling + JAX
☆165Updated last month
srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆79Updated last year
GallagherCommaJack / modulax
☆17Updated 10 months ago
athms / mad-lab
A MAD laboratory to improve AI architecture designs 🧪
☆123Updated 6 months ago
proger / nanokitchen
Parallel Associative Scan for Language Models
☆18Updated last year
google-deepmind / nanodo
☆273Updated last year
lxxue / prefix_sum
A PyTorch wrapper of parallel exclusive scan in CUDA
☆12Updated 2 years ago
HazyResearch / train-tk
train with kittens!
☆61Updated 8 months ago
AndPotap / einsum-search
☆32Updated 9 months ago
cloneofsimo / min-fsdp
☆79Updated last year
google-deepmind / spectral_ssm
☆32Updated last year
young-geng / mintext
Minimal but scalable implementation of large language models in JAX
☆35Updated last week
irhum / hyena
JAX/Flax implementation of the Hyena Hierarchy
☆34Updated 2 years ago
HazyResearch / zoology
Understand and test language model architectures on synthetic tasks.
☆219Updated last month
davisyoshida / qax
If it quacks like a tensor...
☆58Updated 8 months ago
graphcore-research / unit-scaling
A library for unit scaling in PyTorch
☆125Updated 7 months ago
berlino / seq_icl
☆53Updated last year
google-research / jestimator
Amos optimizer with JEstimator lib.
☆82Updated last year
nshepperd / flash_attn_jax
JAX bindings for Flash Attention v2
☆90Updated last year
jopetty / word-problem
Experiments on the impact of depth in transformers and SSMs.
☆32Updated 8 months ago
KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆35Updated last year
ClashLuke / tpucare
Automatically take good care of your preemptible TPUs
☆36Updated 2 years ago
davisyoshida / lorax
LoRA for arbitrary JAX models and functions
☆140Updated last year