proger / accelerated-scan
Accelerated First Order Parallel Associative Scan
☆180 Updated 9 months ago
Alternatives and similar repositories for accelerated-scan
Users who are interested in accelerated-scan are comparing it to the libraries listed below.
- A MAD laboratory to improve AI architecture designs 🧪 ☆116 Updated 5 months ago
- This repository contains the experimental PyTorch native float8 training UX ☆222 Updated 10 months ago
- Experiment of using Tangent to autodiff triton ☆78 Updated last year
- Understand and test language model architectures on synthetic tasks. ☆195 Updated 2 months ago
- FlashFFTConv: Efficient Convolutions for Long Sequences with Tensor Cores ☆317 Updated 5 months ago
- A library for unit scaling in PyTorch ☆125 Updated 6 months ago
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff" ☆233 Updated 3 months ago
- JAX bindings for Flash Attention v2 ☆88 Updated 10 months ago
- supporting pytorch FSDP for optimizers ☆79 Updated 5 months ago
- Efficient optimizers ☆206 Updated this week
- Some preliminary explorations of Mamba's context scaling. ☆212 Updated last year
- ☆144 Updated 2 years ago
- Triton-based implementation of Sparse Mixture of Experts. ☆216 Updated 6 months ago
- [ICLR 2025] Official PyTorch Implementation of Gated Delta Networks: Improving Mamba2 with Delta Rule ☆166 Updated 2 months ago
- 🧱 Modula software package ☆194 Updated 2 months ago
- ☆267 Updated 10 months ago
- Explorations into the recently proposed Taylor Series Linear Attention ☆98 Updated 9 months ago
- Normalized Transformer (nGPT) ☆181 Updated 6 months ago
- Griffin MQA + Hawk Linear RNN Hybrid ☆86 Updated last year
- Implementation of GateLoop Transformer in Pytorch and Jax ☆88 Updated 11 months ago
- FlashRNN - Fast RNN Kernels with I/O Awareness ☆90 Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆126 Updated 3 weeks ago
- Fast and memory-efficient exact attention ☆68 Updated 2 months ago
- seqax = sequence modeling + JAX ☆155 Updated last month
- ☆78 Updated 10 months ago
- LoRA for arbitrary JAX models and functions ☆135 Updated last year
- ☆108 Updated last year
- ☆182 Updated 5 months ago
- A simple library for scaling up JAX programs ☆136 Updated 7 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton. ☆544 Updated this week