proger / nanokitchen
Parallel Associative Scan for Language Models
☆18 Updated 10 months ago
Related projects
Alternatives and complementary repositories for nanokitchen
- ☆36 Updated 10 months ago
- Efficient PScan implementation in PyTorch ☆15 Updated 10 months ago
- Blog post ☆16 Updated 9 months ago
- Triton Implementation of HyperAttention Algorithm ☆46 Updated 11 months ago
- ☆45 Updated 9 months ago
- ☆50 Updated 6 months ago
- Awesome Triton Resources ☆18 Updated last month
- Experiment of using Tangent to autodiff triton ☆72 Updated 9 months ago
- ☆29 Updated 2 months ago
- ☆46 Updated last month
- ☆24 Updated 8 months ago
- Minimal but scalable implementation of large language models in JAX ☆26 Updated 2 weeks ago
- ☆35 Updated 7 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 Updated 5 months ago
- ☆18 Updated last month
- ☆21 Updated last month
- If it quacks like a tensor... ☆52 Updated last week
- ☆28 Updated 7 months ago
- ☆53 Updated 3 weeks ago
- Implementation of Hyena Hierarchy in JAX ☆10 Updated last year
- ☆53 Updated 10 months ago
- Source-to-Source Debuggable Derivatives in Pure Python ☆14 Updated 9 months ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆79 Updated 9 months ago
- ☆16 Updated 2 months ago
- JAX/Flax implementation of the Hyena Hierarchy ☆31 Updated last year
- ☆29 Updated 2 years ago
- ☆11 Updated last year
- ☆25 Updated last month
- ☆31 Updated 10 months ago