kvfrans / jax-diffusion-transformer
Implementation of Diffusion Transformer (DiT) in JAX
☆252Updated 5 months ago
Related projects ⓘ
Alternatives and complementary repositories for jax-diffusion-transformer
- For optimization algorithm research and development.☆449Updated this week
- UNet diffusion model in pure CUDA☆584Updated 4 months ago
- ☆197Updated 4 months ago
- Annotated version of the Mamba paper☆457Updated 8 months ago
- A subset of PyTorch's neural network modules, written in Python using OpenAI's Triton.☆483Updated 3 weeks ago
- Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI☆256Updated last week
- ☆133Updated 9 months ago
- Accelerated First Order Parallel Associative Scan☆163Updated 3 months ago
- Universal Tensor Operations in Einstein-Inspired Notation for Python.☆328Updated last month
- The Tensor (or Array)☆411Updated 3 months ago
- A Jax-based library for designing and training transformer models from scratch.☆276Updated 2 months ago
- The AdEMAMix Optimizer: Better, Faster, Older.☆172Updated 2 months ago
- Run PyTorch in JAX. 🤝☆200Updated last year
- ☆292Updated 4 months ago
- ☆82Updated 8 months ago
- seqax = sequence modeling + JAX☆133Updated 4 months ago
- Solve puzzles. Learn CUDA.☆61Updated 11 months ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆119Updated 3 months ago
- Simple Byte pair Encoding mechanism used for tokenization process . written purely in C☆120Updated last week
- 94% on CIFAR-10 in 2.6 seconds 💨 96% in 27 seconds☆177Updated last week
- A simple library for scaling up JAX programs☆127Updated 2 weeks ago
- ☆128Updated this week
- ☆139Updated 3 months ago
- A simple implimentation of Bayesian Flow Networks (BFN)☆239Updated 10 months ago
- The Multilayer Perceptron Language Model☆523Updated 3 months ago
- Legible, Scalable, Reproducible Foundation Models with Named Tensors and Jax☆516Updated this week
- σ-GPT: A New Approach to Autoregressive Models☆59Updated 3 months ago
- Helpful tools and examples for working with flex-attention☆469Updated 3 weeks ago
- Fast bare-bones BPE for modern tokenizer training☆142Updated last month
- ☆303Updated this week