srush / DiffRast
☆13, updated last year
Alternatives and similar repositories for DiffRast
Users who are interested in DiffRast are comparing it to the libraries listed below.
- Universal Notation for Tensor Operations in Python. ☆464, updated 9 months ago
- CIFAR-10 speedruns: 94% in 2.6 seconds and 96% in 27 seconds ☆352, updated 2 months ago
- Flexibly track outputs and grad-outputs of torch.nn.Module. ☆13, updated 2 years ago
- WIP ☆93, updated last year
- Code and weights for the paper "Cluster and Predict Latents Patches for Improved Masked Image Modeling" ☆130, updated 3 weeks ago
- ☆28, updated 4 months ago
- Flow-matching algorithms in JAX ☆115, updated last year
- seqax = sequence modeling + JAX ☆170, updated 6 months ago
- JAX bindings for Flash Attention v2 ☆103, updated this week
- Integral Neural Networks in PyTorch ☆127, updated last year
- Code accompanying the paper "Generalized Interpolating Discrete Diffusion" ☆112, updated 7 months ago
- ☆123, updated 7 months ago
- Implementation of Diffusion Transformer (DiT) in JAX ☆306, updated last year
- 🧱 Modula software package ☆322, updated 5 months ago
- ☆32, updated last year
- Landing repository for the paper "Softpick: No Attention Sink, No Massive Activations with Rectified Softmax" ☆86, updated 4 months ago
- ☆55, updated last year
- ☆55, updated last year
- The Energy Transformer block, in JAX ☆63, updated 2 years ago
- Cellular Automata Accelerated in JAX (Oral at ICLR 2025) ☆243, updated 2 months ago
- Video Diffusion Model. Autoregressive, long context, efficient training and inference. WIP ☆34, updated 5 months ago
- ☆214, updated last month
- Flash Attention Triton kernel with support for second-order derivatives ☆138, updated last month
- Jax Codebase for Evolutionary Strategies at the Hyperscale ☆216, updated last month
- Latent Program Network (from the "Searching Latent Program Spaces" paper) ☆107, updated 2 months ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p… ☆20, updated 6 months ago
- Experiment of using Tangent to autodiff triton ☆82, updated 2 years ago
- tenstorrent kernel from twitch ☆28, updated last year
- Attention in SRAM on Tenstorrent Grayskull ☆40, updated last year
- Annotated version of the Mamba paper ☆495, updated last year