titu1994 / simple_diffusion
Simple notebooks to learn diffusion models on toy datasets
☆17Updated 2 years ago
Alternatives and similar repositories for simple_diffusion:
Users that are interested in simple_diffusion are comparing it to the libraries listed below
- Implementation of the Remixer Block from the Remixer paper, in Pytorch☆35Updated 3 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆50Updated 2 years ago
- Implementation of Spectral Leakage and Rethinking the Kernel Size in CNNs in Pytorch☆14Updated 4 years ago
- Implementation of Kronecker Attention in Pytorch☆18Updated 4 years ago
- ☆23Updated 4 years ago
- A simple implementation of [Mamba: Linear-Time Sequence Modeling with Selective State Spaces](https://arxiv.org/abs/2312.00752)☆21Updated last year
- ☆21Updated 4 years ago
- A simple implementation of a deep linear Pytorch module☆19Updated 4 years ago
- Directed masked autoencoders☆14Updated 2 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- Experimental implementation for a sparse-dictionary based version of the VQ-VAE2 paper☆34Updated last year
- Authors implementation of LieTransformer: Equivariant Self-Attention for Lie Groups☆36Updated 4 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆57Updated last year
- High performance pytorch modules☆18Updated 2 years ago
- LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)☆18Updated last year
- Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration"☆15Updated 3 years ago
- Implementation of Insertion-deletion Denoising Diffusion Probabilistic Models☆30Updated 2 years ago
- ICME2022 Special Session “Beyond Accuracy: Responsible, Responsive, and Robust Multimedia Retrieval ”☆12Updated 10 months ago
- PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations☆16Updated 4 years ago
- A convolution-free, transformer-only version of the CycleGAN framework☆33Updated 3 years ago
- Adam with minor modifications which give significant improvement☆19Updated 3 years ago
- Hacks for PyTorch☆19Updated 2 years ago
- Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk☆46Updated last year
- ☆8Updated last year
- ☆29Updated 2 years ago
- Graph neural network message passing reframed as a Transformer with local attention☆68Updated 2 years ago
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Updated 3 years ago
- Official code for NeurIPS paper "Combinatorial Optimization for Panoptic Segmentation: A Fully Differentiable Approach".☆16Updated 2 years ago
- A PyTorch Dataset that caches samples in shared memory, accessible globally to all processes☆20Updated 2 years ago
- PyTorch implementation of MLP-Mixer: An all-MLP Architecture for Vision☆23Updated 3 years ago