archinetai / difformer-pytorchLinks
Diffusion based transformer, in PyTorch (Experimental).
☆24Updated 3 years ago
Alternatives and similar repositories for difformer-pytorch
Users that are interested in difformer-pytorch are comparing it to the libraries listed below
Sorting:
- Experiment with diffusion models that you can run on your local jupyter instances☆63Updated 11 months ago
- Bayesian Attention Modules☆35Updated 4 years ago
- A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in y…☆44Updated 3 years ago
- Toloka Visual Question Answering Challenge at WSDM Cup 2023☆31Updated last year
- ☆38Updated last year
- Papers, authors and author affiliations from ICML, NeurIPS and ICLR 2006-2024☆40Updated 6 months ago
- Unofficial PyTorch implementation of "Step-unrolled Denoising Autoencoders for Text Generation"☆24Updated 2 years ago
- An implementation of (Induced) Set Attention Block, from the Set Transformers paper☆62Updated 2 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch☆101Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch☆39Updated 3 years ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation)☆32Updated 2 years ago
- Implementation of Memformer, a Memory-augmented Transformer, in Pytorch☆123Updated 4 years ago
- Implementation of Discrete Key / Value Bottleneck, in Pytorch☆88Updated 2 years ago
- [ICML 2022] Latent Diffusion Energy-Based Model for Interpretable Text Modeling☆66Updated 3 years ago
- Axial Positional Embedding for Pytorch☆83Updated 7 months ago
- Graph neural network message passing reframed as a Transformer with local attention☆69Updated 2 years ago
- ☆109Updated 3 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆51Updated 3 years ago
- Official Code for ICLR 2024 Paper: Non-negative Contrastive Learning☆45Updated last year
- [EMNLP'19] Summary for Transformer Understanding☆53Updated 5 years ago
- Implementation of Retrieval-Augmented Denoising Diffusion Probabilistic Models in Pytorch☆65Updated 3 years ago
- ☆24Updated 4 years ago
- Official PyTorch implementation of A Quaternion-Valued Variational Autoencoder (QVAE).☆31Updated 3 years ago
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise"☆59Updated 2 years ago
- Code to reproduce the results for Compositional Attention☆59Updated 2 years ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys"☆28Updated 3 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"☆59Updated last year
- Official code for the paper: "Metadata Archaeology"☆19Updated 2 years ago
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️☆55Updated 2 years ago
- Code for the paper PermuteFormer☆42Updated 4 years ago