archinetai / difformer-pytorch
Diffusion-based transformer, in PyTorch (Experimental).
☆25 Updated 2 years ago
Related projects
Alternatives and complementary repositories for difformer-pytorch
- Curse-of-memory phenomenon of RNNs in sequence modelling ☆19 Updated this week
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise" ☆52 Updated last year
- Implementation of Metaformer, but in an autoregressive manner ☆23 Updated 2 years ago
- Position Prediction as an Effective Pretraining Strategy ☆8 Updated last year
- Bayesian Attention Modules ☆35 Updated 3 years ago
- [CVPR'23 Highlight] Heterogeneous Continual Learning. ☆15 Updated 11 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆19 Updated last year
- Code repository for the paper "Meta-Learning via Classifier(-free) Diffusion Guidance" ☆30 Updated last year
- A Benchmark for Efficient and Compositional Visual Reasoning ☆18 Updated last year
- ☆36 Updated 4 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 Updated last year
- A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in y… ☆45 Updated 2 years ago
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases" ☆15 Updated 2 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆26 Updated 7 months ago
- Papers, authors and author affiliations from ICML, NeurIPS and ICLR 2006-2023 ☆34 Updated 11 months ago
- ☆26 Updated 2 years ago
- Code for the paper PermuteFormer ☆42 Updated 3 years ago
- Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023. ☆23 Updated last year
- [Preprint] AdaVAE: Exploring Adaptive GPT-2s in VAEs for Language Modeling PyTorch Implementation ☆34 Updated last year
- ☆13 Updated last year
- Official code for the paper: "Metadata Archaeology" ☆18 Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆95 Updated last year
- Official PyTorch implementation of A Quaternion-Valued Variational Autoencoder (QVAE). ☆27 Updated 2 years ago
- [NeurIPS 2022] Your Transformer May Not be as Powerful as You Expect (official implementation) ☆33 Updated last year
- ☆33 Updated 10 months ago
- A package for fine tuning of pretrained NLP transformers using Semi Supervised Learning ☆15 Updated 3 years ago
- Video descriptions of research papers relating to foundation models and scaling ☆30 Updated last year
- ☆31 Updated 10 months ago
- Neural Diffusion Processes ☆73 Updated 3 months ago