archinetai / difformer-pytorch
Diffusion based transformer, in PyTorch (Experimental).
☆24 Updated 2 years ago
Alternatives and similar repositories for difformer-pytorch
Users who are interested in difformer-pytorch are comparing it to the libraries listed below.
- ☆37 Updated 10 months ago
- Code for the paper PermuteFormer ☆42 Updated 3 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 Updated last year
- A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in y… ☆45 Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch ☆39 Updated 3 years ago
- ☆26 Updated 3 years ago
- Bayesian Attention Modules ☆35 Updated 4 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling ☆19 Updated 3 weeks ago
- AdaCat ☆49 Updated 2 years ago
- Implementation of Multistream Transformers in Pytorch ☆54 Updated 3 years ago
- ☆51 Updated 11 months ago
- Official code for the paper: "Metadata Archaeology" ☆19 Updated 2 years ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi… ☆51 Updated 3 years ago
- [ICLR 2023] “Layer Grafted Pre-training: Bridging Contrastive Learning And Masked Image Modeling For Better Representations”, Ziyu Jian… ☆24 Updated 2 years ago
- ☆30 Updated 3 years ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch ☆36 Updated 3 years ago
- codebase for the SIMAT dataset and evaluation ☆39 Updated 3 years ago
- Implementation of Metaformer, but in an autoregressive manner ☆25 Updated 2 years ago
- ☆36 Updated 4 years ago
- An implementation of (Induced) Set Attention Block, from the Set Transformers paper ☆59 Updated 2 years ago
- Github code for the paper Maximum Class Separation as Inductive Bias in One Matrix. Arxiv link: https://arxiv.org/abs/2206.08704 ☆29 Updated 2 years ago
- [EMNLP'19] Summary for Transformer Understanding ☆53 Updated 5 years ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys" ☆27 Updated 2 years ago
- [CVPR'23 Highlight] Heterogeneous Continual Learning. ☆16 Updated last year
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021 ☆27 Updated 3 years ago
- Pytorch implementation for "The Surprising Positive Knowledge Transfer in Continual 3D Object Shape Reconstruction" ☆33 Updated 2 years ago
- [ICLR2024] (EvALign-ICL Benchmark) Beyond Task Performance: Evaluating and Reducing the Flaws of Large Multimodal Models with In-Context … ☆22 Updated last year
- Code for ICLR 2022 Paper, "Controlling Directions Orthogonal to a Classifier" ☆35 Updated 2 years ago
- Learning to Encode Position for Transformer with Continuous Dynamical Model ☆60 Updated 4 years ago
- reproduces experiments from "Grounding inductive biases in natural images: invariance stems from variations in data" ☆17 Updated 8 months ago