archinetai / difformer-pytorch
Diffusion-based transformer, in PyTorch (Experimental).
☆25 Updated 2 years ago
Related projects
Alternatives and complementary repositories for difformer-pytorch
- Implementation of Metaformer, but in an autoregressive manner ☆23 Updated 2 years ago
- Official code for the paper: "Metadata Archaeology" ☆18 Updated last year
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 Updated last year
- Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION" ☆15 Updated last week
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆26 Updated 7 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation… ☆19 Updated last year
- [CVPR'23 Highlight] Heterogeneous Continual Learning. ☆15 Updated 11 months ago
- A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short. ☆48 Updated 4 months ago
- This is the public github for our paper "Transformer with a Mixture of Gaussian Keys" ☆26 Updated 2 years ago
- PyTorch implementation of FNet: Mixing Tokens with Fourier transforms ☆25 Updated 3 years ago
- ☆29 Updated 2 years ago
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in Pytorch ☆36 Updated 2 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling ☆19 Updated this week
- Position Prediction as an Effective Pretraining Strategy ☆8 Updated last year
- A Benchmark for Efficient and Compositional Visual Reasoning ☆17 Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases" ☆15 Updated 2 years ago
- Bayesian Attention Modules ☆35 Updated 3 years ago
- Official implementation of the paper "Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Perform… ☆19 Updated last year
- Official implementation for the paper "A Cheaper and Better Diffusion Language Model with Soft-Masked Noise" ☆52 Updated last year
- ☆13 Updated last year
- ☆35 Updated 3 months ago
- Un-*** 50 billions multimodality dataset ☆24 Updated 2 years ago
- [CVPR2019] Synthesizing Environment-Aware Activities via Activity Sketches ☆13 Updated last year
- Official code for the paper "Attention as a Hypernetwork" ☆23 Updated 4 months ago
- Papers, authors and author affiliations from ICML, NeurIPS and ICLR 2006-2023 ☆34 Updated 10 months ago
- PyTorch implementation of Soft MoE by Google Brain in "From Sparse to Soft Mixtures of Experts" (https://arxiv.org/pdf/2308.00951.pdf) ☆64 Updated last year
- A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in y… ☆45 Updated 2 years ago
- ☆26 Updated 2 years ago