egochao / transformer_with_einsum
Transformer implemented from scratch using einsum
☆10 · Updated 3 years ago
Alternatives and similar repositories for transformer_with_einsum:
Users who are interested in transformer_with_einsum are comparing it to the repositories listed below.
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆26 · Updated last year
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi… ☆50 · Updated 2 years ago
- (ECCV 2022) BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks ☆49 · Updated 2 years ago
- ☆51 · Updated 10 months ago
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023. ☆23 · Updated last year
- This repository contains the code for our paper "Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguo… ☆40 · Updated 2 years ago
- [ICCV 2021] A Pytorch implementation of "Manifold Matching via Deep Metric Learning for Generative Modeling" ☆80 · Updated 2 years ago
- [NeurIPS 2024, spotlight] Multivariate Learned Adaptive Noise for Diffusion Models ☆19 · Updated 4 months ago
- Implementation of the Remixer Block from the Remixer paper, in Pytorch ☆35 · Updated 3 years ago
- Code repo for ICLR 24 BlogPost titled "Building Diffusion Model's theory from ground up" ☆18 · Updated last year
- ☆61 · Updated 2 years ago
- Code repository for the paper "Group Equivariant Stand-Alone Self Attention For Vision" published at ICLR 2021. https://openreview.net/fo… ☆29 · Updated 4 years ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization" ☆50 · Updated 10 months ago
- ☆29 · Updated 2 years ago
- ☆41 · Updated 2 years ago
- Denoising Diffusion Implicit Models ☆28 · Updated 4 years ago
- ImageNet-12k subset of ImageNet-21k (fall11) ☆21 · Updated last year
- An official pytorch implementation of EACL2024 short paper "Flow Matching for Conditional Text Generation in a Few Sampling Steps" ☆14 · Updated 10 months ago
- ☆22 · Updated 3 months ago
- Implementation of LogAvgExp for Pytorch ☆35 · Updated 2 weeks ago
- Code for ICLR 2023 Paper, "Stable Target Field for Reduced Variance Score Estimation in Diffusion Models" ☆74 · Updated last year
- Jupyter Notebook corresponding to 'Going with the Flow: An Introduction to Normalizing Flows' ☆26 · Updated 4 years ago
- ☆8 · Updated last year
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆99 · Updated 2 years ago
- PyTorch implementation of FNet: Mixing Tokens with Fourier transforms ☆26 · Updated 3 years ago
- ☆17 · Updated 2 years ago
- AdaCat ☆49 · Updated 2 years ago
- ☆33 · Updated 5 years ago
- HGRN2: Gated Linear RNNs with State Expansion ☆54 · Updated 8 months ago
- ☆37 · Updated 8 months ago