egochao / transformer_with_einsum

Transformer from scratch with einsum method

☆10

Alternatives and similar repositories for transformer_with_einsum:

Users that are interested in transformer_with_einsum are comparing it to the libraries listed below

SamsungSAILMontreal / PAPA
Repository for the PopulAtion Parameter Averaging (PAPA) paper
☆26Updated last year
lucidrains / compositional-attention-pytorch
Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…
☆50Updated 2 years ago
ExplainableML / BayesCap
(ECCV 2022) BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks
☆49Updated 2 years ago
gregorbachmann / scaling_mlps
☆51Updated 10 months ago
mtkresearch / shortest-path-diffusion
Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023.
☆23Updated last year
mkirchhof / Probabilistic_Contrastive_Learning
This repository contains the code for our paper "Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguo…
☆40Updated 2 years ago
dzld00 / pytorch-manifold-matching
[ICCV 2021] A Pytorch implementation of "Manifold Matching via Deep Metric Learning for Generative Modeling"
☆80Updated 2 years ago
s-sahoo / MuLAN
[NeurIPS 2024, spotlight] Multivariate Learned Adaptive Noise for Diffusion Models
☆19Updated 4 months ago
lucidrains / remixer-pytorch
Implementation of the Remixer Block from the Remixer paper, in Pytorch
☆35Updated 3 years ago
dasayan05 / iclr24_blog_code
Code repo for ICLR 24 BlogPost titled "Building Diffusion Model's theory from ground up"
☆18Updated last year
locuslab / deq-ddim
☆61Updated 2 years ago
dwromero / g_selfatt
Code repository for the paper "Group Equivariant Stand-Alone Self Attention For Vision" published at ICLR 2021. https://openreview.net/fo…
☆29Updated 4 years ago
oripress / EntropyEnigma
Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"
☆50Updated 10 months ago
ahennequ / pytorch-custom-mma
☆29Updated 2 years ago
minyoungg / overparam
☆41Updated 2 years ago
unixpickle / ddim
Denoising Diffusion Implicit Models
☆28Updated 4 years ago
rwightman / imagenet-12k
ImageNet-12k subset of ImageNet-21k (fall11)
☆21Updated last year
dongzhuoyao / flowseq
An official pytorch implementation of EACL2024 short paper "Flow Matching for Conditional Text Generation in a Few Sampling Steps"
☆14Updated 10 months ago
SriramB-98 / vit-decompose
☆22Updated 3 months ago
lucidrains / logavgexp-torch
Implementation of LogAvgExp for Pytorch
☆35Updated 2 weeks ago
Newbeeer / stf
Code for ICLR 2023 Paper, "Stable Target Field for Reduced Variance Score Estimation in Diffusion Models”
☆74Updated last year
gebob19 / introduction_to_normalizing_flows
Jupyter Notebook corresponding to 'Going with the Flow: An Introduction to Normalizing Flows'
☆26Updated 4 years ago
ravidziv / SimplifyingImbalancedTraining
☆8Updated last year
lucidrains / gated-state-spaces-pytorch
Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch
☆99Updated 2 years ago
jaketae / fnet
PyTorch implementation of FNet: Mixing Tokens with Fourier transforms
☆26Updated 3 years ago
yueliukth / PatchDropout
☆17Updated 2 years ago
ColinQiyangLi / AdaCat
AdaCat
☆49Updated 2 years ago
ermongroup / alignflow
☆33Updated 5 years ago
OpenNLPLab / HGRN2
HGRN2: Gated Linear RNNs with State Expansion
☆54Updated 8 months ago
google-deepmind / ssl_hsic
☆37Updated 8 months ago