egochao / transformer_with_einsum
Transformer from scratch with einsum method
☆10Updated 3 years ago
Alternatives and similar repositories for transformer_with_einsum:
Users that are interested in transformer_with_einsum are comparing it to the libraries listed below
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated 9 months ago
- ☆12Updated last year
- This repository contains the code for our paper "Probabilistic Contrastive Learning Recovers the Correct Aleatoric Uncertainty of Ambiguo…☆39Updated last year
- (ECCV 2022) BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks☆49Updated 2 years ago
- Simple MAE (masked autoencoders) with pytorch and pytorch-lightning.☆41Updated 11 months ago
- ☆8Updated last year
- Code for "SCHA-VAE: Hierarchical Context Aggregation for Few-Shot Generation" @ ICML 2022☆15Updated 2 years ago
- Improving Transformation Invariance in Contrastive Representation Learning☆13Updated 3 years ago
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023.☆22Updated last year
- Implementation of LogAvgExp for Pytorch☆32Updated 2 years ago
- ☆49Updated last year
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi…☆50Updated 2 years ago
- [NeurIPS 2024, spotlight] Multivariate Learned Adaptive Noise for Diffusion Models☆15Updated last month
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆27Updated 3 years ago
- Active Learning Helps Pretrained Models Learn the Intended Task (https://arxiv.org/abs/2204.08491) by Alex Tamkin, Dat Nguyen, Salil Desh…☆11Updated 2 years ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper☆104Updated 3 years ago
- ☆32Updated 5 years ago
- Directed masked autoencoders☆14Updated 2 years ago
- An implementation of Squared Earth-Mover's Distance loss for Neural Networks.☆14Updated last year
- More dimensions = More fun☆21Updated 6 months ago
- Personal implementation of ASIF by Antonio Norelli☆25Updated 8 months ago
- ☆20Updated 3 weeks ago
- Github code for the paper Maximum Class Separation as Inductive Bias in One Matrix. Arxiv link: https://arxiv.org/abs/2206.08704☆28Updated last year
- [NeurIPS 2023] Official Implementation: "Ambient Diffusion: Learning Clean Distributions from Corrupted Data"☆77Updated last year
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆21Updated 2 months ago
- Official code for the paper "Why Do Self-Supervised Models Transfer? Investigating the Impact of Invariance on Downstream Tasks".☆16Updated 3 years ago
- HGRN2: Gated Linear RNNs with State Expansion☆52Updated 5 months ago
- ImageNet-12k subset of ImageNet-21k (fall11)☆21Updated last year
- [ICLR 2024] Official code for the paper 'Elucidating the Exposure Bias in Diffusion Models'☆24Updated 8 months ago
- ☆51Updated 7 months ago