borjanG / 2023-transformers
Codes for the paper The emergence of clusters in self-attention dynamics.
☆13Updated last year
Alternatives and similar repositories for 2023-transformers:
Users that are interested in 2023-transformers are comparing it to the libraries listed below
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561☆24Updated 3 years ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22')☆16Updated 2 years ago
- General Invertible Transformations for Flow-based Generative Models☆17Updated 4 years ago
- Repo to the paper "Lie Point Symmetry Data Augmentation for Neural PDE Solvers"☆49Updated last year
- Monotone operator equilibrium networks☆51Updated 4 years ago
- Jupyter Notebook corresponding to 'Going with the Flow: An Introduction to Normalizing Flows'☆25Updated 3 years ago
- Euclidean Wasserstein-2 optimal transportation☆44Updated last year
- Transformers with doubly stochastic attention☆44Updated 2 years ago
- [ICML'21 Oral] Improving Lossless Compression Rates via Monte Carlo Bits-Back Coding☆14Updated 3 years ago
- [NeurIPS 2020] Task-Agnostic Amortized Inference of Gaussian Process Hyperparameters (AHGP)☆21Updated 4 years ago
- code for "Neural Conservation Laws A Divergence-Free Perspective".☆35Updated 2 years ago
- Quantification of Uncertainty with Adversarial Models☆27Updated last year
- Refining continuous-in-depth neural networks☆39Updated 3 years ago
- ☆11Updated 3 years ago
- Model hub for all your DiffeqML needs. Pretrained weights, modules, and basic inference infrastructure☆24Updated last year
- A variational method for fast, approximate inference for stochastic differential equations.☆43Updated 6 years ago
- ☆18Updated 2 years ago
- ☆18Updated last year
- PyTorch implementation for "Probabilistic Circuits for Variational Inference in Discrete Graphical Models", NeurIPS 2020☆15Updated 3 years ago
- Blog post☆16Updated 11 months ago
- ☆9Updated last year
- Investigate the speed of adaptation of structural causal models☆16Updated 3 years ago
- ☆53Updated 5 months ago
- ☆14Updated 2 years ago
- Code for "Accelerating Natural Gradient with Higher-Order Invariance"☆30Updated 5 years ago
- Implementation of Action Matching for the Schrödinger equation☆24Updated last year
- Computing the eigenvalues of Neural Tangent Kernel and Conjugate Kernel (aka NNGP kernel) over the boolean cube☆47Updated 5 years ago
- Meta-learning inductive biases in the form of useful conserved quantities.☆37Updated 2 years ago
- Implicit Deep Adaptive Design (iDAD): Policy-Based Experimental Design without Likelihoods☆17Updated 3 years ago
- This repository contains PyTorch implementations of various random feature maps for dot product kernels.☆19Updated 6 months ago