tonyduan / transformer-blocks
Multi-Head Attention, Transformer, Perceiver, Linear Attention.
☆11 · Updated 2 years ago
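The repository implements multi-head attention among its blocks. As an illustration of what such a block computes (not the repo's actual API; shapes, weight layout, and function names here are assumptions for the sketch), a minimal NumPy version:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax along the given axis.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def multi_head_attention(x, w_q, w_k, w_v, w_o, n_heads):
    """Scaled dot-product attention over n_heads heads.

    x: (seq_len, d_model); each weight matrix: (d_model, d_model).
    """
    seq_len, d_model = x.shape
    d_head = d_model // n_heads

    def project(w):
        # Project, then split the feature dim into heads:
        # (seq_len, d_model) -> (n_heads, seq_len, d_head)
        return (x @ w).reshape(seq_len, n_heads, d_head).transpose(1, 0, 2)

    q, k, v = project(w_q), project(w_k), project(w_v)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(d_head)    # (n_heads, seq, seq)
    out = softmax(scores) @ v                              # (n_heads, seq, d_head)
    out = out.transpose(1, 0, 2).reshape(seq_len, d_model) # concatenate heads
    return out @ w_o

rng = np.random.default_rng(0)
d_model, n_heads, seq_len = 16, 4, 5
ws = [rng.standard_normal((d_model, d_model)) * 0.1 for _ in range(4)]
y = multi_head_attention(rng.standard_normal((seq_len, d_model)), *ws, n_heads)
print(y.shape)  # (5, 16)
```

The head split here is the standard reshape-and-transpose trick; a real implementation (e.g. in PyTorch) would also carry a batch dimension and optional masking.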
Alternatives and similar repositories for transformer-blocks
Users interested in transformer-blocks are comparing it to the libraries listed below.
- A differentiation API for PyTorch ☆30 · Updated 5 years ago
- Code for "'Hey, that's not an ODE:' Faster ODE Adjoints via Seminorms" (ICML 2021) ☆88 · Updated 3 years ago
- Relative gradient optimization of the Jacobian term in unsupervised deep learning (NeurIPS 2020) ☆21 · Updated 4 years ago
- ☆48 · Updated 2 years ago
- Continuous-time gradient flow for generative modeling and variational inference ☆33 · Updated 7 years ago
- Hierarchical variational models for physics ☆18 · Updated 5 years ago
- JAX-based MaxEnt ☆17 · Updated 5 years ago
- An example showing how to use JAX to train ResNet-50 on multiple nodes and GPUs ☆20 · Updated 3 years ago
- Code for "Learning Unitary Operators with Help From u(n)" (AAAI-17, https://arxiv.org/abs/1607.04903) ☆17 · Updated 8 years ago
- Dive into JAX, Flax, XLA, and C++ ☆32 · Updated 5 years ago
- Code for "Understanding and Mitigating Exploding Inverses in Invertible Neural Networks" (AISTATS 2021, http://arxiv.org/abs/2006.09347) ☆30 · Updated 5 years ago
- Riemannian Convex Potential Maps ☆67 · Updated 2 years ago
- ☆70 · Updated 2 years ago
- Convex potential flows ☆84 · Updated 3 years ago
- Neural Manifold Ordinary Differential Equations (NeurIPS 2020, https://arxiv.org/abs/2006.10254) ☆121 · Updated 2 years ago
- A public repository for our paper, Rao-Blackwellized Stochastic Gradients for Discrete Distributions