tk-rusch / LEM
Official code for Long Expressive Memory (ICLR 2022, Spotlight)
☆69 · Updated 2 years ago
Alternatives and similar repositories for LEM:
Users interested in LEM are comparing it to the repositories listed below.
- Layerwise Batch Entropy Regularization ☆22 · Updated 2 years ago
- AdaCat ☆49 · Updated 2 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023) ☆122 · Updated last year
- Code repository of the paper "Wavelet Networks: Scale-Translation Equivariant Learning From Raw Time-Series, TMLR" https://arxiv.org/abs… ☆81 · Updated last year
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496) ☆14 · Updated 3 months ago
- Sequence Modeling with Structured State Spaces ☆62 · Updated 2 years ago
- Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/21… ☆119 · Updated 2 years ago
- Supplementary code for the paper "Meta-Solver for Neural Ordinary Differential Equations" https://arxiv.org/abs/2103.08561 ☆24 · Updated 3 years ago
- ☆33 · Updated last year
- ☆31 · Updated 3 years ago
- Official code for Coupled Oscillatory RNN (ICLR 2021, Oral) ☆43 · Updated 3 years ago
- Code for ICLR 2021 Paper, "Anytime Sampling for Autoregressive Models via Ordered Autoencoding" ☆25 · Updated last year
- Code for "'Hey, that's not an ODE:' Faster ODE Adjoints via Seminorms" (ICML 2021) ☆86 · Updated 2 years ago
- Transformers with doubly stochastic attention ☆44 · Updated 2 years ago
- Official code for UnICORNN (ICML 2021) ☆27 · Updated 3 years ago
- code for "Semi-Discrete Normalizing Flows through Differentiable Tessellation" ☆24 · Updated 2 years ago
- Continuous Augmented Positional Embeddings (CAPE) implementation for PyTorch ☆39 · Updated 2 years ago
- ☆36 · Updated last year
- repo for paper: Adaptive Checkpoint Adjoint (ACA) method for gradient estimation in neural ODE ☆54 · Updated 3 years ago
- [AAAI 2020 Oral] Low-variance Black-box Gradient Estimates for the Plackett-Luce Distribution ☆37 · Updated 4 years ago
- Implementation of Flow++ in PyTorch ☆41 · Updated 5 years ago
- ☆23 · Updated 3 years ago
- Jax/Flax implementation of Variational-DiffWave. ☆40 · Updated 2 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆97 · Updated last year
- Code for: "Neural Rough Differential Equations for Long Time Series", (ICML 2021) ☆115 · Updated 3 years ago
- Very deep VAEs in JAX/Flax ☆46 · Updated 3 years ago
- Easy-to-use AdaHessian optimizer (PyTorch) ☆77 · Updated 4 years ago
- ☆60 · Updated 4 years ago
- An implementation of soft-DTW divergences. ☆133 · Updated 3 years ago
- The accompanying code for "Simplifying and Understanding State Space Models with Diagonal Linear RNNs" (Ankit Gupta, Harsh Mehta, Jonatha… ☆19 · Updated 2 years ago