ml-research / rational_activationsLinks
Rational Activation Functions - Replacing Padé Activation Units
☆102Updated 8 months ago
Alternatives and similar repositories for rational_activations
Users that are interested in rational_activations are comparing it to the libraries listed below
Sorting:
- Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/21…☆124Updated 2 years ago
- Package for working with hypernetworks in PyTorch.☆131Updated 2 years ago
- Easy Hypernetworks in Pytorch and Jax☆105Updated 2 years ago
- Gaussian-Bernoulli Restricted Boltzmann Machines☆105Updated 2 years ago
- TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch☆81Updated last year
- Transformers with doubly stochastic attention☆49Updated 3 years ago
- Padé Activation Units: End-to-end Learning of Activation Functions in Deep Neural Network☆63Updated 4 years ago
- Code for the paper: Complex-Valued Autoencoders for Object Discovery☆56Updated 2 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆127Updated 2 years ago
- Implementation of "Gradients without backpropagation" paper (https://arxiv.org/abs/2202.08587) using functorch☆114Updated 2 years ago
- Official implementation of Transformer Neural Processes☆78Updated 3 years ago
- ☆189Updated last year
- ☆164Updated 2 years ago
- Sequence Modeling with Structured State Spaces☆66Updated 3 years ago
- Differentiable Optimizers with Perturbations in Pytorch☆67Updated 4 years ago
- Modern Fixed Point Systems using Pytorch☆122Updated 2 years ago
- ☆65Updated 3 years ago
- A minimalist implementation of score-based diffusion model☆129Updated 4 years ago
- Official code for Long Expressive Memory (ICLR 2022, Spotlight)☆71Updated 3 years ago
- ☆42Updated last year
- Official code for Coupled Oscillatory RNN (ICLR 2021, Oral)☆50Updated 4 years ago
- Learnable Fourier Features for Multi-Dimensional Spatial Positional Encoding☆55Updated last year
- ☆31Updated 4 years ago
- Repository for the "Gotta Go Fast When Generating Data with Score-Based Models" paper☆105Updated 3 years ago
- Official code repository of the paper Linear Transformers Are Secretly Fast Weight Programmers.☆108Updated 4 years ago
- NF-Layers for constructing neural functionals.☆91Updated last year
- TF/Keras code for DiffStride, a pooling layer with learnable strides.☆124Updated 3 years ago
- Code for "'Hey, that's not an ODE:' Faster ODE Adjoints via Seminorms" (ICML 2021)☆88Updated 3 years ago
- An implementation of (Induced) Set Attention Block, from the Set Transformers paper☆65Updated 2 years ago
- A practical implementation of GradNorm, Gradient Normalization for Adaptive Loss Balancing, in Pytorch☆114Updated 2 months ago