ayulockin / debugNNwithWandB
Concepts Explored in/with Pytorch
☆18 Updated 5 months ago
Alternatives and similar repositories for debugNNwithWandB:
Users who are interested in debugNNwithWandB are comparing it to the libraries listed below.
- Collection of snippets for PyTorch users ☆26 Updated 2 years ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in Pytorch ☆97 Updated last year
- Gaussian-Bernoulli Restricted Boltzmann Machines ☆101 Updated 2 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction" ☆57 Updated last year
- Cyclemoid implementation for PyTorch ☆87 Updated 2 years ago
- Code for the article "What if Neural Networks had SVDs?", to be presented as a spotlight paper at NeurIPS 2020. ☆72 Updated 5 months ago
- Implementation of "compositional attention" from MILA, a multi-head attention variant that is reframed as a two-step attention process wi… ☆50 Updated 2 years ago
- Code repository of the paper "CKConv: Continuous Kernel Convolution For Sequential Data" published at ICLR 2022. https://arxiv.org/abs/21… ☆119 Updated 2 years ago
- Pytorch implementation of preconditioned stochastic gradient descent (Kron and affine preconditioner, low-rank approximation precondition… ☆148 Updated last month
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705) ☆21 Updated 2 months ago
- Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo… ☆66 Updated last month
- ☆36 Updated 2 years ago
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs ☆36 Updated 2 years ago
- Machine Learning eXperiment Utilities ☆45 Updated 7 months ago
- The official repository for our paper "The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers". We s… ☆66 Updated 2 years ago
- Code for the paper PermuteFormer ☆42 Updated 3 years ago
- A simple hypernetwork implementation in jax using haiku. ☆23 Updated 2 years ago
- Implementation of deep implicit attention in PyTorch ☆63 Updated 3 years ago
- Simple Numpy implementation of the FAVOR+ attention mechanism, https://teddykoker.com/2020/11/performers/ ☆37 Updated 4 years ago
- Official code for Coupled Oscillatory RNN (ICLR 2021, Oral) ☆44 Updated 3 years ago
- Sequence Modeling with Structured State Spaces ☆61 Updated 2 years ago
- ☆50 Updated 3 months ago
- A pure-functional implementation of a machine learning transformer model in Python/JAX ☆175 Updated 2 years ago
- ☆72 Updated 2 years ago
- JAX implementation of Learning to learn by gradient descent by gradient descent ☆26 Updated 3 months ago
- ML/DL Math and Method notes ☆58 Updated last year
- Simple Python scripts to clean up and flatten ArXiv LaTeX submissions. ☆64 Updated 2 years ago
- Toy implementations of some popular ML optimizers using Python/JAX ☆43 Updated 3 years ago
- PyTorch implementation of the vision transformer ☆18 Updated last year
- TensorLy-Torch: Deep Tensor Learning with TensorLy and PyTorch ☆76 Updated 7 months ago