devvrit / SONew
☆9Updated last year
Alternatives and similar repositories for SONew:
Users that are interested in SONew are comparing it to the libraries listed below
- Blog post☆17Updated last year
- ☆34Updated 4 months ago
- Efficient PScan implementation in PyTorch☆16Updated last year
- ☆31Updated last year
- Parallel Associative Scan for Language Models☆18Updated last year
- ☆52Updated 6 months ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19Updated 2 weeks ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆30Updated last year
- ☆30Updated 5 months ago
- Code for Accelerated Linearized Laplace Approximation for Bayesian Deep Learning (ELLA, NeurIPS 22')☆16Updated 2 years ago
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX☆83Updated last year
- ☆32Updated 6 months ago
- ☆14Updated last month
- Code for "SAM as an Optimal Relaxation of Bayes", ICLR 2023.☆25Updated last year
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated last month
- ☆10Updated 7 months ago
- Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"☆28Updated 4 years ago
- Efficient Scaling laws and collaborative pretraining.☆16Updated 3 months ago
- Code for the paper "Function-Space Learning Rates"☆19Updated last week
- Code for Neural Execution Engines: Learning to Execute Subroutines☆17Updated 4 years ago
- ☆23Updated 7 months ago
- Layerwise Batch Entropy Regularization☆22Updated 2 years ago
- Triton Implementation of HyperAttention Algorithm☆47Updated last year
- Scalable Computation of Hessian Diagonals☆13Updated 10 months ago
- ☆51Updated 11 months ago
- Combining SOAP and MUON☆15Updated 2 months ago
- Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"☆14Updated 11 months ago
- ☆31Updated 6 months ago
- ☆22Updated 2 years ago