hunar4321 / RLS-neural-net
Recursive Least Squares (RLS) with Neural Network for fast learning
☆50 Updated 11 months ago
Related projects
Alternatives and complementary repositories for RLS-neural-net
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction" ☆57 Updated last year
- Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX ☆78 Updated 9 months ago
- Implementation of GateLoop Transformer in Pytorch and Jax ☆86 Updated 4 months ago
- Implementations of growing and pruning in neural networks ☆21 Updated last year
- A State-Space Model with Rational Transfer Function Representation. ☆70 Updated 5 months ago
- ☆46 Updated last month
- Utilities for PyTorch distributed ☆23 Updated last year
- Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️ ☆52 Updated last year
- JAX/Flax implementation of the Hyena Hierarchy ☆30 Updated last year
- ☆33 Updated last year
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network ☆46 Updated 3 months ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens) ☆43 Updated last month
- ☆31 Updated 2 months ago
- ☆21 Updated last year
- Serialize JAX, Flax, Haiku, or Objax model params with 🤗`safetensors` ☆42 Updated 5 months ago
- FID computation in Jax/Flax. ☆24 Updated 3 months ago
- ☆27 Updated 6 months ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets' ☆38 Updated 2 years ago
- CUDA implementation of autoregressive linear attention, with all the latest research findings ☆43 Updated last year
- ☆38 Updated 2 months ago
- ☆25 Updated 4 months ago
- My explorations into editing the knowledge and memories of an attention network ☆34 Updated last year
- Fast training of unitary deep network layers from low-rank updates ☆28 Updated last year
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally … ☆69 Updated last year
- A JAX nn library ☆21 Updated 8 months ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆26 Updated 6 months ago
- AdaCat ☆49 Updated 2 years ago
- Efficient optimizers ☆42 Updated this week