Z-T-WANG / LaProp-Optimizer
Code accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"
☆19 · Updated 4 years ago
Related projects:
- AdaCat ☆49 · Updated 2 years ago
- Recursive Least Squares (RLS) with Neural Network for fast learning ☆48 · Updated 10 months ago
- Implementation of Spectral State Space Models ☆16 · Updated 6 months ago
- General Invertible Transformations for Flow-based Generative Models ☆17 · Updated 3 years ago
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator ☆29 · Updated last year
- Easily serialize dataclasses to and from tensors (PyTorch, NumPy) ☆18 · Updated 3 years ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes" ☆20 · Updated last year
- Usable implementation of Emerging Symbol Binding Network (ESBN), in PyTorch ☆23 · Updated 3 years ago
- A GPT, made only of MLPs, in JAX ☆55 · Updated 3 years ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data. ☆13 · Updated 3 weeks ago
- Official code for Long Expressive Memory (ICLR 2022, Spotlight) ☆69 · Updated 2 years ago
- Official repository for the paper "Goal-Conditioned Generators of Deep Policies" ☆11 · Updated 2 years ago
- Meta-learning inductive biases in the form of useful conserved quantities. ☆37 · Updated last year
- A framework for implementing equivariant DL ☆10 · Updated 3 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆31 · Updated 3 months ago
- A JAX nn library ☆20 · Updated 6 months ago
- High-performance tokenized language data-loader for Python C++ extension ☆12 · Updated last month
- JAX/Flax implementation of the Hyena Hierarchy ☆29 · Updated last year