Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"
☆29Jul 30, 2020Updated 5 years ago
Alternatives and similar repositories for LaProp-Optimizer
Users that are interested in LaProp-Optimizer are comparing it to the libraries listed below
Sorting:
- Code for "What really matters in matrix-whitening optimizers?"☆22Oct 31, 2025Updated 4 months ago
- ☆15Mar 2, 2025Updated last year
- recipe for training fully-featured self supervised image jepa models☆12Jun 4, 2025Updated 9 months ago
- ☆35Apr 12, 2024Updated last year
- Minimal Implimentation of VCRec (2024) for collapse provention.☆18Jan 28, 2025Updated last year
- The AGI Concept Map is my attempt at reconstructing all the internal knowledge I've acquired about artificial general intelligence over t…☆14Nov 28, 2019Updated 6 years ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- OpenAI CLIP based image generator with complex config file controlled transformation and training pipelines☆19Jan 4, 2022Updated 4 years ago
- H-Net Dynamic Hierarchical Architecture☆81Sep 11, 2025Updated 5 months ago
- High performance pytorch modules☆17Jan 14, 2023Updated 3 years ago
- 📄Small Batch Size Training for Language Models☆80Oct 4, 2025Updated 5 months ago
- Portfolio REgret for Confidence SEquences☆21Jan 6, 2026Updated 2 months ago
- an implementation of FAdam (Fisher Adam) in PyTorch☆50Jul 1, 2025Updated 8 months ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆33May 2, 2025Updated 10 months ago
- Automated Theorem Prover for Automatic Words☆21Apr 7, 2021Updated 4 years ago
- ☆24Sep 25, 2024Updated last year
- An implementation of PSGD Kron second-order optimizer for PyTorch☆98Jul 24, 2025Updated 7 months ago
- quick playground to animate pippin☆15Nov 11, 2024Updated last year
- PyTorch implementation of RWKV blocks☆32Jul 22, 2025Updated 7 months ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆77May 30, 2023Updated 2 years ago
- Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"☆32May 28, 2025Updated 9 months ago
- ☆34Jun 4, 2025Updated 9 months ago
- ☆30Nov 5, 2023Updated 2 years ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Oct 18, 2023Updated 2 years ago
- A red teaming agent☆18Oct 15, 2025Updated 4 months ago
- Forked from https://gitlab.com/MatejB/PrePoMax☆13Jan 8, 2024Updated 2 years ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆65Feb 19, 2026Updated 2 weeks ago
- Minimal implementation of PCA in PyTorch, tested against scikit-learn's implementation☆29Feb 24, 2025Updated last year
- (CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction☆37May 6, 2024Updated last year
- Efficient Memory-Augmented Transformers☆35Dec 5, 2022Updated 3 years ago
- CCQA A New Web-Scale Question Answering Dataset for Model Pre-Training☆32Jul 20, 2022Updated 3 years ago
- Repo to reproduce the First-Explore paper results☆39Dec 25, 2024Updated last year
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆129Jun 24, 2025Updated 8 months ago
- Libraries, guides, blueprints, and sample code, to enable rapidly building 0-1 applications on iOS, Android and web.☆11May 12, 2023Updated 2 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Compute distribution-based quality metrics for audio data using embeddings, with a focus on music.☆43Jan 15, 2026Updated last month
- ☆83Apr 16, 2024Updated last year
- Official code for the paper "Attention as a Hypernetwork"☆52Feb 24, 2026Updated last week
- PyTorch interface for TrueGrad Optimizers☆43Aug 8, 2023Updated 2 years ago