Z-T-WANG / LaProp-OptimizerView external linksLinks
Codes accompanying the paper "LaProp: a Better Way to Combine Momentum with Adaptive Gradient"
☆29Jul 30, 2020Updated 5 years ago
Alternatives and similar repositories for LaProp-Optimizer
Users that are interested in LaProp-Optimizer are comparing it to the libraries listed below
Sorting:
- ☆14Mar 2, 2025Updated 11 months ago
- recipe for training fully-featured self supervised image jepa models☆12Jun 4, 2025Updated 8 months ago
- ☆35Apr 12, 2024Updated last year
- Minimal Implimentation of VCRec (2024) for collapse provention.☆18Jan 28, 2025Updated last year
- The AGI Concept Map is my attempt at reconstructing all the internal knowledge I've acquired about artificial general intelligence over t…☆14Nov 28, 2019Updated 6 years ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- H-Net Dynamic Hierarchical Architecture☆81Sep 11, 2025Updated 5 months ago
- High performance pytorch modules☆18Jan 14, 2023Updated 3 years ago
- 📄Small Batch Size Training for Language Models☆80Oct 4, 2025Updated 4 months ago
- Portfolio REgret for Confidence SEquences☆20Jan 6, 2026Updated last month
- an implementation of FAdam (Fisher Adam) in PyTorch☆50Jul 1, 2025Updated 7 months ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement …☆32May 2, 2025Updated 9 months ago
- Automated Theorem Prover for Automatic Words☆21Apr 7, 2021Updated 4 years ago
- ☆24Sep 25, 2024Updated last year
- Official implementation of the paper "You Do Not Fully Utilize Transformer's Representation Capacity"☆31May 28, 2025Updated 8 months ago
- PyTorch implementation of RWKV blocks☆32Jul 22, 2025Updated 6 months ago
- Official Implementation of the ICML 2023 paper: "Neural Wave Machines: Learning Spatiotemporally Structured Representations with Locally …☆77May 30, 2023Updated 2 years ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆22Oct 18, 2023Updated 2 years ago
- A red teaming agent☆18Oct 15, 2025Updated 4 months ago
- Forked from https://gitlab.com/MatejB/PrePoMax☆12Jan 8, 2024Updated 2 years ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆65Jan 27, 2026Updated 2 weeks ago
- Minimal implementation of PCA in PyTorch, tested against scikit-learn's implementation☆29Feb 24, 2025Updated 11 months ago
- (CVPR 2024) Bayesian Diffusion Models for 3D Shape Reconstruction☆37May 6, 2024Updated last year
- A simple library for scaling up JAX programs☆145Nov 4, 2025Updated 3 months ago
- Repo to reproduce the First-Explore paper results☆39Dec 25, 2024Updated last year
- An efficient implementation of the NSA (Native Sparse Attention) kernel☆129Jun 24, 2025Updated 7 months ago
- Libraries, guides, blueprints, and sample code, to enable rapidly building 0-1 applications on iOS, Android and web.☆11May 12, 2023Updated 2 years ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwise☆40Aug 29, 2024Updated last year
- ☆82Apr 16, 2024Updated last year
- Official code for the paper "Attention as a Hypernetwork"☆47Jun 22, 2024Updated last year
- PyTorch interface for TrueGrad Optimizers☆43Aug 8, 2023Updated 2 years ago
- RWKV-X is a Linear Complexity Hybrid Language Model based on the RWKV architecture, integrating Sparse Attention to improve the model's l…☆54Jan 12, 2026Updated last month
- Embedding Recycling for Language models☆38Jul 11, 2023Updated 2 years ago
- ☆131May 29, 2025Updated 8 months ago
- Optical flow library, based on NumPy arrays☆11Nov 30, 2021Updated 4 years ago
- OpenVLA for AIRBOT☆14Aug 15, 2024Updated last year
- RockIt: A query engine for Markov logic☆11May 24, 2016Updated 9 years ago