Code for our NeurIPS 2022 paper
☆371 · Jan 13, 2023 · Updated 3 years ago
Alternatives and similar repositories for gradient-descent-the-ultimate-optimizer
Users who are interested in gradient-descent-the-ultimate-optimizer are comparing it to the libraries listed below.
- An inspection tool for sensor_msgs/PointCloud2 messages [ROS1/ROS2] ☆19 · Jan 20, 2023 · Updated 3 years ago
- Testing various improvements to Ranger21 for 2022 ☆19 · Nov 6, 2024 · Updated last year
- TorchOpt is an efficient library for differentiable optimization built upon PyTorch. ☆631 · May 4, 2026 · Updated 3 weeks ago
- Header-based library for robust GNC+BR ☆15 · Jul 2, 2023 · Updated 2 years ago
- ☆803 · Apr 28, 2026 · Updated 3 weeks ago
- ☆12 · Jul 7, 2023 · Updated 2 years ago
- Schedule-Free Optimization in PyTorch ☆2,284 · May 18, 2026 · Updated last week
- TF/Keras code for DiffStride, a pooling layer with learnable strides. ☆124 · Feb 7, 2022 · Updated 4 years ago
- functorch is JAX-like composable function transforms for PyTorch. ☆1,436 · Aug 21, 2025 · Updated 9 months ago
- D-Adaptation for SGD, Adam and AdaGrad ☆532 · Jan 22, 2025 · Updated last year
- Convolutions for Sequence Modeling ☆912 · Jun 13, 2024 · Updated last year
- torch-optimizer -- collection of optimizers for PyTorch ☆3,167 · Mar 22, 2024 · Updated 2 years ago
- Implementation of Hinton's forward-forward (FF) algorithm - an alternative to back-propagation ☆1,498 · Sep 6, 2023 · Updated 2 years ago
- Constrained optimization toolkit for PyTorch ☆709 · Jul 29, 2025 · Updated 9 months ago
- maximal update parametrization (µP) ☆1,710 · Jul 17, 2024 · Updated last year
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training” ☆1,001 · Jan 30, 2024 · Updated 2 years ago
- Hardware accelerated, batchable and differentiable optimizers in JAX. ☆1,040 · Dec 17, 2025 · Updated 5 months ago
- ☆28 · Jul 28, 2022 · Updated 3 years ago
- Certifiable solvers for the relative pose problem (RPp) with known gravity vector ☆13 · Feb 16, 2023 · Updated 3 years ago
- Tutorial examples and sample applications for DC-SAM. ☆20 · Mar 4, 2026 · Updated 2 months ago
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) ☆9,485 · Apr 19, 2026 · Updated last month
- CURL: Continuous, Ultra-compact Representation for LiDAR ☆54 · Oct 11, 2023 · Updated 2 years ago
- Permutation invariant training in PyTorch ☆13 · Oct 2, 2020 · Updated 5 years ago
- BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient. ☆612 · Nov 28, 2025 · Updated 5 months ago
- Blog post ☆17 · Feb 16, 2024 · Updated 2 years ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval" ☆27 · Apr 17, 2024 · Updated 2 years ago
- Tensors, for human consumption ☆1,386 · Apr 9, 2026 · Updated last month
- Hypergradient descent ☆146 · May 31, 2024 · Updated last year
- For optimization algorithm research and development. ☆568 · May 6, 2026 · Updated 2 weeks ago
- Official implementation of "Relational Proxies: Emergent Relationships as Fine-Grained Discriminators", NeurIPS 2022. ☆14 · Feb 1, 2025 · Updated last year
- Elegant easy-to-use neural networks + scientific computing in JAX. https://docs.kidger.site/equinox/ ☆2,886 · May 11, 2026 · Updated 2 weeks ago
- Optimal transport tools implemented with the JAX framework, to solve large scale matching problems of any flavor. ☆738 · Updated this week
- [ROS package] Background removal in (2D or 3D) lidar scan data that coincides with an occupancy grid map. ☆17 · Oct 28, 2022 · Updated 3 years ago
- Code for visualizing the loss landscape of neural nets ☆3,180 · Apr 5, 2022 · Updated 4 years ago
- Ranger deep learning optimizer rewrite to use newest components ☆342 · Mar 17, 2026 · Updated 2 months ago
- ☆19 · Nov 25, 2022 · Updated 3 years ago
- ☆30 · Jan 9, 2026 · Updated 4 months ago
- Second Order Optimization and Curvature Estimation with K-FAC in JAX. ☆324 · May 11, 2026 · Updated 2 weeks ago
- [NeurIPS 2022] “Back Razor: Memory-Efficient Transfer Learning by Self-Sparsified Backpropagation”, Ziyu Jiang*, Xuxi Chen*, Xueqin Huan… ☆19 · Mar 14, 2023 · Updated 3 years ago