This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"
☆184 · Jul 17, 2021 · Updated 4 years ago
Alternatives and similar repositories for Crowded-Valley---Results
Users that are interested in Crowded-Valley---Results are comparing it to the libraries listed below.
- DeepOBS: A Deep Learning Optimizer Benchmark Suite ☆109 · Dec 21, 2023 · Updated 2 years ago
- [TMLR 2022] Curvature access through the generalized Gauss-Newton's low-rank structure: Eigenvalues, eigenvectors, directional derivative… ☆17 · Jul 19, 2023 · Updated 2 years ago
- Code accompanying the NeurIPS 2021 Paper: A Probabilistic State Space Model for Joint Inference from Differential Equations and Data (Sch… ☆13 · Nov 7, 2022 · Updated 3 years ago
- BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient. ☆607 · Nov 28, 2025 · Updated 3 months ago
- Probabilistic numerical finite differences. Compute finite difference weights and differentiation matrices on scattered data sites and wi… ☆12 · May 8, 2023 · Updated 2 years ago
- Probabilistic ODE solvers are fun, but are they fast? See also: https://github.com/pnkraemer/probdiffeq for JAX code or https://github.c… ☆20 · Jul 20, 2024 · Updated last year
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks ☆488 · Jul 1, 2022 · Updated 3 years ago
- ☆37 · Feb 4, 2022 · Updated 4 years ago
- Source code for my PhD thesis: Backpropagation Beyond the Gradient ☆20 · Feb 25, 2023 · Updated 3 years ago
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX. ☆10 · Feb 8, 2025 · Updated last year
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705) ☆24 · Nov 4, 2024 · Updated last year
- Implements stochastic line search ☆118 · Mar 14, 2023 · Updated 3 years ago
- This code accompanies the manuscript "A Generative Framework for Probabilistic, Spatiotemporally Coherent Downscaling of Climate Simulati… ☆36 · Jul 21, 2025 · Updated 8 months ago
- Posterior Refinement Improves Sample Efficiency in Bayesian Neural Networks ☆10 · Oct 21, 2022 · Updated 3 years ago
- A package for computing matrix exponentials and finite horizon Gramians ☆11 · Jan 21, 2026 · Updated 2 months ago
- Sketched linear operations for PyTorch ☆101 · Oct 24, 2025 · Updated 5 months ago
- A lightweight library for tensorflow 2.0 ☆65 · Dec 3, 2019 · Updated 6 years ago
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496) ☆15 · Nov 4, 2024 · Updated last year
- ☆12 · Sep 26, 2019 · Updated 6 years ago
- Limitations of the Empirical Fisher Approximation ☆49 · Mar 3, 2025 · Updated last year
- Label shift experiments ☆17 · Dec 3, 2020 · Updated 5 years ago
- Cyclemoid implementation for PyTorch ☆90 · Apr 2, 2022 · Updated 3 years ago
- torch-optimizer -- collection of optimizers for Pytorch ☆3,167 · Mar 22, 2024 · Updated 2 years ago
- Distributed K-FAC preconditioner for PyTorch ☆95 · Mar 17, 2026 · Updated last week
- Natural Gradient, Variational Inference ☆29 · Jan 13, 2020 · Updated 6 years ago
- A generic library for linear and non-linear Gaussian smoothing problems. The code leverages JAX and implements several linearization algo… ☆13 · Dec 4, 2024 · Updated last year
- Figure sizes, font sizes, fonts, and more configurations at minimal overhead. Fix your journal papers, conference proceedings, and other … ☆737 · Updated this week
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti… ☆11 · Nov 28, 2023 · Updated 2 years ago
- Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients" ☆1,068 · Aug 9, 2024 · Updated last year
- code for "Semi-Discrete Normalizing Flows through Differentiable Tessellation" ☆26 · Dec 10, 2022 · Updated 3 years ago
- Simple CIFAR10 ResNet example with JAX. ☆23 · Jun 1, 2021 · Updated 4 years ago
- The Concept Bottleneck Shift Detection (CBSD) methods for explaining and detecting various dataset shifts. ☆14 · Jun 22, 2021 · Updated 4 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts" ☆25 · Nov 2, 2021 · Updated 4 years ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets" ☆23 · Dec 4, 2024 · Updated last year
- "Layer-wise Adaptive Rate Scaling" in PyTorch ☆87 · Jan 22, 2021 · Updated 5 years ago
- Probabilistic Numerics in Python. ☆458 · Jul 3, 2025 · Updated 8 months ago
- Code for Unbiased Implicit Variational Inference (UIVI) ☆15 · Jan 18, 2019 · Updated 7 years ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions ☆259 · Oct 29, 2023 · Updated 2 years ago
- Python package to download and use the SSB datasets ☆11 · Aug 3, 2023 · Updated 2 years ago