This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"
☆184Jul 17, 2021Updated 4 years ago
Alternatives and similar repositories for Crowded-Valley---Results
Users that are interested in Crowded-Valley---Results are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DeepOBS: A Deep Learning Optimizer Benchmark Suite☆110Dec 21, 2023Updated 2 years ago
- [TMLR 2022] Curvature access through the generalized Gauss-Newton's low-rank structure: Eigenvalues, eigenvectors, directional derivative…☆17Jul 19, 2023Updated 2 years ago
- Code accompanying the NeurIPS 2021 Paper: A Probabilistic State Space Model for Joint Inference from Differential Equations and Data (Sch…☆13Nov 7, 2022Updated 3 years ago
- BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient.☆611Nov 28, 2025Updated 5 months ago
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks☆487Jul 1, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆37Feb 4, 2022Updated 4 years ago
- Source code for my PhD thesis: Backpropagation Beyond the Gradient☆21Feb 25, 2023Updated 3 years ago
- Efficient SDE samplers including Gaussian-based probabilistic solvers. Written in JAX.☆10Feb 8, 2025Updated last year
- [ICML 2024] SINGD: KFAC-like Structured Inverse-Free Natural Gradient Descent (http://arxiv.org/abs/2312.05705)☆24Nov 4, 2024Updated last year
- Implements stochastic line search☆118Mar 14, 2023Updated 3 years ago
- This code accompanies the manuscript "A Generative Framework for Probabilistic, Spatiotemporally Coherent Downscaling of Climate Simulati…☆37Jul 21, 2025Updated 9 months ago
- Posterior Refinement Improves Sample Efficiency in Bayesian Neural Networks☆10Oct 21, 2022Updated 3 years ago
- A package for computing matrix exponentials and finite horizon Gramians☆11Jan 21, 2026Updated 3 months ago
- Sketched linear operations for PyTorch☆101Oct 24, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A lightweight library for tensorflow 2.0☆65Dec 3, 2019Updated 6 years ago
- [ICML 2024] SIRFShampoo: Structured inverse- and root-free Shampoo in PyTorch (https://arxiv.org/abs/2402.03496)☆15Nov 4, 2024Updated last year
- ☆12Sep 26, 2019Updated 6 years ago
- Limitations of the Empirical Fisher Approximation☆49Mar 3, 2025Updated last year
- Label shift experiments☆17Dec 3, 2020Updated 5 years ago
- Cyclemoid implementation for PyTorch☆90Apr 2, 2022Updated 4 years ago
- torch-optimizer -- collection of optimizers for Pytorch☆3,168Mar 22, 2024Updated 2 years ago
- Distributed K-FAC preconditioner for PyTorch☆97Apr 30, 2026Updated last week
- Natural Gradient, Variational Inference☆29Jan 13, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A generic library for linear and non-linear Gaussian smoothing problems. The code leverages JAX and implements several linearization algo…☆13Apr 20, 2026Updated 2 weeks ago
- Github for the conference paper GLOD-Gaussian Likelihood OOD detector☆16Apr 18, 2022Updated 4 years ago
- Code for the ICCV 2023 paper "Benchmarking Low-Shot Robustness to Natural Distribution Shifts"☆11Jan 21, 2024Updated 2 years ago
- Figure sizes, font sizes, fonts, and more configurations at minimal overhead. Fix your journal papers, conference proceedings, and other …☆738Mar 24, 2026Updated last month
- [ICML2023] Instant Soup Cheap Pruning Ensembles in A Single Pass Can Draw Lottery Tickets from Large Models. Ajay Jaiswal, Shiwei Liu, Ti…☆11Nov 28, 2023Updated 2 years ago
- Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients"☆1,072Aug 9, 2024Updated last year
- Simple CIFAR10 ResNet example with JAX.☆23Jun 1, 2021Updated 4 years ago
- code for "Semi-Discrete Normalizing Flows through Differentiable Tessellation"☆26Dec 10, 2022Updated 3 years ago
- The Concept Bottleneck Shift Detection (CBSD) methods for explaining and detecting various dataset shifts.☆14Jun 22, 2021Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts"☆25Nov 2, 2021Updated 4 years ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets"☆23Dec 4, 2024Updated last year
- "Layer-wise Adaptive Rate Scaling" in PyTorch☆87Jan 22, 2021Updated 5 years ago
- Probabilistic Numerics in Python.☆460Jul 3, 2025Updated 10 months ago
- Code for Unbiased Implicit Variational Inference (UIVI)☆15Jan 18, 2019Updated 7 years ago
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions☆260Oct 29, 2023Updated 2 years ago
- ☆28Oct 18, 2022Updated 3 years ago