SirRob1997 / Crowded-Valley---Results
This repository contains the results for the paper: "Descending through a Crowded Valley - Benchmarking Deep Learning Optimizers"
☆184 · Jul 17, 2021 · Updated 4 years ago
Alternatives and similar repositories for Crowded-Valley---Results
Users who are interested in Crowded-Valley---Results compare it to the repositories listed below.
- DeepOBS: A Deep Learning Optimizer Benchmark Suite · ☆108 · Dec 21, 2023 · Updated 2 years ago
- BackPACK - a backpropagation package built on top of PyTorch which efficiently computes quantities other than the gradient. · ☆604 · Nov 28, 2025 · Updated 2 months ago
- ☆37 · Feb 4, 2022 · Updated 4 years ago
- Cockpit: A Practical Debugging Tool for Training Deep Neural Networks · ☆488 · Jul 1, 2022 · Updated 3 years ago
- [TMLR 2022] Curvature access through the generalized Gauss-Newton's low-rank structure: Eigenvalues, eigenvectors, directional derivative… · ☆17 · Jul 19, 2023 · Updated 2 years ago
- Funny Application of Neural Head Reenactment to Naver Webtoon · ☆10 · Mar 22, 2021 · Updated 4 years ago
- Probabilistic numerical finite differences. Compute finite difference weights and differentiation matrices on scattered data sites and wi… · ☆11 · May 8, 2023 · Updated 2 years ago
- Label shift experiments · ☆17 · Dec 3, 2020 · Updated 5 years ago
- Implements stochastic line search · ☆118 · Mar 14, 2023 · Updated 2 years ago
- ☆12 · Sep 26, 2019 · Updated 6 years ago
- The Concept Bottleneck Shift Detection (CBSD) methods for explaining and detecting various dataset shifts. · ☆14 · Jun 22, 2021 · Updated 4 years ago
- Distributed K-FAC preconditioner for PyTorch · ☆95 · Updated this week
- Implicit MLE: Backpropagating Through Discrete Exponential Family Distributions · ☆259 · Oct 29, 2023 · Updated 2 years ago
- Implementation of the models and datasets used in "An Information-theoretic Approach to Distribution Shifts" · ☆25 · Nov 2, 2021 · Updated 4 years ago
- ☆12 · Dec 20, 2019 · Updated 6 years ago
- Probabilistic ODE solvers are fun, but are they fast? See also: https://github.com/pnkraemer/probdiffeq for JAX code or https://github.c… · ☆21 · Jul 20, 2024 · Updated last year
- A lightweight library for tensorflow 2.0 · ☆65 · Dec 3, 2019 · Updated 6 years ago
- ☆29 · Oct 18, 2022 · Updated 3 years ago
- torch-optimizer -- collection of optimizers for Pytorch · ☆3,161 · Mar 22, 2024 · Updated last year
- Github for the conference paper GLOD-Gaussian Likelihood OOD detector · ☆16 · Apr 18, 2022 · Updated 3 years ago
- Cyclemoid implementation for PyTorch · ☆90 · Apr 2, 2022 · Updated 3 years ago
- Natural Gradient, Variational Inference · ☆29 · Jan 13, 2020 · Updated 6 years ago
- "Layer-wise Adaptive Rate Scaling" in PyTorch · ☆87 · Jan 22, 2021 · Updated 5 years ago
- ☆15 · Dec 28, 2020 · Updated 5 years ago
- ☆37 · May 28, 2023 · Updated 2 years ago
- Repository for NeurIPS 2020 Spotlight "AdaBelief Optimizer: Adapting stepsizes by the belief in observed gradients" · ☆1,068 · Aug 9, 2024 · Updated last year
- notebooks of cool EBM visualizations · ☆15 · Feb 12, 2021 · Updated 5 years ago
- ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning · ☆283 · Feb 27, 2023 · Updated 2 years ago
- Sketched linear operations for PyTorch · ☆100 · Oct 24, 2025 · Updated 3 months ago
- ☆22 · Dec 3, 2021 · Updated 4 years ago
- (ACL-IJCNLP 2021) Convolutions and Self-Attention: Re-interpreting Relative Positions in Pre-trained Language Models. · ☆21 · Jul 13, 2022 · Updated 3 years ago
- ☆34 · Aug 30, 2021 · Updated 4 years ago
- In this paper, we show that the performance of a learnt generative model is closely related to the model's ability to accurately represen… · ☆41 · Mar 26, 2021 · Updated 4 years ago
- MONeT framework for reducing memory consumption of DNN training · ☆174 · May 4, 2021 · Updated 4 years ago
- Code release for "Understanding Bias in Large-Scale Visual Datasets" · ☆22 · Dec 4, 2024 · Updated last year
- Measuring if attention is explanation with ROAR · ☆22 · Mar 3, 2023 · Updated 2 years ago
- Experiments for Meta-Learning Symmetries by Reparameterization · ☆58 · Apr 11, 2021 · Updated 4 years ago
- higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual tr… · ☆1,629 · Mar 25, 2022 · Updated 3 years ago
- TF/Keras code for DiffStride, a pooling layer with learnable strides. · ☆124 · Feb 7, 2022 · Updated 4 years ago