KellerJordan / top-sgdView external linksLinks
Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)
☆14Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for top-sgd
Users that are interested in top-sgd are comparing it to the libraries listed below
Sorting:
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- ☆34Jan 25, 2024Updated 2 years ago
- This is the public repo for the course HMMA238 'Software Development'☆10Apr 20, 2021Updated 4 years ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023)☆13Nov 25, 2024Updated last year
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆11Sep 28, 2023Updated 2 years ago
- ☆10Jul 6, 2021Updated 4 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- A Julia package for adaptive proximal gradient and primal-dual algorithms☆11Jan 18, 2024Updated 2 years ago
- Software dev. for data science (Python)☆15May 1, 2025Updated 9 months ago
- Vecchia approximations for Gaussian log-likelihoods☆13Jan 27, 2026Updated 2 weeks ago
- Probabilistic Finite Volume Method based on Affine Gaussian Process inference☆11Jun 10, 2024Updated last year
- End-to-End Probabilistic Inference for Nonstationary Audio Analysis☆12Aug 7, 2019Updated 6 years ago
- ☆13May 3, 2024Updated last year
- Code to generate an infinite zoom animation.☆11Nov 9, 2023Updated 2 years ago
- Pytorch routines for (Ker)nel (Mac)hines☆10Oct 10, 2025Updated 4 months ago
- Implementations of the algorithms described in the paper: On the Convergence Theory for Hessian-Free Bilevel Algorithms.☆10Nov 1, 2024Updated last year
- UAI paper 'Expressive Priors in Bayesian Neural Networks: Kernel Combinations and Periodic Functions'☆11Jun 26, 2019Updated 6 years ago
- ☆11Dec 8, 2022Updated 3 years ago
- Emacs support for Jupyter notebooks☆15Nov 17, 2024Updated last year
- Code for the API, workload execution, and agents underlying the LLMail-Inject Adpative Prompt Injection Challenge☆19Oct 21, 2025Updated 3 months ago
- Functions and classes for doing Gaussian process models of proteins☆11Apr 13, 2018Updated 7 years ago
- ☆16Dec 7, 2025Updated 2 months ago
- macOS transliteration input method for Russian, Hebrew, Ukrainian and Belarusian☆21Feb 1, 2026Updated 2 weeks ago
- A benchmark of meaningful graph datasets with tabular node features☆14Oct 29, 2025Updated 3 months ago
- ☆13Sep 26, 2023Updated 2 years ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated 11 months ago
- ☆16May 29, 2025Updated 8 months ago
- ☆10Aug 19, 2021Updated 4 years ago
- Bayesian variable selection for survival data (tutorial)☆15Jun 4, 2025Updated 8 months ago
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- Project repo for gpSLDS☆17Jan 12, 2026Updated last month
- ☆18Sep 21, 2023Updated 2 years ago
- ☆22Jan 27, 2026Updated 2 weeks ago
- Code for 'Memory-based dual Gaussian processes for sequential learning' (ICML 2023)☆12Aug 16, 2023Updated 2 years ago
- This repository regroups learning ressources about performance estimation problems☆14Sep 19, 2024Updated last year
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.☆15Mar 28, 2021Updated 4 years ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 2 years ago
- Meshed GP for Bayesian spatial big data regression☆15Sep 24, 2025Updated 4 months ago
- [NeurIPS 2025] Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting☆20Jan 8, 2026Updated last month