Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)
☆14Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for top-sgd
Users that are interested in top-sgd are comparing it to the libraries listed below
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- ☆34Jan 25, 2024Updated 2 years ago
- This is the public repo for the course HMMA238 'Software Development'☆10Apr 20, 2021Updated 4 years ago
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆11Sep 28, 2023Updated 2 years ago
- A Julia package for adaptive proximal gradient and primal-dual algorithms☆11Jan 18, 2024Updated 2 years ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023)☆14Nov 25, 2024Updated last year
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- Probabilistic Finite Volume Method based on Affine Gaussian Process inference☆11Jun 10, 2024Updated last year
- Personalized and Interactive Music Recommendation with Bandit approach☆11Sep 15, 2019Updated 6 years ago
- Vecchia approximations for Gaussian log-likelihoods☆13Updated this week
- MMD-FUSE package implementing the MMD-FUSE test proposed in MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data …☆11May 31, 2024Updated last year
- Implementations of the algorithms described in the paper: On the Convergence Theory for Hessian-Free Bilevel Algorithms.☆10Nov 1, 2024Updated last year
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆16Jan 12, 2026Updated last month
- Code to generate an infinite zoom animation.☆11Nov 9, 2023Updated 2 years ago
- End-to-End Probabilistic Inference for Nonstationary Audio Analysis☆12Aug 7, 2019Updated 6 years ago
- Testing the consistency of binary classification performance scores reported in papers☆12Aug 21, 2025Updated 6 months ago
- PyTorch routines for (Ker)nel (Mac)hines☆11Oct 10, 2025Updated 4 months ago
- ☆13May 3, 2024Updated last year
- MATLAB implementations of a variety of machine learning/signal processing algorithms.☆11Aug 24, 2016Updated 9 years ago
- EPIDEMIC is an easy-to-run Matlab/Octave educational toolkit for epidemiological analysis.☆17Apr 4, 2025Updated 11 months ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- A library of tools for compiler construction.☆12May 18, 2016Updated 9 years ago
- Suppress mouse & keyboard events on MacOSX. Baby-proof my Mac!☆14Oct 19, 2023Updated 2 years ago
- Experiments with Super-Universal Newton method.☆13Aug 12, 2022Updated 3 years ago
- Bayesian inference on spatial and spatiotemporal data, faster than you can say "Cholesky!"☆12Dec 29, 2025Updated 2 months ago
- Puma plugin to export puma stats as prometheus metrics☆11Aug 17, 2022Updated 3 years ago
- Repository of PIXAR, a Pixel-based Auto-Regressive Language Model☆18Sep 15, 2025Updated 5 months ago
- ☆13Sep 26, 2023Updated 2 years ago
- A benchmark of meaningful graph datasets with tabular node features☆14Oct 29, 2025Updated 4 months ago
- ☆11Dec 8, 2022Updated 3 years ago
- Baselines for Model-Based Optimization installation fixes and compatible with newer AMPERE+ GPUs (e.g. 3090)☆11Apr 30, 2023Updated 2 years ago
- code for BINOCULARS and Multi-Step BO☆12Dec 7, 2020Updated 5 years ago
- ☆16May 29, 2025Updated 9 months ago
- [NeurIPS 2025] Elucidated Rolling Diffusion Models for Probabilistic Weather Forecasting☆20Jan 8, 2026Updated 2 months ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated last year
- Code for 'Memory-based dual Gaussian processes for sequential learning' (ICML 2023)☆12Aug 16, 2023Updated 2 years ago
- Metadata converter for Breadcrumbs users. It is made for adapting metadata to Obsidian 1.4.0+'s link support in frontmatter.☆15Jul 29, 2023Updated 2 years ago
- Collaborative inference of latent diffusion via hivemind☆12May 29, 2023Updated 2 years ago
- Webcam demo for SKTBrain/DiscoGAN☆13Sep 11, 2019Updated 6 years ago