Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)
☆14Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for top-sgd
Users that are interested in top-sgd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Image Reconstructor that applies fast proximal gradient method (FISTA) to the wavelet transform of an image using L1 and Total Variati…☆11Sep 25, 2022Updated 3 years ago
- source for Stochastic Conjugate Gradient Algorithm with Variance Reduction☆10Nov 22, 2017Updated 8 years ago
- ☆34Jan 25, 2024Updated 2 years ago
- MATLAB implementations of a variety of machine learning/signal processing algorithms.☆11Aug 24, 2016Updated 9 years ago
- MATLAB MEX implementation of SVRG-SBB algorithms☆12Nov 28, 2017Updated 8 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Adaptive gradient descent without descent☆53Oct 12, 2021Updated 4 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆11Sep 28, 2023Updated 2 years ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆28Feb 17, 2025Updated last year
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- ☆10Jul 6, 2021Updated 4 years ago
- Code for the API, workload execution, and agents underlying the LLMail-Inject Adpative Prompt Injection Challenge☆23Apr 9, 2026Updated last week
- ☆17Dec 7, 2025Updated 4 months ago
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.☆15Mar 28, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆10Apr 8, 2021Updated 5 years ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"☆19Jun 10, 2023Updated 2 years ago
- Implementations of the algorithms described in the paper: On the Convergence Theory for Hessian-Free Bilevel Algorithms.☆11Nov 1, 2024Updated last year
- Code for AAAI 2024 paper: CR-SAM: Curvature Regularized Sharpness-Aware Minimization☆12Nov 29, 2024Updated last year
- ClockBench - Visual Reasoning AI Benchmark☆30Sep 4, 2025Updated 7 months ago
- A collection of stochastic proximal gradient methods for composite non-convex problems.☆24Jun 17, 2020Updated 5 years ago
- ☆21Jan 23, 2024Updated 2 years ago
- Proximal gradient algorithm for convex optimization, using a diagonal +/- rank-1 norm☆22Dec 27, 2024Updated last year
- Experiments with Super-Universal Newton method.☆13Aug 12, 2022Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- This is the PyTorch Implementation of "AETTA: Label-Free Accuracy Estimation for Test-Time Adaptation (CVPR '24)" by Taeckyung Lee, Sorn …☆14May 21, 2025Updated 10 months ago
- Software dev. for data science (Python)☆16May 1, 2025Updated 11 months ago
- Code for CVPR 2023 Robust Generalization against Photon-Limited Corruptions via Worst-Case Sharpness Minimization☆13Mar 27, 2023Updated 3 years ago
- Source codes of "Fast Continuous Subgraph Matching over Streaming Graphs via Backtracking Reduction", SIGMOD 2023☆13Sep 7, 2023Updated 2 years ago
- ZOSVRG-BlackBox-Adv☆13Oct 30, 2018Updated 7 years ago
- End-to-End Probabilistic Inference for Nonstationary Audio Analysis☆12Aug 7, 2019Updated 6 years ago
- ☆20Nov 5, 2019Updated 6 years ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023)☆14Nov 25, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆49Dec 28, 2021Updated 4 years ago
- A suite of stochastic optimization methods for solving the empirical risk minimization problem.☆17Nov 20, 2019Updated 6 years ago
- ☆15Dec 1, 2016Updated 9 years ago
- A Fast sketching based solver for large scale ridge regression☆17Jun 7, 2024Updated last year
- ☆14May 4, 2024Updated last year
- MMD-FUSE package implementing the MMD-FUSE test proposed in MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data …☆11May 31, 2024Updated last year
- Notebooks from DS3 course on practical optimization☆15Jan 5, 2021Updated 5 years ago