Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)
☆14Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for top-sgd
Users that are interested in top-sgd are comparing it to the libraries listed below.
- MATLAB MEX implementation of SVRG-SBB algorithms☆12Nov 28, 2017Updated 8 years ago
- Adaptive gradient descent without descent☆52Oct 12, 2021Updated 4 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- L4: Practical loss-based stepsize adaptation for PyTorch☆18May 7, 2021Updated 4 years ago
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆11Sep 28, 2023Updated 2 years ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated last year
- ☆11Dec 8, 2022Updated 3 years ago
- ☆14May 3, 2024Updated last year
- This is the public repo for the course HMMA238 'Software Development'☆11Apr 20, 2021Updated 4 years ago
- Unconstrained optimization algorithms in python, line search and trust region methods☆18Dec 19, 2018Updated 7 years ago
- Code for the API, workload execution, and agents underlying the LLMail-Inject Adaptive Prompt Injection Challenge☆22Mar 1, 2026Updated 3 weeks ago
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.☆15Mar 28, 2021Updated 5 years ago
- ClockBench - Visual Reasoning AI Benchmark☆31Sep 4, 2025Updated 6 months ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 2 years ago
- ☆21Jan 23, 2024Updated 2 years ago
- Proximal gradient algorithm for convex optimization, using a diagonal +/- rank-1 norm☆22Dec 27, 2024Updated last year
- ☆13Aug 7, 2023Updated 2 years ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- Software dev. for data science (Python)☆15May 1, 2025Updated 10 months ago
- El0ps: An Exact L0-Problem Solver☆13Jan 6, 2026Updated 2 months ago
- ZOSVRG-BlackBox-Adv☆13Oct 30, 2018Updated 7 years ago
- A Likelihood framework brought to you with from the Weizmann stat. team☆13Feb 8, 2020Updated 6 years ago
- End-to-End Probabilistic Inference for Nonstationary Audio Analysis☆12Aug 7, 2019Updated 6 years ago
- This repository regroups learning resources about performance estimation problems☆15Mar 18, 2026Updated last week
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023)☆14Nov 25, 2024Updated last year
- A library of tools for compiler construction.☆12May 18, 2016Updated 9 years ago
- A suite of stochastic optimization methods for solving the empirical risk minimization problem.☆17Nov 20, 2019Updated 6 years ago
- ☆15Dec 1, 2016Updated 9 years ago
- A Fast sketching based solver for large scale ridge regression☆17Jun 7, 2024Updated last year
- ☆14May 4, 2024Updated last year
- MMD-FUSE package implementing the MMD-FUSE test proposed in MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data …☆11May 31, 2024Updated last year
- Notebooks from DS3 course on practical optimization☆15Jan 5, 2021Updated 5 years ago
- ☆13Feb 4, 2026Updated last month
- Comparison between GFlowNets & Maximum Entropy RL☆19Feb 19, 2024Updated 2 years ago
- Modeling and Analysis of (Statistical) Genetics data in python☆16Jun 12, 2025Updated 9 months ago
- ☆23Jun 15, 2022Updated 3 years ago
- Baselines for Model-Based Optimization installation fixes and compatible with newer AMPERE+ GPUs (e.g. 3090)☆11Apr 30, 2023Updated 2 years ago
- Suppress mouse & keyboard events on MacOSX. Baby-proof my Mac!☆14Oct 19, 2023Updated 2 years ago
- PyTorch implementation of LARS (Layer-wise Adaptive Rate Scaling)☆19May 11, 2019Updated 6 years ago