Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)
☆14Oct 20, 2023Updated 2 years ago
Alternatives and similar repositories for top-sgd
Users that are interested in top-sgd are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An Image Reconstructor that applies fast proximal gradient method (FISTA) to the wavelet transform of an image using L1 and Total Variati…☆11Sep 25, 2022Updated 3 years ago
- ☆34Jan 25, 2024Updated 2 years ago
- Source code for paper "Trajectory of Alternating Direction Method of Multipliers and Adaptive Acceleration" of NeurIPS 2019☆10Jan 25, 2024Updated 2 years ago
- L4: Practical loss-based stepsize adaptation for PyTorch☆18May 7, 2021Updated 5 years ago
- 最优化方法、凸优化课程作业代码☆18Jan 31, 2020Updated 6 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆11Dec 8, 2022Updated 3 years ago
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆17Jan 12, 2026Updated 4 months ago
- This is the public repo for the course HMMA238 'Software Development'☆11Apr 20, 2021Updated 5 years ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆25Oct 14, 2024Updated last year
- ☆10Jul 6, 2021Updated 4 years ago
- ☆18Dec 7, 2025Updated 5 months ago
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.☆15Mar 28, 2021Updated 5 years ago
- Code for the API, workload execution, and agents underlying the LLMail-Inject Adpative Prompt Injection Challenge☆23Apr 9, 2026Updated last month
- ☆10Apr 8, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Implementations of the algorithms described in the paper: On the Convergence Theory for Hessian-Free Bilevel Algorithms.☆11Nov 1, 2024Updated last year
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 2 years ago
- A collection of stochastic proximal gradient methods for composite non-convex problems.☆24Jun 17, 2020Updated 5 years ago
- ☆22Jan 23, 2024Updated 2 years ago
- Experiments with Super-Universal Newton method.☆13Aug 12, 2022Updated 3 years ago
- [ICLR 2025] This repository contains the code to reproduce the results from our paper From Sparse Dependence to Sparse Attention: Unveili…☆12Mar 7, 2025Updated last year
- This is the PyTorch Implementation of "AETTA: Label-Free Accuracy Estimation for Test-Time Adaptation (CVPR '24)" by Taeckyung Lee, Sorn …☆14May 21, 2025Updated last year
- El0ps: An Exact L0-Problem Solver☆13Jan 6, 2026Updated 4 months ago
- Code for CVPR 2023 Robust Generalization against Photon-Limited Corruptions via Worst-Case Sharpness Minimization☆13Mar 27, 2023Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A Julia package for adaptive proximal gradient and primal-dual algorithms☆11Jan 18, 2024Updated 2 years ago
- End-to-End Probabilistic Inference for Nonstationary Audio Analysis☆12Aug 7, 2019Updated 6 years ago
- This repository regroups learning ressources about performance estimation problems☆15Mar 18, 2026Updated 2 months ago
- Reference implementation of Thin and Deep Gaussian Processes (NeurIPS 2023)☆14Nov 25, 2024Updated last year
- ☆13May 14, 2025Updated last year
- A suite of stochastic optimization methods for solving the empirical risk minimization problem.☆17Nov 20, 2019Updated 6 years ago
- A Fast sketching based solver for large scale ridge regression☆17Jun 7, 2024Updated last year
- Example code for paper "Bilevel Optimization: Nonasymptotic Analysis and Faster Algorithms"☆50Dec 28, 2021Updated 4 years ago
- MMD-FUSE package implementing the MMD-FUSE test proposed in MMD-FUSE: Learning and Combining Kernels for Two-Sample Testing Without Data …☆12May 31, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Notebooks from DS3 course on practical optimization☆15Jan 5, 2021Updated 5 years ago
- ☆13Feb 4, 2026Updated 3 months ago
- Comparison between GFlowNets & Maximum Entropy RL☆19Feb 19, 2024Updated 2 years ago
- ☆23Jun 15, 2022Updated 3 years ago
- Modeling and Analysis of (Statistical) Genetics data in python☆18May 19, 2026Updated last week
- The repo for HiRA paper☆37Jan 9, 2026Updated 4 months ago
- Hessian backpropagation (HBP): PyTorch extension of backpropagation for block-diagonal curvature matrix approximations☆22Mar 25, 2023Updated 3 years ago