This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explained Without the Implicit Bias of Gradient Descent"
☆39Mar 2, 2023Updated 3 years ago
Alternatives and similar repositories for optimizer
Users that are interested in optimizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Oct 12, 2022Updated 3 years ago
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)☆35Sep 28, 2025Updated 8 months ago
- ☆18Jan 17, 2024Updated 2 years ago
- We define and estimate smooth unique information of samples with respect to classifier weights and predictions. We compute these quantiti…☆11Mar 9, 2021Updated 5 years ago
- Code for NIPS 2015 "Gradient-Free Hamiltonian Monte Carlo via Effecient Kernel Exponential Families"☆26Jun 7, 2018Updated 8 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Unbiased Markov chain Monte Carlo with couplings☆31Jun 2, 2026Updated last week
- Official code for Deep Bayesian Video Frame Interpolation (ECCV2022)☆18May 29, 2023Updated 3 years ago
- Preparing for ML Interviews.☆53Jan 12, 2026Updated 5 months ago
- [NeurIPS2023] "Selectivity Drives Productivity: Efficient Dataset Pruning for Enhanced Transfer Learning" by Yihua Zhang*, Yimeng Zhang*,…☆14Oct 12, 2023Updated 2 years ago
- A simple shellscript for splitting the PDF of a paper into the main body and an appendix.☆18Jun 1, 2020Updated 6 years ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆20Jan 7, 2022Updated 4 years ago
- Repository for the code assignment of the Deep Learning 1 course, Fall 2022 edition☆20Dec 9, 2022Updated 3 years ago
- The implementation for the paper `Byte-Pair Encoding for Text-to-SQL Generation`.☆14Feb 26, 2020Updated 6 years ago
- Repo for our work "Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence"☆20Jun 2, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- MSCOCO caption evaluation codes for use with arbitrary image and text data☆11Apr 27, 2016Updated 10 years ago
- SqueezeNet in Tensorflow☆10Jun 7, 2017Updated 9 years ago
- Grounding statistical machine translation with semantic parsing☆14May 13, 2015Updated 11 years ago
- some scripts for the couplings enthusiasts!☆33Jul 21, 2020Updated 5 years ago
- T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)☆55Oct 6, 2025Updated 8 months ago
- Multilingual Compositional Wikidata Questions (MCWQ)☆21Jun 12, 2023Updated 3 years ago
- ☆39Oct 21, 2022Updated 3 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago
- [ICML2024] "FedLMT: Tackling System Heterogeneity of Federated Learning via Low-Rank Model Training with Theoretical Guarantees" by Jiaha…☆14Sep 22, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆18May 15, 2026Updated 3 weeks ago
- (ICLR 2026) Optimas: Optimizing Compound AI Systems☆80Feb 6, 2026Updated 4 months ago
- Training vision models with full-batch gradient descent and regularization☆40Feb 14, 2023Updated 3 years ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 3 years ago
- Towards Unified and Effective Domain Generalization☆34Nov 27, 2023Updated 2 years ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Oct 22, 2023Updated 2 years ago
- [ICLR'24] Heterogeneous Personalized Federated Learning by Local-Global Updates Mixing via Convergence Rate☆13Jun 17, 2025Updated 11 months ago
- ☆13Dec 22, 2024Updated last year
- Zero-Shot Cross-Lingual Semantic Parsing (Sherborne & Lapata, ACL 2022)☆17May 16, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Extending Conformal Prediction to LLMs☆70Jun 21, 2024Updated last year
- Cheat sheet for interacting with the SLURM scheulder☆17Jun 1, 2017Updated 9 years ago
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆13Aug 14, 2024Updated last year
- Author implementation of the paper "Don’t paraphrase, detect! Rapid and Effective Data Collection for Semantic Parsing"☆20Oct 5, 2020Updated 5 years ago
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆38Feb 21, 2026Updated 3 months ago
- Normalized Wasserstein for Mixture Distributions☆11Mar 24, 2023Updated 3 years ago
- A collection of deep reinforcement learning-based & GFlowNet drug molecule generators focused on generation of molecules using Graphs/SEL…☆10Dec 11, 2022Updated 3 years ago