This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explained Without the Implicit Bias of Gradient Descent"
☆39Mar 2, 2023Updated 3 years ago
Alternatives and similar repositories for optimizer
Users that are interested in optimizer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆18Oct 12, 2022Updated 3 years ago
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)☆33Sep 28, 2025Updated 5 months ago
- ☆18Jan 17, 2024Updated 2 years ago
- The Happy Faces Benchmark☆15Jul 20, 2023Updated 2 years ago
- We define and estimate smooth unique information of samples with respect to classifier weights and predictions. We compute these quantiti…☆11Mar 9, 2021Updated 5 years ago
- Code for NIPS 2015 "Gradient-Free Hamiltonian Monte Carlo via Effecient Kernel Exponential Families"☆26Jun 7, 2018Updated 7 years ago
- ☆24Feb 16, 2024Updated 2 years ago
- Effective Attention Sheds Light On Interpretability - Findings of ACL2021☆11May 16, 2021Updated 4 years ago
- Custom triton kernels for training Karpathy's nanoGPT.☆19Oct 21, 2024Updated last year
- Official code for Deep Bayesian Video Frame Interpolation (ECCV2022)☆18May 29, 2023Updated 2 years ago
- Method to find contrastive dimensions between experimental conditions☆28Mar 5, 2026Updated 2 weeks ago
- ☆35Sep 23, 2022Updated 3 years ago
- (NeurIPS 2022) Official Implementation of Public Wisdom Matters! Discourse-Aware Hyperbolic Fourier Co-Attention for Social-Text Classifi…☆32Jan 16, 2025Updated last year
- Code for Generalization Guarantees for (Multi-Modal) Imitation Learning☆11Jul 14, 2022Updated 3 years ago
- [ICML2025] Official Repo for Paper "Optimizing Temperature for Language Models with Multi-Sample Inference"☆22Feb 16, 2025Updated last year
- symbolic regression☆40Jul 20, 2022Updated 3 years ago
- Effect of tokenization on transformers for biological sequence☆22Dec 31, 2025Updated 2 months ago
- ☆18Dec 20, 2018Updated 7 years ago
- Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)☆10Dec 12, 2023Updated 2 years ago
- Code for replicating experiments from the paper, Preference Exploration for Efficient Bayesian Optimization with Multiple Outcomes, publi…☆13Jun 22, 2023Updated 2 years ago
- Spectral Graph Attention Network with Fast Eigen-approximation☆12Dec 24, 2021Updated 4 years ago
- Code for the paper Self-Supervised Learning of Split Invariant Equivariant Representations☆31Sep 4, 2023Updated 2 years ago
- A starter using @SPF13's ported Hyde Theme for Hugo and Forestry as a Content Manager. Demo Site:☆13Jul 26, 2021Updated 4 years ago
- some scripts for the couplings enthusiasts!☆32Jul 21, 2020Updated 5 years ago
- ☆100Dec 8, 2021Updated 4 years ago
- Get up and running with Llama 2 and other large language models locally☆15Updated this week
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- ☆16Jan 28, 2026Updated last month
- Towards Unified and Effective Domain Generalization☆32Nov 27, 2023Updated 2 years ago
- Zero-Shot Cross-Lingual Semantic Parsing (Sherborne & Lapata, ACL 2022)☆17May 16, 2022Updated 3 years ago
- Extending Conformal Prediction to LLMs☆69Jun 21, 2024Updated last year
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆13Aug 14, 2024Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆57May 21, 2023Updated 2 years ago
- ⭐ User Contributions to NeoMutt☆13Jun 12, 2022Updated 3 years ago
- A collection of deep reinforcement learning-based & GFlowNet drug molecule generators focused on generation of molecules using Graphs/SEL…☆10Dec 11, 2022Updated 3 years ago
- Probabilistic Circuits in Julia☆10Dec 27, 2023Updated 2 years ago
- brute but stronger☆11Aug 4, 2022Updated 3 years ago
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆32Mar 9, 2026Updated 2 weeks ago
- Code in support of the paper Continuous Mixtures of Tractable Probabilistic Models☆12Oct 12, 2024Updated last year