bgrimmer / LongStepCertificatesLinks
Certificates proving the convergence rates claimed in Table 1 of the (forthcoming) paper "Provably Faster Gradient Descent via Long Steps" by Benjamin Grimmer. The Mathematica notebooks include everything in rational form and computations (exact arithmetic) verifying all of the need (spectral) properties of the certificates.
☆8Updated last year
Alternatives and similar repositories for LongStepCertificates
Users that are interested in LongStepCertificates are comparing it to the libraries listed below
Sorting:
- Repo for solving arc problems with an Neural Cellular Automata☆16Updated last month
- ☆16Updated last year
- Implementation of Spectral State Space Models☆16Updated last year
- Source-to-Source Debuggable Derivatives in Pure Python☆15Updated last year
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆24Updated 8 months ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated 2 years ago
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated last month
- 📰 Computing the information content of trained neural networks☆21Updated 3 years ago
- Minimum Description Length probing for neural network representations☆18Updated 4 months ago
- Latent Large Language Models☆18Updated 10 months ago
- Experimental scripts for researching data adaptive learning rate scheduling.☆23Updated last year
- ☆23Updated 6 months ago
- ☆11Updated last year
- Exploration into the Firefly algorithm in Pytorch☆40Updated 4 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated last month
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆26Updated last year
- Implementation of a holodeck, written in Pytorch☆18Updated last year
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated last year
- ☆11Updated 4 months ago
- ☆18Updated last year
- Repository for "GIST: Distributed training for large-scale graph convolutional networks"☆15Updated 2 years ago
- A scalable implementation of diffusion and flow-matching with XGBoost models, applied to calorimeter data.☆18Updated 7 months ago
- Implementation of Metaformer, but in an autoregressive manner☆25Updated 3 years ago
- MPI Code Generation through Domain-Specific Language Models☆14Updated 7 months ago
- ☆25Updated last month
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" from Pytorch and Zeta☆13Updated 7 months ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated last year
- ☆14Updated 2 years ago
- Implementation and explorations into Blackbox Gradient Sensing (BGS), an evolutionary strategies approach proposed in a Google Deepmind p…☆16Updated 3 weeks ago
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated 5 months ago