bgrimmer / LongStepCertificates
Certificates proving the convergence rates claimed in Table 1 of the (forthcoming) paper "Provably Faster Gradient Descent via Long Steps" by Benjamin Grimmer. The Mathematica notebooks include everything in rational form, along with exact-arithmetic computations verifying all of the needed (spectral) properties of the certificates.
☆8Updated last year
Alternatives and similar repositories for LongStepCertificates
Users who are interested in LongStepCertificates are comparing it to the repositories listed below
- Repo for solving ARC problems with a Neural Cellular Automata☆17Updated last month
- Plancraft is a Minecraft environment and agent suite to test planning capabilities in LLMs☆15Updated last week
- Guides, courses & reading lists for learning to build autonomous LLM agents☆31Updated last month
- Code to generate an infinite zoom animation.☆11Updated last year
- Training hybrid models for dummies.☆25Updated 6 months ago
- Code for the arXiv paper "Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle"☆27Updated last year
- Implementation of SelfExtend from the paper "LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning" in PyTorch and Zeta☆13Updated 8 months ago
- We study toy models of skill learning.☆29Updated 5 months ago
- ☆25Updated 3 months ago
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions.☆20Updated last year
- MPI Code Generation through Domain-Specific Language Models☆14Updated 7 months ago
- Implementation of Spectral State Space Models☆16Updated last year
- Lottery Ticket Adaptation☆39Updated 7 months ago
- Repository for "GIST: Distributed training for large-scale graph convolutional networks"☆15Updated 2 years ago
- ☆11Updated last year
- Repository to create traveling waves integrate special information through time☆53Updated 4 months ago
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆13Updated 8 months ago
- ☆39Updated last year
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆30Updated this week
- Flexible, efficient, and context-aware generation from large unstructured knowledge sources.☆16Updated last year
- ☆16Updated last week
- All the World's a (Hyper)Graph: A Data Drama (DSH 2023)☆15Updated 3 years ago
- Minimum Description Length probing for neural network representations☆18Updated 5 months ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆24Updated 9 months ago
- ☆33Updated 6 months ago
- Because it's there.☆16Updated 9 months ago
- ☆55Updated 3 weeks ago
- The application is an end-user training and evaluation system for standard knowledge graph embedding models. It was developed to optimise …☆18Updated last month
- CogNetX is an advanced, multimodal neural network architecture inspired by human cognition. It integrates speech, vision, and video proce…☆15Updated 2 weeks ago
- ☆40Updated 2 months ago