D-Adaptation for SGD, Adam and AdaGrad
☆529 · Jan 22, 2025 · Updated last year
Alternatives and similar repositories for dadaptation
Users interested in dadaptation are comparing it to the libraries listed below.
- The Prodigy optimizer and its variants for training neural networks. ☆456 · Jan 16, 2025 · Updated last year
- ☆36 · Jan 23, 2024 · Updated 2 years ago
- 🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch ☆2,183 · Nov 27, 2024 · Updated last year
- Schedule-Free Optimization in PyTorch ☆2,271 · May 21, 2025 · Updated 10 months ago
- Experimental scripts for researching data adaptive learning rate scheduling. ☆22 · Oct 18, 2023 · Updated 2 years ago
- Parameter-Free Optimizers for Pytorch ☆131 · Apr 23, 2024 · Updated last year
- Euclidean Wasserstein-2 optimal transportation ☆46 · Aug 19, 2023 · Updated 2 years ago
- The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training” ☆991 · Jan 30, 2024 · Updated 2 years ago
- maximal update parametrization (µP) ☆1,695 · Jul 17, 2024 · Updated last year
- ☆213 · Oct 10, 2022 · Updated 3 years ago
- FFCV-SSL Fast Forward Computer Vision for Self-Supervised Learning. ☆215 · Aug 1, 2023 · Updated 2 years ago
- A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries. ☆1,254 · Updated this week
- Hackable and optimized Transformers building blocks, supporting a composable construction. ☆10,411 · Mar 30, 2026 · Updated 2 weeks ago
- Effortless plugin and play Optimizer to cut model training costs by 50%. New optimizer that is 2x faster than Adam on LLMs. ☆382 · Jun 4, 2024 · Updated last year
- TensorDict is a pytorch dedicated tensor container. ☆1,019 · Updated this week
- AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021) ☆416 · Jan 13, 2021 · Updated 5 years ago
- Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion. ☆2,490 · Feb 12, 2026 · Updated 2 months ago
- optimizer & lr scheduler & loss function collections in PyTorch ☆399 · Mar 31, 2026 · Updated last week
- MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvement… ☆417 · Apr 3, 2026 · Updated last week
- Named tensors with first-class dimensions for PyTorch ☆332 · Jun 14, 2023 · Updated 2 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification ☆11 · Aug 12, 2023 · Updated 2 years ago
- A script that displays Japanese translations of new arXiv paper abstracts, plus extras, in Notion every day ☆13 · Jan 22, 2023 · Updated 3 years ago
- Lightweight Cluster/Cloud VM Job Management 🚀 ☆42 · Aug 27, 2024 · Updated last year
- A playbook for systematically maximizing the performance of deep learning models. ☆29,988 · Jun 18, 2024 · Updated last year
- A concise but complete full-attention transformer with a set of promising experimental features from various papers ☆5,816 · Mar 27, 2026 · Updated 2 weeks ago
- FFCV: Fast Forward Computer Vision (and other ML workloads!) ☆2,990 · Jun 16, 2024 · Updated last year
- Trains Transformer model variants. Data isn't shuffled between batches. ☆143 · Oct 5, 2022 · Updated 3 years ago
- Type annotations and dynamic checking for a tensor's shape, dtype, names, etc. ☆1,476 · May 2, 2025 · Updated 11 months ago
- ☆22 · Nov 9, 2024 · Updated last year
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others) ☆9,450 · Feb 20, 2026 · Updated last month
- Scaling Data-Constrained Language Models ☆343 · Jun 28, 2025 · Updated 9 months ago
- Accessible large language models via k-bit quantization for PyTorch. ☆8,107 · Updated this week
- For optimization algorithm research and development. ☆562 · Mar 19, 2026 · Updated 3 weeks ago
- Fast, Modern, and Low Precision PyTorch Optimizers ☆130 · Dec 29, 2025 · Updated 3 months ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings ☆15 · May 3, 2023 · Updated 2 years ago
- Code for our NeurIPS 2022 paper ☆371 · Jan 13, 2023 · Updated 3 years ago
- Neural Ensemble Search for Uncertainty Estimation and Dataset Shift ☆35 · Jan 10, 2026 · Updated 3 months ago
- PyTorch extensions for high performance and large scale training. ☆3,405 · Apr 26, 2025 · Updated 11 months ago
- Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models ☆813 · Jun 8, 2025 · Updated 10 months ago