Learning Rate Warmup in PyTorch
☆415Jun 19, 2025Updated 10 months ago
Alternatives and similar repositories for pytorch_warmup
Users that are interested in pytorch_warmup are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Gradually-Warmup Learning Rate Scheduler for PyTorch☆990Oct 10, 2024Updated last year
- ☆469Apr 8, 2023Updated 3 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase☆1,207Dec 22, 2023Updated 2 years ago
- A learning rate range test implementation in PyTorch☆1,004Jun 24, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- torch-optimizer -- collection of optimizers for Pytorch☆3,168Mar 22, 2024Updated 2 years ago
- ☆13Oct 8, 2021Updated 4 years ago
- SAM: Sharpness-Aware Minimization (PyTorch)☆1,975Feb 21, 2024Updated 2 years ago
- Chainer implementation of CIFAR-10 dataset training☆12Dec 7, 2022Updated 3 years ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,739Apr 29, 2026Updated last week
- neural network based speaker embedder☆25Jan 7, 2023Updated 3 years ago
- Deploy KoGPT with Triton Inference Server☆14Nov 18, 2022Updated 3 years ago
- On the Variance of the Adaptive Learning Rate and Beyond☆2,551Jul 31, 2021Updated 4 years ago
- A new architecture of semantic segmentation called Dense-Attention Networks.☆14Nov 10, 2018Updated 7 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch☆2,185Nov 27, 2024Updated last year
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-…☆66Jul 31, 2023Updated 2 years ago
- An implementation of capsule routing for sound event detection☆15Jan 29, 2019Updated 7 years ago
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆25,139May 1, 2026Updated last week
- A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection☆22Nov 14, 2024Updated last year
- Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models☆816Jun 8, 2025Updated 11 months ago
- PyTorch implementation of some learning rate schedulers for deep learning researcher.☆91Sep 6, 2022Updated 3 years ago
- Configuration classes enabling type-safe PyTorch configuration for Hydra apps☆228Mar 12, 2026Updated last month
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…☆9,658Apr 29, 2026Updated last week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code release for ConvNeXt model☆6,365Jan 8, 2023Updated 3 years ago
- AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)☆417Jan 13, 2021Updated 5 years ago
- Utility functions that I reuse across different projects☆15Jun 4, 2021Updated 4 years ago
- Polynomial Learning Rate Decay Scheduler for PyTorch☆65Dec 25, 2021Updated 4 years ago
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆12,795Apr 25, 2026Updated 2 weeks ago
- Collection of common code that's shared among different research projects in FAIR computer vision team.☆2,237Mar 15, 2026Updated last month
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆9,476Apr 19, 2026Updated 2 weeks ago
- ☆12May 26, 2022Updated 3 years ago
- Count the MACs / FLOPs of your PyTorch model.☆5,086Jul 8, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆31,128Updated this week
- Code release for ConvNeXt V2 model☆2,021Aug 14, 2024Updated last year
- Machine learning metrics for distributed, scalable PyTorch applications.☆2,432May 2, 2026Updated last week
- Implementation of Online Label Smoothing in PyTorch☆96Sep 27, 2022Updated 3 years ago
- View model summaries in PyTorch!☆2,932May 3, 2026Updated last week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,956Apr 30, 2026Updated last week
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year