Learning Rate Warmup in PyTorch
☆415Jun 19, 2025Updated 10 months ago
Alternatives and similar repositories for pytorch_warmup
Users that are interested in pytorch_warmup are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Gradually-Warmup Learning Rate Scheduler for PyTorch☆991Oct 10, 2024Updated last year
- ☆468Apr 8, 2023Updated 3 years ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Ranger - a synergistic optimizer using RAdam (Rectified Adam), Gradient Centralization and LookAhead in one codebase☆1,206Dec 22, 2023Updated 2 years ago
- A learning rate range test implementation in PyTorch☆1,004Jun 24, 2025Updated 9 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- torch-optimizer -- collection of optimizers for Pytorch☆3,170Mar 22, 2024Updated 2 years ago
- ☆13Oct 8, 2021Updated 4 years ago
- SAM: Sharpness-Aware Minimization (PyTorch)☆1,969Feb 21, 2024Updated 2 years ago
- Chainer implementation of CIFAR-10 dataset training☆12Dec 7, 2022Updated 3 years ago
- The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights --…☆36,637Apr 9, 2026Updated last week
- neural network based speaker embedder☆25Jan 7, 2023Updated 3 years ago
- Deploy KoGPT with Triton Inference Server☆14Nov 18, 2022Updated 3 years ago
- On the Variance of the Adaptive Learning Rate and Beyond☆2,551Jul 31, 2021Updated 4 years ago
- A new architecture of semantic segmentation called Dense-Attention Networks.☆14Nov 10, 2018Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 🦁 Lion, new optimizer discovered by Google Brain using genetic algorithms that is purportedly better than Adam(w), in Pytorch☆2,182Nov 27, 2024Updated last year
- A lightweight wrapper for PyTorch that provides a simple declarative API for context switching between devices, distributed modes, mixed-…☆66Jul 31, 2023Updated 2 years ago
- An implementation of capsule routing for sound event detection☆15Jan 29, 2019Updated 7 years ago
- Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Py…☆25,043Apr 6, 2026Updated last week
- A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection☆22Nov 14, 2024Updated last year
- Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models☆813Jun 8, 2025Updated 10 months ago
- PyTorch implementation of some learning rate schedulers for deep learning researcher.☆90Sep 6, 2022Updated 3 years ago
- Configuration classes enabling type-safe PyTorch configuration for Hydra apps☆228Mar 12, 2026Updated last month
- 🚀 A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (i…☆9,608Updated this week
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Code release for ConvNeXt model☆6,354Jan 8, 2023Updated 3 years ago
- AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights (ICLR 2021)☆416Jan 13, 2021Updated 5 years ago
- Utility functions that I reuse across different projects☆15Jun 4, 2021Updated 4 years ago
- Advanced AI Explainability for computer vision. Support for CNNs, Vision Transformers, Classification, Object detection, Segmentation, I…☆12,750Mar 29, 2026Updated 3 weeks ago
- Collection of common code that's shared among different research projects in FAIR computer vision team.☆2,233Mar 15, 2026Updated last month
- Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)☆9,456Apr 9, 2026Updated last week
- ☆12May 26, 2022Updated 3 years ago
- Count the MACs / FLOPs of your PyTorch model.☆5,082Jul 8, 2024Updated last year
- Pretrain, finetune ANY AI model of ANY size on 1 or 10,000+ GPUs with zero code changes.☆31,056Updated this week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Code release for ConvNeXt V2 model☆2,010Aug 14, 2024Updated last year
- Machine learning metrics for distributed, scalable PyTorch applications.☆2,429Updated this week
- Implementation of Online Label Smoothing in PyTorch☆96Sep 27, 2022Updated 3 years ago
- View model summaries in PyTorch!☆2,912Updated this week
- A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch☆8,949Updated this week
- RWKV6 in native pytorch and triton:)☆11Aug 4, 2024Updated last year
- PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. ⚡🔥⚡☆5,219Aug 16, 2024Updated last year