zeke-xie / stable-weight-decay-regularization
[NeurIPS 2023] The PyTorch Implementation of Scheduled (Stable) Weight Decay.
☆62 · Feb 3, 2024 · Updated 2 years ago
Alternatives and similar repositories for stable-weight-decay-regularization
Users who are interested in stable-weight-decay-regularization are comparing it to the repositories listed below
- [ICML 2021] The official PyTorch Implementations of Positive-Negative Momentum Optimizers. ☆27 · Aug 30, 2022 · Updated 3 years ago
- [Neural Computation, MIT Press] The PyTorch Implementation of Variable Optimizers / Neural Variable Risk Minimization proposed in our Neur… ☆33 · Aug 3, 2021 · Updated 4 years ago
- [ICML 2022, Oral] The PyTorch Implementation of Adaptive Inertia Methods. The algorithms are based on our paper: "Adaptive Inertia: Dise… ☆149 · Feb 17, 2023 · Updated 2 years ago
- Code for "AutoPose: Searching Multi-Scale Branch Aggregation for Pose Estimation" ☆10 · Dec 30, 2021 · Updated 4 years ago
- Benchmark tests supporting the TiledCUDA library. ☆18 · Nov 19, 2024 · Updated last year
- Python implementation for paper: Feature Distillation: DNN-Oriented JPEG Compression Against Adversarial Examples ☆11 · Jun 12, 2018 · Updated 7 years ago
- ☆35 · Dec 5, 2022 · Updated 3 years ago
- Multiple GEMM operators are constructed with cutlass to support LLM inference. ☆20 · Aug 3, 2025 · Updated 6 months ago
- ☆12 · Aug 22, 2025 · Updated 5 months ago
- ☆38 · Aug 7, 2025 · Updated 6 months ago
- Unofficial PyTorch implementation of the paper Filter Response Normalization. ☆19 · Dec 9, 2019 · Updated 6 years ago
- This is a helper for PyTorch-BigGraph ☆22 · Apr 7, 2020 · Updated 5 years ago
- An object detection codebase based on MegEngine. ☆28 · Dec 14, 2022 · Updated 3 years ago
- Official Code for ICLR2022 Paper: Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation Overlap ☆28 · Sep 28, 2025 · Updated 4 months ago
- CaCo: Both Positive and Negative Samples are Directly Learnable via Cooperative-adversarial Contrastive Learning ☆24 · Mar 10, 2024 · Updated last year
- ☆31 · Jun 29, 2022 · Updated 3 years ago
- AdaTask: A Task-Aware Adaptive Learning Rate Approach to Multi-Task Learning. AAAI, 2023. ☆29 · Sep 29, 2023 · Updated 2 years ago
- This project is divided into two parts. In the first study, Lamé parameters are identified using the tanh activation function. After that, six a… ☆13 · Nov 17, 2022 · Updated 3 years ago
- ☆16 · Nov 2, 2025 · Updated 3 months ago
- The dataset repo of "CLCIFAR: CIFAR-Derived Benchmark Datasets with Human Annotated Complementary Labels" paper ☆16 · Aug 8, 2025 · Updated 6 months ago
- Notes on deep learning and NLP ☆27 · Jun 17, 2019 · Updated 6 years ago
- ☆38 · Jul 19, 2025 · Updated 6 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models" ☆10 · Jul 19, 2024 · Updated last year
- The project is an attempt to implement the paper Content Based Image Retrieval using Color Difference Histogram by Guang-Hai Liu et al. … ☆13 · Dec 16, 2020 · Updated 5 years ago
- A documentation automation system for SaMD and medical device software. Documentation-as-code for ISO62304 compliant development processe… ☆13 · Feb 8, 2026 · Updated last week
- In-browser Real-time Mask Detection | Deployment part. Based on NCNN and Web-Assembly. ☆28 · Nov 21, 2025 · Updated 2 months ago
- Numerical assessments of a nonintrusive surrogate model based on recurrent neural networks and proper orthogonal decomposition: Rayleigh … ☆10 · Dec 2, 2022 · Updated 3 years ago
- The official repo of INF-34B models trained by INF Technology.