☆35Dec 5, 2022Updated 3 years ago
Alternatives and similar repositories for SAF
Users that are interested in SAF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch repository for ICLR 2022 paper (GSAM) which improves generalization (e.g. +3.8% top-1 accuracy on ImageNet with ViT-B/32)☆147Aug 23, 2022Updated 3 years ago
- [CVPR 2024] Friendly Sharpness-Aware Minimization☆35Oct 29, 2024Updated last year
- ☆22Jan 23, 2024Updated 2 years ago
- ☆58Feb 13, 2023Updated 3 years ago
- Code for AAAI 2024 paper: CR-SAM: Curvature Regularized Sharpness-Aware Minimization☆12Nov 29, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆11Dec 8, 2022Updated 3 years ago
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation☆48Jun 29, 2023Updated 2 years ago
- This is unofficial repository for Towards Efficient and Scalable Sharpness-Aware Minimization.☆37Apr 15, 2024Updated 2 years ago
- DiT (training + flow matching) in Jax☆11Jan 5, 2025Updated last year
- The official codes of our CVPR-2023 paper: Sharpness-Aware Gradient Matching for Domain Generalization☆80May 31, 2023Updated 2 years ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]☆29Sep 22, 2023Updated 2 years ago
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆40Mar 25, 2023Updated 3 years ago
- [ICLR 2023] Eva: Practical Second-order Optimization with Kronecker-vectorized Approximation☆12Jul 31, 2023Updated 2 years ago
- ☆18Aug 24, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [NeurIPS 2021 | AIJ 2024] Multi-Objective Meta Learning☆17Jul 31, 2024Updated last year
- ☆10Apr 24, 2022Updated 4 years ago
- Turn jitted jax functions back into python source code☆23Dec 16, 2024Updated last year
- Using a modified version of Werner Duvaud's MuZero implementation (https://github.com/werner-duvaud/muzero-general) this reinforcement ag…☆19Jun 30, 2021Updated 4 years ago
- SEAT☆21Oct 10, 2023Updated 2 years ago
- Darknet Neural Network Backend and Frontend for ONNX☆10Oct 12, 2018Updated 7 years ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆34Dec 24, 2025Updated 4 months ago
- Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"☆12Oct 14, 2025Updated 7 months ago
- "Towards Scaling Difference Target Propagation by Learning Backprop Targets" (ICML 2022)☆13Jan 17, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- GitHub Repository for KDD 2022 paper "Saliency-Regularized Deep Multi-Task Learning"☆12Sep 26, 2023Updated 2 years ago
- Resilient Model-Based RL by Regularizing Posterior Predictability☆22Mar 4, 2024Updated 2 years ago
- The official repo for CVPR2023 highlight paper "Gradient Norm Aware Minimization Seeks First-Order Flatness and Improves Generalization".☆84Jun 20, 2023Updated 2 years ago
- ☆13Jul 2, 2024Updated last year
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆12Sep 28, 2023Updated 2 years ago
- Spectral Tensor Train Parameterization of Deep Learning Layers☆17Jul 1, 2021Updated 4 years ago
- Codes for "Efficient Offline Policy Optimization with a Learned Model", ICLR2023☆30Jul 18, 2023Updated 2 years ago
- Source code of "Leaky Thoughts: Large Reasoning Models Are Not Private Thinkers" EMNLP 2025☆17Jan 12, 2026Updated 4 months ago
- ☆10Aug 20, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Experimental version of jxbz/agd implementing support for bias terms, affine parameters, transformers, etc.☆12Jul 30, 2023Updated 2 years ago
- ☆10Mar 25, 2024Updated 2 years ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated last year
- ☆14Nov 13, 2024Updated last year
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity"☆19Oct 19, 2023Updated 2 years ago
- ☆628May 12, 2026Updated last week
- The implementation for the paper `Byte-Pair Encoding for Text-to-SQL Generation`.☆14Feb 26, 2020Updated 6 years ago