[ICML 2023] Decentralized SGD and Average-direction SAM are Asymptotically Equivalent
☆20Dec 4, 2023Updated 2 years ago
Alternatives and similar repositories for ICML-2023-DSGD-and-SAM
Users that are interested in ICML-2023-DSGD-and-SAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Dec 23, 2022Updated 3 years ago
- Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356☆74Sep 10, 2020Updated 5 years ago
- ☆14May 31, 2025Updated 11 months ago
- cifiar10 联邦学习☆10Apr 6, 2020Updated 6 years ago
- Recycling Model Updates in Federated Learning: Are Gradient Subspaces Low-Rank?☆15Mar 24, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Short Course on Optimization for Machine Learning - Slides and Practical Labs - DS3 Data Science Summer School, June 24 to 28, 2019, Pari…☆20Jul 5, 2019Updated 6 years ago
- 自己动手实现的联邦学习相关代码☆10Oct 3, 2021Updated 4 years ago
- A Survey of Direct Preference Optimization (DPO)☆97Jul 4, 2025Updated 10 months ago
- This repository contains an experimental PyTorch implementation exploring the NoProp algorithm, presented in the paper "NOPROP: TRAINING …☆16Apr 19, 2026Updated 3 weeks ago
- ☆19Oct 6, 2024Updated last year
- [TPAMI] Interaction Pattern Disentangling for Multi-Agent Reinforcement Learning☆33May 17, 2024Updated last year
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"☆18Feb 20, 2025Updated last year
- Experiments for distributed optimization algorithms☆82May 24, 2023Updated 2 years ago
- Code for Federated Generalized Bayesian Learning via Distributed Stein Variational Gradient Descent☆10Nov 19, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- Evaluating and reproducing real-world robot manipulation policies (e.g., RT-1, RT-1-X, Octo) in simulation under common setups (e.g., Goo…☆11Dec 30, 2024Updated last year
- A weak supervision framework for (partial) labeling functions☆16Jul 15, 2024Updated last year
- ICLR 2023: Learning to Extrapolate: A Transductive Approach☆11Aug 15, 2023Updated 2 years ago
- Code repository for the AISTATS 2021 paper "Towards Understanding the Optimal Behaviors of Deep Active Learning Algorithms"☆14Mar 20, 2021Updated 5 years ago
- ManifoldNet Paper Implementation for SPD(n)☆11Nov 10, 2021Updated 4 years ago
- 这是一个基于Minist数据集的横向联邦学习实现☆16Apr 14, 2021Updated 5 years ago
- Associated codebase for Byzantine-resilient distributed / decentralized machine learning papers from INSPIRE Lab☆14Oct 11, 2021Updated 4 years ago
- ☆16Jun 29, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Confident Adaptive Transformers☆14Apr 18, 2021Updated 5 years ago
- [ICML 2019] The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects☆15Apr 12, 2020Updated 6 years ago
- [CVPR 2023] Federated Domain Generalization with Generalization Adjustment☆53Jun 14, 2023Updated 2 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆44Sep 11, 2023Updated 2 years ago
- The dataset and codes of the paper UniMod1K: Towards a More Universal Large-Scale Dataset and Benchmark for Multi-Modal Learning.☆17Sep 21, 2025Updated 7 months ago
- Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks☆17Jan 15, 2025Updated last year
- SeRL: Self-Play Reinforcement Learning for Large Language Models with Limited Data☆23Jan 24, 2026Updated 3 months ago
- Transformer Doctor: Diagnosing and Treating Vision Transformers☆11Jan 15, 2025Updated last year
- Fine-grained attention in hierarchical transformers for tabular time-series.☆12Dec 24, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Action recognition with STIP features and my own Fisher vector implementation☆14Mar 29, 2017Updated 9 years ago
- ☆19Dec 31, 2025Updated 4 months ago
- Unofficial pytorch implementation of Piecewise Linear Unit dynamic activation function☆18Feb 8, 2023Updated 3 years ago
- Velocity Obstacle for Polytopic Collision Avoidance for Distributed Multi-Robot Systems (RA-L 2023)☆20Jul 20, 2025Updated 9 months ago
- 联邦学习论文以及论文笔记☆21Dec 16, 2020Updated 5 years ago
- ☆14Jan 3, 2025Updated last year
- LDS-toolbox: a matlab toolbox for linear dynamical systems (LDSs) modeling☆13Mar 23, 2018Updated 8 years ago