zzp1012/SAM-in-Late-Phase

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zzp1012/SAM-in-Late-Phase)

zzp1012 / SAM-in-Late-Phase

[ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training"

☆19

Alternatives and similar repositories for SAM-in-Late-Phase

Users that are interested in SAM-in-Late-Phase are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zzp1012 / Cross-Task-Linearity
View on GitHub
[ICML 2024] Code release for "On the Emergence of Cross-Task Linearity in Pretraining-Finetuning Paradigm"
☆11Feb 20, 2025Updated last year
zzp1012 / LLFC
View on GitHub
[NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity"
☆19Oct 19, 2023Updated 2 years ago
nblt / Flat-LoRA
View on GitHub
[ICML 2025] Flat-LoRA: Low-Rank Adaptation over a Flat Loss Landscape
☆18May 18, 2025Updated last year
czhang024 / ParallelControl
View on GitHub
ICML-2025 (Spotlight) "From Weight-Based to State-Based Fine-Tuning: Further Memory Reduction on LoRA with Parallel Control"
☆15May 7, 2026Updated 2 months ago
FFTYYY / RaanA
View on GitHub
Implementation of "RaanA: A Fast, Flexible, and Data-Efficient Post-Training Quantization Algorithm"
☆17Apr 11, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
WeiHuang05 / Awesome_Large_Foundation_Model_Theory
View on GitHub
Welcome to the 'In Context Learning Theory' Reading Group
☆31Nov 8, 2024Updated last year
byeonghu-na / vae-pu
View on GitHub
Official Tensorflow implementation for Deep Generative Positive-Unlabeled Learning under Selection Bias (VAE-PU) in CIKM 2020.
☆15Dec 11, 2021Updated 4 years ago
tmlr-group / G-effect
View on GitHub
[ICLR 2025] "Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond"
☆16Feb 27, 2025Updated last year
wxr99 / HolisticPU
View on GitHub
Beyond Myopia: Learning from Positive and Unlabeled Data through Holistic Predictive Trends [NeurIPS 2023]
☆10Jan 28, 2024Updated 2 years ago
aadityasingh / icl-dynamics
View on GitHub
☆26Feb 20, 2026Updated 5 months ago
liaoning97 / FineRMoE
View on GitHub
The official code of FineRMoE.
☆21Mar 17, 2026Updated 4 months ago
saic-fi / LFA
View on GitHub
[ICCV 2023] Black Box Few-Shot Adaptation for Vision-Language models
☆27May 14, 2024Updated 2 years ago
WeiHuang05 / Awesome-Feature-Learning-in-Deep-Learning-Thoery
View on GitHub
Welcome to the Awesome Feature Learning in Deep Learning Thoery Reading Group! This repository serves as a collaborative platform for sch…
☆211Apr 13, 2026Updated 3 months ago
MediaBrain-SJTU / OC_LT
View on GitHub
Official code base for "Long-Tailed Diffusion Models With Oriented Calibration" ICLR2024
☆19Jul 11, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
aNOnWhyMooS / connectivity
View on GitHub
☆18Jan 17, 2024Updated 2 years ago
jonathanwilton / PUExtraTrees
View on GitHub
uPU, nnPU and PN learning with Extra Trees classifier.
☆20Dec 2, 2024Updated last year
guangxinsuuu / Positive-and-Unlabeled-Learning-from-Imbalanced-Data
View on GitHub
Code for the paper named "Positive-Unlabeled Learning from Imbalanced Data" which has been accepted by IJCAI-21
☆16Sep 14, 2021Updated 4 years ago
allenai / signal-and-noise
View on GitHub
Measuring the Signal to Noise Ratio in Language Model Evaluation
☆31Aug 19, 2025Updated 11 months ago
Thinklab-SJTU / Fast-T2T
View on GitHub
[NeurIPS2024] Fast T2T: Optimization Consistency Speeds Up Diffusion-Based Training-to-Testing Solving for Combinatorial Optimization; [N…
☆22Jul 2, 2025Updated last year
LeiBAI / Paper-Writing-Rebuttal
View on GitHub
Some thoughts about writing scientific papers
☆23Nov 8, 2024Updated last year
MinghuiChen43 / awesome-deep-phenomena
View on GitHub
A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...
☆404Jul 21, 2026Updated last week
chenjianhuii / Mechanistic-Data-Attribution
View on GitHub
☆16May 25, 2026Updated 2 months ago
napoles-uach / Medium_Mol
View on GitHub
☆11Apr 22, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
facebookresearch / SIMAT
View on GitHub
codebase for the SIMAT dataset and evaluation
☆39Feb 16, 2022Updated 4 years ago
matanr / Memories_of_Forgotten_Concepts
View on GitHub
Evaluation of concept erasing diffusion models should include latent likelihood
☆22Nov 3, 2025Updated 8 months ago
uuujf / SGDNoise
View on GitHub
[ICML 2019] The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects
☆15Apr 12, 2020Updated 6 years ago
Olivia-fsm / DoGE
View on GitHub
Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"
☆21Feb 29, 2024Updated 2 years ago
eepperly / Randomly-Pivoted-Cholesky
View on GitHub
Code for the paper "Randomly pivoted Cholesky: Practical approximation of a kernel matrix with few entry evaluations"
☆36Dec 4, 2025Updated 7 months ago
Furyton / awesome-language-model-analysis
View on GitHub
This paper list focuses on the theoretical and empirical analysis of language models, especially large language models (LLMs). The papers…
☆101Updated this week
Shevlev / Co-occurrence-Neural-Network
View on GitHub
☆10Jun 3, 2019Updated 7 years ago
ul-fmf / mlfmf-data
View on GitHub
Machine Learning for Mathematical Formalization
☆11Jul 20, 2024Updated 2 years ago
tmlr-group / CoPA
View on GitHub
[NeurIPS 2024] "Mind the Gap between Prototypes and Images in Cross-domain Finetuning"
☆11Nov 15, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
luli-git / MAP
View on GitHub
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
☆18Sep 2, 2024Updated last year
Ray-rui / Dist-PU-Positive-Unlabeled-Learning-from-a-Label-Distribution-Perspective
View on GitHub
PyTorch implementation of Dist-PU (CVPR 2022)
☆33Jun 19, 2022Updated 4 years ago
EnnengYang / RepresentationSurgery
View on GitHub
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆49Oct 10, 2024Updated last year
YongHyun-Ahn / LINe-Out-of-Distribution-Detection-by-Leveraging-Important-Neurons
View on GitHub
LINe: Out-of-Distribution Detection by Leveraging Important Neurons (CVPR 2023)
☆13Jun 13, 2023Updated 3 years ago
Raiden-Zhu / ICML-2023-DSGD-and-SAM
View on GitHub
[ICML 2023] Decentralized SGD and Average-direction SAM are Asymptotically Equivalent
☆20Dec 4, 2023Updated 2 years ago
ZFancy / awesome-activation-engineering
View on GitHub
A curated list of resources for activation engineering
☆139Oct 2, 2025Updated 9 months ago
HC-Feynman / vpu
View on GitHub
A PyTorch implementation of the Variational approach for PU learning
☆31Oct 11, 2020Updated 5 years ago