Westlake-AI / SEMA
Switch EMA: A Free Lunch for Better Flatness and Sharpness
☆28Updated last year
Alternatives and similar repositories for SEMA
Users that are interested in SEMA are comparing it to the libraries listed below
- ☆54Updated 2 years ago
- An official PyTorch implementation for CLIPPR☆29Updated 2 years ago
- This is a PyTorch implementation of the paper ViP: A Differentially Private Foundation Model for Computer Vision☆36Updated 2 years ago
- Code release for paper Extremely Simple Activation Shaping for Out-of-Distribution Detection☆54Updated last year
- Original code base for On Pretraining Data Diversity for Self-Supervised Learning☆14Updated 10 months ago
- ☆58Updated 2 years ago
- Code for T-MARS data filtering☆35Updated 2 years ago
- ☆53Updated last year
- Code base of SynthCLIP: CLIP training with purely synthetic text-image pairs from LLMs and TTIs.☆101Updated 7 months ago
- [AAAI 2025] Does VLM Classification Benefit from LLM Description Semantics?☆23Updated 3 months ago
- Visualizing representations with diffusion based conditional generative model.☆102Updated 2 years ago
- This is an official PyTorch/GPU implementation of SupMAE.☆79Updated 3 years ago
- Code release for "Improved baselines for vision-language pre-training"☆61Updated last year
- (ICML 2022) Official PyTorch implementation of “Blurs Behave Like Ensembles: Spatial Smoothings to Improve Accuracy, Uncertainty, and Rob…☆79Updated 3 years ago
- Repo for the paper "Extrapolating from a Single Image to a Thousand Classes using Distillation"☆37Updated last year
- Minimal multi-gpu implementation of EDM2: "Analyzing and Improving the Training Dynamics of Diffusion Models"☆39Updated last year
- Gradient-Free Textual Inversion for Personalized Text-to-Image Generation☆43Updated 2 years ago
- [CVPR 2025] Official PyTorch implementation of MaskSub "Masking meets Supervision: A Strong Learning Alliance"☆45Updated 7 months ago
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…☆50Updated 2 years ago
- [ICML24] Official Implementation of "ETHER: Efficient Finetuning of Large-Scale Models with Hyperplane Reflections"☆16Updated last year
- ☆39Updated last year
- Official PyTorch Implementation for the "Distilling Datasets Into Less Than One Image" paper.☆38Updated last year
- Code for experiments for "ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy"☆101Updated last year
- Repository for the PopulAtion Parameter Averaging (PAPA) paper☆28Updated last year
- Official code and data for NeurIPS 2023 paper "ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial …☆40Updated last year
- Code and benchmark for the paper: "A Practitioner's Guide to Continual Multimodal Pretraining" [NeurIPS'24]☆60Updated 11 months ago
- Implementation for <Orthogonal Over-Parameterized Training> in CVPR'21.☆22Updated 4 years ago
- ☆35Updated last year
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆32Updated last year
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization☆37Updated last year