Switch EMA: A Free Lunch for Better Flatness and Sharpness
☆28Feb 16, 2024Updated 2 years ago
Alternatives and similar repositories for SEMA
Users that are interested in SEMA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR'25] MergeVQ: A Unified Framework for Visual Generation and Representation with Token Merging and Quantization☆49Jul 22, 2025Updated 10 months ago
- Official Implementation for NorMuon paper☆71Apr 30, 2026Updated 3 weeks ago
- Code and results accompanying our paper titled Leveraging Unlabeled Data to Predict Out-of-Distribution Performance at ICLR 2022☆10Dec 8, 2022Updated 3 years ago
- [ECCV 2024] Official implementation of Multiscale Graph Texture Network☆18Oct 1, 2024Updated last year
- A tiny paper rating web☆40Mar 19, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- TPU에서 한국어용 LLM 추론을 위한 Jax/Flax 구 현체입니다.☆12Jun 12, 2023Updated 2 years ago
- 커버리스트 - 북 커버 생성 AI 서비스☆13Sep 11, 2022Updated 3 years ago
- Serving large language model with transformers☆13Oct 18, 2022Updated 3 years ago
- a Jax/Flax inference code of StarCoder☆12Jun 12, 2023Updated 2 years ago
- 2021 NIPA 한국인 헤어스타일 경진대회 2등 솔루션 자료입니다.☆23Sep 16, 2021Updated 4 years ago
- 🥈12th place solution on G2Net Detecting Continuous Gravitational Waves🥈☆14Jan 4, 2023Updated 3 years ago
- Submission to the inverse scaling prize☆23Jul 23, 2023Updated 2 years ago
- ☆58Feb 13, 2023Updated 3 years ago
- An official repository for GPTailor☆18Jun 29, 2025Updated 10 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [COLM 2024] Early Weight Averaging meets High Learning Rates for LLM Pre-training☆19Oct 12, 2024Updated last year
- The offical repository of "IPMix: Label-Preserving Data Augmentation Method for Training Robust Classifiers"☆15May 7, 2024Updated 2 years ago
- ☆56Nov 26, 2024Updated last year
- 🎖️ 5th place solution in the Google American Sign Language Fingerspelling Recognition Competition🎖️☆16Sep 19, 2023Updated 2 years ago
- Official PyTorch Implementation for Paper "No More Adam: Learning Rate Scaling at Initialization is All You Need"☆55Jan 27, 2025Updated last year
- [arXiv 2024] Is Oracle Pruning the True Oracle?☆26Jan 10, 2025Updated last year
- Accompanying code for "Analyzing Vision Tranformers in Class Embedding Space" (NeurIPS '23)☆15Jun 10, 2024Updated last year
- 🥇 LG-AI-Challenge 2022 1위 솔루션 입니다.☆13Jun 6, 2023Updated 2 years ago
- ☆14Jan 12, 2026Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A minimal, educational HEVC (H.265) encoder written in Python.☆51Feb 23, 2026Updated 3 months ago
- A paper list about Token Merge, Reduce, Resample, Drop for MLLMs.☆89Oct 26, 2025Updated 7 months ago
- Repository for Skill Set Optimization☆14Jul 26, 2024Updated last year
- Project to visualize the kernels and the outputs of the individual layers of a CNN built in pytorch.☆18Feb 28, 2020Updated 6 years ago
- ☆13Oct 8, 2021Updated 4 years ago
- Official Pytorch Implementation of Self-emerging Token Labeling☆35Mar 27, 2024Updated 2 years ago
- Code for the ICLR'24 paper "Self-supervised Representation Learning From Random Data Projectors☆16Mar 16, 2024Updated 2 years ago
- Recipes for some popular Rust tools☆14Jul 11, 2025Updated 10 months ago
- Saliency Toolbox☆20Sep 17, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- The official repository of paper "ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection" (N…☆50Oct 23, 2023Updated 2 years ago
- WideRange4D: Enabling High-Quality 4D Reconstruction with Wide-Range Movements and Scenes☆110Mar 19, 2025Updated last year
- Evaluating language models on word puzzle games☆10Oct 25, 2024Updated last year
- Code for "The Unreasonable Effectiveness of Linear Prediction as a Perceptual Metric"☆24Jan 26, 2024Updated 2 years ago
- Supplementary repository for the Emognition Wearable Dataset 2020☆20Jan 29, 2024Updated 2 years ago
- Jax/Flax implementation of DeiT and DeiT-III (ViT)☆19Dec 21, 2024Updated last year
- Official implementation of MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis☆86Jul 16, 2024Updated last year