Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"
☆12Oct 14, 2025Updated 5 months ago
Alternatives and similar repositories for SMS
Users that are interested in SMS are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] LoRA Recycle: Unlocking Tuning-Free Few-Shot Adaptability in Visual Foundation Models by Recycling Pre-Tuned LoRAs☆13Jun 20, 2025Updated 9 months ago
- ☆16Sep 27, 2023Updated 2 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 13, 2026Updated last week
- Here is some Python code that allows you to read in SVG files and approximate their paths using a Fourier series. The Fourier series can …☆20Apr 15, 2022Updated 3 years ago
- Official implementation of "OptMerge: Unifying Multimodal LLM Capabilities and Modalities via Model Merging".☆45Oct 30, 2025Updated 4 months ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆30Dec 24, 2025Updated 2 months ago
- GitHub Repository for KDD 2022 paper "Saliency-Regularized Deep Multi-Task Learning"☆12Sep 26, 2023Updated 2 years ago
- ☆12Jun 9, 2025Updated 9 months ago
- Statistics and Visualization of acceptance rate, main keyword of CVPR 2023 accepted papers for the main Computer Vision conference (CVPR)☆12May 4, 2023Updated 2 years ago
- ☆13Dec 17, 2021Updated 4 years ago
- This repository includes the code to reproduce our paper "Raw Differentiable Architecture Search for Speech Deepfake and Spoofing Detecti…☆11Jul 11, 2023Updated 2 years ago
- Spectral Tensor Train Parameterization of Deep Learning Layers☆17Jul 1, 2021Updated 4 years ago
- ☆18Apr 16, 2025Updated 11 months ago
- Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs☆23Nov 11, 2025Updated 4 months ago
- A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA.☆13Jan 13, 2026Updated 2 months ago
- ☆19Nov 7, 2022Updated 3 years ago
- The implementation of Continual Variational Autoencoder Learning via Online Cooperative Memorization☆12Jul 12, 2023Updated 2 years ago
- Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"☆30Jun 30, 2025Updated 8 months ago
- Zero-shot Learning by Generating Task-specific Adapters☆14Apr 2, 2021Updated 4 years ago
- [NeurIPS 2024] The official repository of "Distribution-Aware Data Expansion with Diffusion Models".☆16Dec 15, 2025Updated 3 months ago
- Feature Fusion for Online Mutual Knowledge Distillation Code☆27Jul 21, 2020Updated 5 years ago
- Korean Text Data Generator for OCR tasks.☆10Aug 20, 2020Updated 5 years ago
- codebase release for EMNLP2023 paper publication☆19Sep 18, 2025Updated 6 months ago
- ☆16Apr 6, 2023Updated 2 years ago
- Structured Neuron Level Pruning to compress Transformer-based models [ECCV'24]☆17Aug 7, 2024Updated last year
- ☆17Dec 11, 2022Updated 3 years ago
- 2019~2021年间Zero-shot/Data-free知识蒸馏的论文合集☆11Sep 8, 2021Updated 4 years ago
- Official This-Is-My Dataset published in CVPR 2023☆16Jul 18, 2024Updated last year
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆33Mar 5, 2024Updated 2 years ago
- Official code for Cumulative Spatial Knowledge Distillation for Vision Transformers (ICCV-2023) https://openaccess.thecvf.com/content/ICC…☆15Nov 5, 2023Updated 2 years ago
- ☆17Oct 7, 2022Updated 3 years ago
- Object Detection and Localization for RoboCup SSL using Jetson Nano☆16Oct 31, 2023Updated 2 years ago
- ☆29Nov 29, 2023Updated 2 years ago
- [AAAI-25 Oral] Adaptive Calibration☆15Jul 6, 2025Updated 8 months ago
- Code for the paper: Prompts have evil twins (EMNLP 2024)☆23Feb 10, 2025Updated last year
- Code for Tangent Model Composition for Ensembling and Continual Fine-tuning (ICCV 2023) and Tangent Transformers for Composition, Privacy…☆13May 14, 2024Updated last year
- ☆15Jul 1, 2024Updated last year
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 8 months ago
- ☆10Mar 2, 2024Updated 2 years ago