☆14May 4, 2024Updated last year
Alternatives and similar repositories for Random-Masking-Finds-Winning-Tickets-for-Parameter-Efficient-Fine-tuning
Users that are interested in Random-Masking-Finds-Winning-Tickets-for-Parameter-Efficient-Fine-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated last year
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024.☆16Oct 28, 2024Updated last year
- [ICLR 2026] Official Implementation of ProxyThinker: Test-Time Guidance through Small Visual Reasoners.☆21Sep 24, 2025Updated 6 months ago
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆45Feb 13, 2024Updated 2 years ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆31Jun 7, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆52Jan 1, 2024Updated 2 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- The implementation for FREE-Merging: Fourier Transform for Model Merging with Lightweight Experts (ICCV25)☆14Jun 26, 2025Updated 9 months ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- This is the open-source code for TokenCarve.☆26Jan 23, 2026Updated 2 months ago
- Official repository for Activation-Informed Merging (AIM) of Large Language Models☆22Feb 10, 2025Updated last year
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- Efficient empirical NTKs in PyTorch☆22Jun 13, 2022Updated 3 years ago
- Pytorch implementation for DA-VPT (CVPR2025)☆17Dec 15, 2025Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- acnn for text-independent speaker recognition☆10Feb 8, 2022Updated 4 years ago
- A much powerful probing method to tune your model with promising performance and linear probing training cost!☆15Jul 26, 2023Updated 2 years ago
- ☆10Apr 16, 2024Updated 2 years ago
- ☆10Dec 22, 2023Updated 2 years ago
- ☆10Jul 6, 2021Updated 4 years ago
- Source code for "Taming GANs with Lookahead–Minmax", ICLR 2021.☆15Mar 28, 2021Updated 5 years ago
- My dot files and automatic environment deployment scripts☆14Updated this week
- ☆12Oct 24, 2023Updated 2 years ago
- [ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang☆14Jan 4, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Official code of MoSA (Mixture of Sparse Adapters).☆13Dec 14, 2023Updated 2 years ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- python programs and procedures that facilitate local application of the earth2observe global water resources reanalysis☆10Nov 21, 2017Updated 8 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆18Dec 17, 2025Updated 4 months ago
- JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"☆19Jun 10, 2023Updated 2 years ago
- ☆74Dec 7, 2024Updated last year
- The code for paper "ProQA: Structural Prompt-based Pre-training for Unified Question Answering"☆11Feb 7, 2023Updated 3 years ago
- Implementations of the algorithms described in the paper: On the Convergence Theory for Hessian-Free Bilevel Algorithms.☆11Nov 1, 2024Updated last year
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Optimization algorithm which fits a ResNet to CIFAR-10 5x faster than SGD / Adam (with terrible generalization)☆14Oct 20, 2023Updated 2 years ago
- CoT-Valve: Length-Compressible Chain-of-Thought Tuning☆92Feb 14, 2025Updated last year
- Cross Visual Prompt Tuning [ICCV 2025]☆13Aug 3, 2025Updated 8 months ago
- Code for Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities (NeurIPS'24)☆36Dec 17, 2024Updated last year
- Experiments with Super-Universal Newton method.☆13Aug 12, 2022Updated 3 years ago
- ☆20Oct 13, 2024Updated last year
- Randomized algorithm class at CU☆15Jul 8, 2025Updated 9 months ago