mit-han-lab / offsite-tuningView external linksLinks
Offsite-Tuning: Transfer Learning without Full Model
☆385Nov 27, 2023Updated 2 years ago
Alternatives and similar repositories for offsite-tuning
Users that are interested in offsite-tuning are comparing it to the libraries listed below
Sorting:
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆1,607Jul 12, 2024Updated last year
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆91Mar 16, 2023Updated 2 years ago
- A method to increase the speed and lower the memory footprint of existing vision transformers.☆1,170Jun 17, 2024Updated last year
- [CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language☆1,342Oct 5, 2023Updated 2 years ago
- Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning" (ICLR 2022)☆543Mar 24, 2022Updated 3 years ago
- A Unified Library for Parameter-Efficient and Modular Transfer Learning☆2,802Oct 12, 2025Updated 4 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆40Nov 11, 2024Updated last year
- ACL 2023☆39Jun 6, 2023Updated 2 years ago
- UniTAB: Unifying Text and Box Outputs for Grounded VL Modeling, ECCV 2022 (Oral Presentation)☆89Jun 12, 2023Updated 2 years ago
- ☆646Aug 4, 2023Updated 2 years ago
- ICML'2022: Black-Box Tuning for Language-Model-as-a-Service & EMNLP'2022: BBTv2: Towards a Gradient-Free Future with Large Language Model…☆271Nov 8, 2022Updated 3 years ago
- [ICLR 2024] Fine-tuning LLaMA to follow Instructions within 1 Hour and 1.2M Parameters☆5,936Mar 14, 2024Updated last year
- [IJCV] FastComposer: Tuning-Free Multi-Subject Image Generation with Localized Attention☆714Jan 10, 2025Updated last year
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333☆1,143Jan 11, 2024Updated 2 years ago
- ☆10Aug 19, 2023Updated 2 years ago
- SILO Language Models code repository☆83Feb 23, 2024Updated last year
- Serving multiple LoRA finetuned LLM as one☆1,139May 8, 2024Updated last year
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning☆409May 17, 2024Updated last year
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆73Jul 7, 2022Updated 3 years ago
- [ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆82Feb 7, 2026Updated last week
- code release of research paper "Exploring Long-Sequence Masked Autoencoders"☆100Oct 14, 2022Updated 3 years ago
- ☆65Jun 2, 2023Updated 2 years ago
- [ICLR 2024] Efficient Streaming Language Models with Attention Sinks☆7,188Jul 11, 2024Updated last year
- Paper List for In-context Learning 🌷☆20Jan 3, 2023Updated 3 years ago
- General technology for enabling AI capabilities w/ LLMs and MLLMs☆4,284Dec 22, 2025Updated last month
- PyTorch codes for "LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning"☆242Jan 20, 2023Updated 3 years ago
- OpenICL is an open-source framework to facilitate research, development, and prototyping of in-context learning.☆584Oct 3, 2023Updated 2 years ago
- Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".☆871Aug 20, 2024Updated last year
- Grounded Language-Image Pre-training☆2,573Jan 24, 2024Updated 2 years ago
- A simple and effective LLM pruning approach.☆848Aug 9, 2024Updated last year
- Enhancing Efficiency in Multidevice Federated Learning through Data Selection☆13Apr 15, 2024Updated last year
- [NeurIPS 2022, T-PAMI 2023] Efficient Spatially Sparse Inference for Conditional GANs and Diffusion Models☆268Mar 18, 2024Updated last year
- [MLSys 2024 Best Paper Award] AWQ: Activation-aware Weight Quantization for LLM Compression and Acceleration☆3,436Jul 17, 2025Updated 6 months ago
- Code and models for the paper "One Transformer Fits All Distributions in Multi-Modal Diffusion"☆1,473May 31, 2023Updated 2 years ago
- [ICML 2023] Exploring the Benefits of Training Expert Language Models over Instruction Tuning☆98Apr 26, 2023Updated 2 years ago
- Foundation Architecture for (M)LLMs☆3,130Apr 11, 2024Updated last year
- ☆47Apr 29, 2024Updated last year
- PyTorch code for "VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks" (CVPR2022)☆209Dec 18, 2022Updated 3 years ago
- 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.☆20,619Updated this week