[ACL 2024 Findings] Light-PEFT: Lightening Parameter-Efficient Fine-Tuning via Early Pruning
☆13Sep 2, 2024Updated last year
Alternatives and similar repositories for Light-PEFT
Users that are interested in Light-PEFT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆19Jan 3, 2025Updated last year
- Structured Pruning Adapters in PyTorch☆19Aug 30, 2023Updated 2 years ago
- [ICLR'25] Code for KaSA, an official implementation of "KaSA: Knowledge-Aware Singular-Value Adaptation of Large Language Models"☆20Jan 16, 2025Updated last year
- Efficient Scaling laws and collaborative pretraining.☆21Sep 18, 2025Updated 6 months ago
- [ACL 2025] Official code for ''Learning to Reason from Feedback at Test-Time''.☆13May 16, 2025Updated 10 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- This is the repository of DEER, a Dynamic Early Exit in Reasoning method for Large Reasoning Language Models.☆181Jul 7, 2025Updated 8 months ago
- Residual vector quantization for KV cache compression in large language model☆12Oct 22, 2024Updated last year
- DCPO: Dynamic Adaptive Clipping for RL☆48Dec 20, 2025Updated 3 months ago
- Github repo for ICLR-2025 paper, Fine-tuning Large Language Models with Sparse Matrices☆25Feb 2, 2026Updated last month
- ☆21Feb 5, 2024Updated 2 years ago
- Code for our EMNLP-2022 paper: "Language Prior Is Not the Only Shortcut: A Benchmark for Shortcut Learning in VQA"☆40Nov 1, 2022Updated 3 years ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304☆14Oct 11, 2022Updated 3 years ago
- Official repository for FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models☆34Sep 19, 2025Updated 6 months ago
- [ICML 2025] Logits are All We Need to Adapt Closed Models☆21May 2, 2025Updated 10 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Official implementation of "TailorKV: A Hybrid Framework for Long-Context Inference via Tailored KV Cache Optimization" (Findings of ACL …☆21Jul 25, 2025Updated 8 months ago
- 包含了LLM的一些手撕代码,如强化学习。可以帮助从代码层面深入理解原理,以及有助于准备大模型面试可能出现的手撕。后续会更新Transformer等更多手撕☆79Mar 15, 2026Updated last week
- This repo contains the code for Late Prompt Tuning.☆12Dec 22, 2025Updated 3 months ago
- ☆10Feb 6, 2025Updated last year
- This repository presents the source code for the paper "MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Qu…☆23Apr 2, 2025Updated 11 months ago
- ☆14Oct 7, 2023Updated 2 years ago
- Model for processing text sequences with coreference annotations☆14Nov 29, 2018Updated 7 years ago
- ☆14Apr 18, 2020Updated 5 years ago
- ☆126Jul 6, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Finetuning LLaMA with DeepSpeed☆10Apr 14, 2023Updated 2 years ago
- Code for the paper "Knowledge-Aware Federated Active Learning with Non-IID Data", ICCV2023☆10Sep 8, 2023Updated 2 years ago
- ☆15Apr 30, 2022Updated 3 years ago
- ☆15Jan 27, 2026Updated last month
- Repository for the paper "Exploring Image Augmentations for Siamese Representation Learning with Chest X-Rays"☆11Aug 29, 2023Updated 2 years ago
- [ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"☆102Apr 10, 2024Updated last year
- Better coding experience for Flask☆16Oct 21, 2025Updated 5 months ago
- ☆65Jul 14, 2025Updated 8 months ago
- Learning adapter weights from task descriptions☆19Nov 12, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [CVPR 2025] LION-FS: Fast & Slow Video-Language Thinker as Online Video Assistant☆27Dec 2, 2025Updated 3 months ago
- An Efficient Supply Chain Management System using Blockchain & Machine Learning.☆10Nov 27, 2019Updated 6 years ago
- ☆31Jun 6, 2025Updated 9 months ago
- Rookie's guide☆12Aug 10, 2024Updated last year
- ☆56Jul 7, 2025Updated 8 months ago
- Official Implementation of SAM-Decoding: Speculative Decoding via Suffix Automaton☆45Feb 13, 2025Updated last year
- MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models.☆17Jul 20, 2023Updated 2 years ago