GraphPKU / PiSSA
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models(NeurIPS 2024 Spotlight)
☆345Updated 2 months ago
Alternatives and similar repositories for PiSSA:
Users that are interested in PiSSA are comparing it to the libraries listed below
- ☆191Updated 5 months ago
- ☆132Updated 8 months ago
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).☆319Updated last year
- ☆216Updated 9 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆406Updated last year
- ☆219Updated 10 months ago
- State-of-the-art Parameter-Efficient MoE Fine-tuning Method☆156Updated 7 months ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆136Updated 2 months ago
- ☆172Updated 9 months ago
- ☆255Updated last year
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆600Updated last year
- ☆185Updated 2 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆318Updated 11 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆431Updated 6 months ago
- [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation☆763Updated 6 months ago
- ☆630Updated 2 weeks ago
- L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning☆190Updated last month
- Official PyTorch implementation of QA-LoRA☆131Updated last year
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆359Updated 3 months ago
- ☆99Updated 9 months ago
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning☆394Updated 11 months ago
- TokenSkip: Controllable Chain-of-Thought Compression in LLMs☆128Updated last month
- Awesome list for LLM pruning.☆222Updated 4 months ago
- Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"☆123Updated 11 months ago
- [ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Langua…☆407Updated 3 months ago
- An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT☆90Updated last month
- ☆178Updated this week
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆199Updated 11 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆186Updated 4 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆207Updated last month