GraphPKU / PiSSA
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models(NeurIPS 2024 Spotlight)
☆324Updated 2 weeks ago
Alternatives and similar repositories for PiSSA:
Users that are interested in PiSSA are comparing it to the libraries listed below
- ☆177Updated 4 months ago
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).☆291Updated last year
- [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation☆717Updated 4 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆284Updated 9 months ago
- ☆123Updated 6 months ago
- ☆217Updated 8 months ago
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆128Updated 2 weeks ago
- ☆212Updated 7 months ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆398Updated 10 months ago
- Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"☆123Updated 9 months ago
- ☆163Updated 7 months ago
- Implementation of DoRA☆290Updated 8 months ago
- [ICML 2024] LESS: Selecting Influential Data for Targeted Instruction Tuning☆412Updated 4 months ago
- Rectified Rotary Position Embeddings☆351Updated 9 months ago
- [ICLR 2024] Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning☆585Updated 11 months ago
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "☆102Updated 3 months ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆348Updated last month
- ☆251Updated last year
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆186Updated 5 months ago
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning☆389Updated 9 months ago
- [NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward☆821Updated this week
- MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning☆351Updated 6 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆145Updated 6 months ago
- A series of technical report on Slow Thinking with LLM☆411Updated last week
- [ECCV 2024 Oral] Code for paper: An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Langua…☆364Updated last month
- Codebase for Merging Language Models (ICML 2024)☆795Updated 9 months ago
- ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)☆575Updated last month
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]☆534Updated 2 months ago
- Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718☆307Updated 4 months ago