Outsider565 / LoRA-GA
☆171Updated 3 months ago
Alternatives and similar repositories for LoRA-GA:
Users who are interested in LoRA-GA are comparing it to the libraries listed below.
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆120Updated 3 weeks ago
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized?"☆96Updated 3 months ago
- ☆162Updated 6 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆273Updated 9 months ago
- ☆122Updated 6 months ago
- PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models (NeurIPS 2024 Spotlight)☆312Updated last week
- ☆92Updated 6 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆143Updated 6 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆107Updated 2 months ago
- Rectified Rotary Position Embeddings☆348Updated 8 months ago
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).☆288Updated last year
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆154Updated 7 months ago
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆157Updated last month
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆335Updated last week
- A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to…☆53Updated last year
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆238Updated last month
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆120Updated 2 weeks ago
- Low-bit optimizers for PyTorch☆125Updated last year
- ☆186Updated last year
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆76Updated 10 months ago
- ☆137Updated 4 months ago
- A series of technical reports on Slow Thinking with LLM☆359Updated this week
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆138Updated 4 months ago
- Adds Sequence Parallelism to LLaMA-Factory☆146Updated 2 weeks ago
- ☆94Updated 4 months ago
- Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models☆239Updated 4 months ago
- Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)☆173Updated 4 months ago
- ☆250Updated last year
- Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"☆123Updated 9 months ago
- ☆95Updated 9 months ago