Outsider565 / LoRA-GA
☆179Updated 4 months ago
Alternatives and similar repositories for LoRA-GA:
Users that are interested in LoRA-GA are comparing it to the libraries listed below
- A generalized framework for subspace tuning methods in parameter efficient fine-tuning.☆128Updated 3 weeks ago
- Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized? "☆102Updated 4 months ago
- ☆165Updated 7 months ago
- PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models(NeurIPS 2024 Spotlight)☆326Updated last month
- ☆124Updated 7 months ago
- ☆95Updated 7 months ago
- LoRAMoE: Revolutionizing Mixture of Experts for Maintaining World Knowledge in Language Model Alignment☆290Updated 10 months ago
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).☆295Updated last year
- [NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning☆165Updated 3 months ago
- [SIGIR'24] The official implementation code of MOELoRA.☆147Updated 7 months ago
- ☆142Updated 5 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆114Updated 4 months ago
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆153Updated 8 months ago
- ☆71Updated this week
- Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"☆124Updated 10 months ago
- Implementation for "Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs"☆350Updated last month
- [ICML'24] The official implementation of “Rethinking Optimization and Architecture for Tiny Language Models”☆120Updated last month
- [ICLR 2024] CLEX: Continuous Length Extrapolation for Large Language Models☆76Updated 11 months ago
- Rectified Rotary Position Embeddings☆353Updated 9 months ago
- [EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs☆241Updated 2 months ago
- [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning☆143Updated 5 months ago
- A repository sharing the literatures about long-context large language models, including the methodologies and the evaluation benchmarks☆256Updated 7 months ago
- A simple toolkit for benchmarking LLMs on mathematical reasoning tasks. 🧮✨☆176Updated 10 months ago
- [EMNLP 2024 Findings🔥] Official implementation of "LOOK-M: Look-Once Optimization in KV Cache for Efficient Multimodal Long-Context Infe…☆91Updated 3 months ago
- A prototype repo for hybrid training of pipeline parallel and distributed data parallel with comments on core code snippets. Feel free to…☆55Updated last year
- [ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy"☆73Updated 8 months ago