Implementation of DoRA
☆307 · Jun 7, 2024 · Updated last year
Alternatives and similar repositories for dora
Users who are interested in dora are comparing it to the libraries listed below.
- LoRA and DoRA from Scratch Implementations ☆218 · Mar 5, 2024 · Updated 2 years ago
- Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation" ☆124 · Apr 28, 2024 · Updated last year
- ☆233 · Jun 24, 2024 · Updated last year
- ☆202 · Dec 5, 2024 · Updated last year
- [EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models ☆85 · Mar 5, 2024 · Updated 2 years ago
- [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation ☆943 · Oct 1, 2024 · Updated last year
- Training LLMs with QLoRA + FSDP ☆1,540 · Nov 9, 2024 · Updated last year
- ☆274 · Oct 31, 2023 · Updated 2 years ago
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection ☆1,684 · Oct 28, 2024 · Updated last year
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates ☆474 · Apr 21, 2024 · Updated last year
- The official implementation of Self-Play Fine-Tuning (SPIN) ☆1,234 · May 8, 2024 · Updated last year
- Codebase for Merging Language Models (ICML 2024) ☆864 · May 5, 2024 · Updated last year
- MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning ☆361 · Aug 7, 2024 · Updated last year
- ☆235 · Jun 11, 2024 · Updated last year
- Cascade Speculative Drafting ☆33 · Apr 2, 2024 · Updated last year
- A pytorch quantization backend for optimum ☆1,032 · Nov 21, 2025 · Updated 4 months ago
- Implementation of Spectral State Space Models ☆16 · Feb 23, 2024 · Updated 2 years ago
- ☆24 · Dec 18, 2023 · Updated 2 years ago
- This is a repository for "PMET: Precise Model Editing in a Transformer" ☆55 · Sep 28, 2023 · Updated 2 years ago
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri… ☆1,449 · Updated this week
- Accessible large language models via k-bit quantization for PyTorch. ☆8,052 · Updated this week
- Stanford NLP Python library for Representation Finetuning (ReFT) ☆1,564 · Mar 5, 2026 · Updated 2 weeks ago
- Merge safetensor files using the technique described in "Language Models are Super Mario: Absorbing Abilities from Homologous Models as a… ☆82 · Oct 17, 2024 · Updated last year
- PyTorch native post-training library ☆5,707 · Updated this week
- Low-Rank adapter extraction for fine-tuned transformers models ☆181 · May 2, 2024 · Updated last year
- ☆177 · Jul 22, 2024 · Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite. ☆34 · Mar 2, 2024 · Updated 2 years ago
- Token Omission Via Attention ☆127 · Oct 13, 2024 · Updated last year
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters ☆1,903 · Jan 21, 2024 · Updated 2 years ago
- Let's create synthetic textbooks together :) ☆76 · Jan 29, 2024 · Updated 2 years ago
- Utilities for PyTorch distributed ☆25 · Feb 27, 2025 · Updated last year
- Schedule-Free Optimization in PyTorch ☆2,265 · May 21, 2025 · Updated 10 months ago
- The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction ☆390 · Jul 9, 2024 · Updated last year
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023). ☆372 · Jun 1, 2023 · Updated 2 years ago
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning ☆411 · May 17, 2024 · Updated last year
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs. ☆462 · Apr 18, 2024 · Updated last year
- Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral) ☆2,697 · Aug 14, 2024 · Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.) ☆32 · May 25, 2024 · Updated last year
- ☆18 · Mar 18, 2024 · Updated 2 years ago