Implementation of DoRA
☆310Jun 7, 2024Updated 2 years ago
Alternatives and similar repositories for dora
Users that are interested in dora are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LoRA and DoRA from Scratch Implementations☆222Mar 5, 2024Updated 2 years ago
- Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"☆122Apr 28, 2024Updated 2 years ago
- ☆233Jun 24, 2024Updated last year
- ☆206Dec 5, 2024Updated last year
- [EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models☆86Mar 5, 2024Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [ICML2024 (Oral)] Official PyTorch implementation of DoRA: Weight-Decomposed Low-Rank Adaptation☆975Mar 24, 2026Updated 2 months ago
- Training LLMs with QLoRA + FSDP☆1,548Nov 9, 2024Updated last year
- ☆276Oct 31, 2023Updated 2 years ago
- GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection☆1,695Oct 28, 2024Updated last year
- Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates☆473Apr 21, 2024Updated 2 years ago
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,241May 8, 2024Updated 2 years ago
- Codebase for Merging Language Models (ICML 2024)☆868May 5, 2024Updated 2 years ago
- MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning☆362Aug 7, 2024Updated last year
- ☆235Jun 11, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Cascade Speculative Drafting☆33Apr 2, 2024Updated 2 years ago
- A pytorch quantization backend for optimum☆1,042Jun 4, 2026Updated last week
- Implementation of Spectral State Space Models☆16Feb 23, 2024Updated 2 years ago
- ☆23Dec 18, 2023Updated 2 years ago
- This is a repository for "PMET: Precise Model Editing in a Transformer"☆57Sep 28, 2023Updated 2 years ago
- PyTorch compiler that accelerates training and inference. Get built-in optimizations for performance, memory, parallelism, and easily wri…☆1,460Jun 1, 2026Updated last week
- Stanford NLP Python library for Representation Finetuning (ReFT)☆1,569Mar 5, 2026Updated 3 months ago
- Accessible large language models via k-bit quantization for PyTorch.☆8,258Updated this week
- Low-Rank adapter extraction for fine-tuned transformers models☆181May 2, 2024Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- PyTorch native post-training library☆5,768Updated this week
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,409Apr 11, 2024Updated 2 years ago
- ☆179Jul 22, 2024Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Mar 2, 2024Updated 2 years ago
- Token Omission Via Attention☆129Oct 13, 2024Updated last year
- S-LoRA: Serving Thousands of Concurrent LoRA Adapters☆1,914Jan 21, 2024Updated 2 years ago
- Let's create synthetic textbooks together :)☆76Jan 29, 2024Updated 2 years ago
- Utilities for PyTorch distributed☆25Feb 27, 2025Updated last year
- Schedule-Free Optimization in PyTorch☆2,299May 18, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction☆393Jul 9, 2024Updated last year
- AdaLoRA: Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning (ICLR 2023).☆388Jun 1, 2023Updated 3 years ago
- Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch☆547May 16, 2025Updated last year
- Official repository of NEFTune: Noisy Embeddings Improves Instruction Finetuning☆411May 17, 2024Updated 2 years ago
- Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.☆468Apr 18, 2024Updated 2 years ago
- Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)☆2,692Aug 14, 2024Updated last year
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated 2 years ago