rasbt / dora-from-scratch
LoRA and DoRA from Scratch Implementations
☆202Updated last year
Alternatives and similar repositories for dora-from-scratch
Users that are interested in dora-from-scratch are comparing it to the libraries listed below
Sorting:
- Implementation of DoRA☆294Updated 11 months ago
- ☆219Updated 10 months ago
- An extension of the nanoGPT repository for training small MOE models.☆142Updated 2 months ago
- Q-GaLore: Quantized GaLore with INT4 Projection and Layer-Adaptive Low-Rank Gradients.☆199Updated 9 months ago
- minimal GRPO implementation from scratch☆90Updated 2 months ago
- The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".☆169Updated last month
- Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024☆294Updated last week
- ☆186Updated 3 months ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 6 months ago
- Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793☆412Updated this week
- ☆198Updated 5 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆256Updated 10 months ago
- Official PyTorch implementation of QA-LoRA☆135Updated last year
- A set of scripts and notebooks on LLM finetunning and dataset creation