Low-Rank adapter extraction for fine-tuned transformers models
☆181May 2, 2024Updated last year
Alternatives and similar repositories for LoRD
Users that are interested in LoRD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆159Feb 9, 2024Updated 2 years ago
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated last year
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 2 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated last year
- A bagel, with everything.☆326Apr 11, 2024Updated last year
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆233Oct 31, 2024Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆91Feb 27, 2024Updated 2 years ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆263Apr 23, 2024Updated last year
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆271Jan 10, 2026Updated 2 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Mar 2, 2024Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆147Sep 10, 2023Updated 2 years ago
- ☆338Mar 5, 2026Updated 3 weeks ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Mar 1, 2024Updated 2 years ago
- The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction☆390Jul 9, 2024Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Apr 29, 2024Updated last year
- Tools for merging pretrained large language models.☆6,895Mar 15, 2026Updated last week
- ☆17Dec 16, 2024Updated last year
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Easily view and modify JSON datasets for large language models☆87May 16, 2025Updated 10 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 6 months ago
- Official repository for ORPO☆473May 31, 2024Updated last year
- A simple experiment on letting two local LLM have a conversation about anything!☆112Jul 3, 2024Updated last year
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- ☆11Dec 11, 2024Updated last year
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 6 months ago
- ☆13Feb 18, 2024Updated 2 years ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Collection of autoregressive model implementation☆85Feb 23, 2026Updated last month
- A framework for evaluating function calls made by LLMs☆40Jul 23, 2024Updated last year
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆21Feb 23, 2024Updated 2 years ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆253Oct 30, 2024Updated last year
- Let's create synthetic textbooks together :)☆76Jan 29, 2024Updated 2 years ago
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated last year
- Create Custom LLMs☆1,820Nov 8, 2025Updated 4 months ago