Low-Rank adapter extraction for fine-tuned transformers models
☆181May 2, 2024Updated 2 years ago
Alternatives and similar repositories for LoRD
Users who are interested in LoRD are comparing it to the libraries listed below.
- Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…☆159Feb 9, 2024Updated 2 years ago
- 5X faster 60% less memory QLoRA finetuning☆21May 28, 2024Updated last year
- This is our own implementation of 'Layer Selective Rank Reduction'☆240May 26, 2024Updated 2 years ago
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 3 years ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Jun 21, 2023Updated 2 years ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks☆31May 22, 2024Updated 2 years ago
- A bagel, with everything.☆326Apr 11, 2024Updated 2 years ago
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆234Oct 31, 2024Updated last year
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆92Feb 27, 2024Updated 2 years ago
- Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models☆267Apr 23, 2024Updated 2 years ago
- Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…☆276Jan 10, 2026Updated 4 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Mar 2, 2024Updated 2 years ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆151Sep 10, 2023Updated 2 years ago
- ☆345Mar 5, 2026Updated 2 months ago
- implementation of https://arxiv.org/pdf/2312.09299☆21Jul 3, 2024Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 2401.01335)☆29Mar 1, 2024Updated 2 years ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Apr 29, 2024Updated 2 years ago
- The Truth Is In There: Improving Reasoning in Language Models with Layer-Selective Rank Reduction☆392Jul 9, 2024Updated last year
- Tools for merging pretrained large language models.☆7,100May 6, 2026Updated 2 weeks ago
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,409Apr 11, 2024Updated 2 years ago
- 🚀 Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platform☆38Jan 29, 2024Updated 2 years ago
- ☆16Dec 16, 2024Updated last year
- Easily view and modify JSON datasets for large language models☆88May 16, 2025Updated last year
- QLoRA with Enhanced Multi GPU Support☆38Aug 8, 2023Updated 2 years ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 8 months ago
- Official repository for ORPO☆482May 31, 2024Updated last year
- A simple experiment on letting two local LLMs have a conversation about anything!☆113Jul 3, 2024Updated last year
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- Full finetuning of large language models without large memory requirements☆94Sep 22, 2025Updated 8 months ago
- ☆11Dec 11, 2024Updated last year
- ☆13Feb 18, 2024Updated 2 years ago
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- Collection of autoregressive model implementation☆85Feb 23, 2026Updated 3 months ago
- A framework for evaluating function calls made by LLMs☆40Jul 23, 2024Updated last year
- Code accompanying the paper "A Language Model's Guide Through Latent Space". It contains functionality for training and using concept vec…☆21Feb 23, 2024Updated 2 years ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆257Oct 30, 2024Updated last year
- Let's create synthetic textbooks together :)☆76Jan 29, 2024Updated 2 years ago
- [WIP] Transformer to embed Danbooru labelsets☆13Mar 31, 2024Updated 2 years ago