sabetAI/BLoRA

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sabetAI/BLoRA)

sabetAI / BLoRA

batched loras

☆350

Alternatives and similar repositories for BLoRA

Users that are interested in BLoRA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

S-LoRA / S-LoRA
View on GitHub
S-LoRA: Serving Thousands of Concurrent LoRA Adapters
☆1,920Jan 21, 2024Updated 2 years ago
punica-ai / punica
View on GitHub
Serving multiple LoRA finetuned LLM as one
☆1,167May 8, 2024Updated 2 years ago
TUDB-Labs / mLoRA
View on GitHub
An Efficient "Factory" to Build Multiple LoRA Adapters
☆383Feb 13, 2025Updated last year
VikParuchuri / textbook_quality
View on GitHub
Generate textbook-quality synthetic LLM pretraining data
☆508Oct 19, 2023Updated 2 years ago
SkunkworksAI / hydra-moe
View on GitHub
☆416Nov 2, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sail-sg / lorahub
View on GitHub
[COLM 2024] LoraHub: Efficient Cross-Task Generalization via Dynamic LoRA Composition
☆671Jul 22, 2024Updated 2 years ago
FasterDecoding / Medusa
View on GitHub
Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
☆2,758Jun 25, 2024Updated 2 years ago
jquesnelle / yarn
View on GitHub
YaRN: Efficient Context Window Extension of Large Language Models
☆1,740Apr 17, 2024Updated 2 years ago
IST-DASLab / qmoe
View on GitHub
Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
☆278Nov 3, 2023Updated 2 years ago
turboderp / exllama
View on GitHub
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆2,934Sep 30, 2023Updated 2 years ago
imoneoi / multipack
View on GitHub
Multipack distributed sampler for fast padding-free training of LLMs
☆207Aug 10, 2024Updated last year
explosion / curated-transformers
View on GitHub
🤖 A PyTorch library of curated Transformer models and their composable components
☆892Apr 17, 2024Updated 2 years ago
abacaj / train-with-fsdp
View on GitHub
☆93Oct 5, 2023Updated 2 years ago
teknium1 / transformers-gptq-quant
View on GitHub
☆46Oct 13, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
hao-ai-lab / LookaheadDecoding
View on GitHub
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
☆1,340Mar 6, 2025Updated last year
euclaise / supertrainer2000
View on GitHub
☆50Mar 14, 2024Updated 2 years ago
Cohere-Labs-Community / parameter-efficient-moe
View on GitHub
☆278Oct 31, 2023Updated 2 years ago
abacusai / Long-Context
View on GitHub
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆603Nov 17, 2023Updated 2 years ago
muellerzr / minimal-trainer-zoo
View on GitHub
Minimal example scripts of the Hugging Face Trainer, focused on staying under 150 lines
☆197May 6, 2024Updated 2 years ago
dwzhu-pku / PoSE
View on GitHub
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆208May 20, 2024Updated 2 years ago
AblateIt / finetune-study
View on GitHub
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Sep 10, 2023Updated 2 years ago
IBM / ModuleFormer
View on GitHub
ModuleFormer is a MoE-based architecture that includes two different types of experts: stick-breaking attention heads and feedforward exp…
☆225Sep 18, 2025Updated 10 months ago
jondurbin / airoboros
View on GitHub
Customizable implementation of the self-instruct paper.
☆1,051Mar 7, 2024Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
teknium1 / LLM-Benchmark-Logs
View on GitHub
Just a bunch of benchmark logs for different LLMs
☆130Jul 28, 2024Updated last year
yacineMTB / talk
View on GitHub
Let's make sand talk
☆590Oct 17, 2023Updated 2 years ago
chaitanyamalaviya / ExpertQA
View on GitHub
[Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers
☆139Mar 14, 2024Updated 2 years ago
Guitaricet / relora
View on GitHub
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
☆474Apr 21, 2024Updated 2 years ago
predibase / lorax
View on GitHub
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
☆3,820May 28, 2026Updated last month
abacaj / fine-tune-mistral
View on GitHub
Fine-tune mistral-7B on 3090s, a100s, h100s
☆735Oct 11, 2023Updated 2 years ago
Preemo-Inc / text-generation-inference
View on GitHub
☆199Feb 9, 2024Updated 2 years ago
mit-han-lab / streaming-llm
View on GitHub
[ICLR 2024] Efficient Streaming Language Models with Attention Sinks
☆7,249Jul 11, 2024Updated 2 years ago
huggingface / text-generation-inference
View on GitHub
Large Language Model Text Generation Inference
☆10,882Mar 21, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
salesforce / xgen
View on GitHub
Salesforce open-source LLMs with 8k sequence length.
☆727Jun 2, 2026Updated last month
alexrs / herd
View on GitHub
Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.
☆11Feb 11, 2024Updated 2 years ago
bitsandbytes-foundation / bitsandbytes
View on GitHub
Accessible large language models via k-bit quantization for PyTorch.
☆8,338Updated this week
OpenLemur / Lemur
View on GitHub
[ICLR 2024] Lemur: Open Foundation Models for Language Agents
☆557Oct 28, 2023Updated 2 years ago
microsoft / Llama-2-Onnx
View on GitHub
☆1,026Jan 4, 2024Updated 2 years ago
teknium1 / GPTeacher
View on GitHub
A collection of modular datasets generated by GPT-4, General-Instruct - Roleplay-Instruct - Code-Instruct - and Toolformer
☆1,668Sep 15, 2023Updated 2 years ago
CarperAI / treasure_trove
View on GitHub
☆21Aug 27, 2023Updated 2 years ago