Gryphe/BlockMerge_Gradient

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Gryphe/BlockMerge_Gradient)

Gryphe / BlockMerge_Gradient

Merge Transformers language models by use of gradient parameters.

☆214

Alternatives and similar repositories for BlockMerge_Gradient

Users that are interested in BlockMerge_Gradient are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

TehVenomm / LM_Transformers_BlockMerge
View on GitHub
Image Diffusion block merging technique applied to transformers based Language Models.
☆55May 8, 2023Updated 3 years ago
Digitous / ModelREVOLVER
View on GitHub
Model REVOLVER, a human in the loop model mixing system.
☆33Aug 2, 2023Updated 2 years ago
zarakiquemparte / zaraki-tools
View on GitHub
☆28Aug 30, 2023Updated 2 years ago
Gryphe / MergeMonster
View on GitHub
An unsupervised model merging algorithm for Transformers-based language models.
☆107Apr 29, 2024Updated 2 years ago
CoffeeVampir3 / ez-trainer
View on GitHub
Train Llama Loras Easily
☆30Aug 3, 2023Updated 2 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
serp-ai / Parameter-Efficient-MoE
View on GitHub
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31May 22, 2024Updated 2 years ago
jondurbin / airoboros
View on GitHub
Customizable implementation of the self-instruct paper.
☆1,051Mar 7, 2024Updated 2 years ago
arcee-ai / mergekit
View on GitHub
Tools for merging pretrained large language models.
☆7,246Jun 17, 2026Updated last month
CarperAI / squeakily
View on GitHub
A library for squeakily cleaning and filtering language datasets.
☆50Jul 10, 2023Updated 3 years ago
Digitous / LLM-SLERP-Merge
View on GitHub
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆153Sep 10, 2023Updated 2 years ago
QuixiAI / laserRMT
View on GitHub
This is our own implementation of 'Layer Selective Rank Reduction'
☆240May 26, 2024Updated 2 years ago
dibrale / webui_autonomics
View on GitHub
Dynamic parameter modulation for oobabooga's text-generation-webui that adjusts generation parameters to better mirror user affect.
☆37Jul 28, 2023Updated 2 years ago
the-crypt-keeper / the-muse
View on GitHub
Experimental sampler to make LLMs more creative
☆31Aug 2, 2023Updated 2 years ago
yule-BUAA / MergeLM
View on GitHub
Codebase for Merging Language Models (ICML 2024)
☆869May 5, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
PygmalionAI / logbooks
View on GitHub
Where we keep our notes about model training runs.
☆16Mar 12, 2023Updated 3 years ago
turboderp-org / exllamav2
View on GitHub
A fast inference library for running LLMs locally on modern consumer-class GPUs
☆4,586Mar 4, 2026Updated 4 months ago
jb-01 / LoRA-TLE
View on GitHub
Token-level adaptation of LoRA matrices for downstream task generalization.
☆15Apr 14, 2024Updated 2 years ago
YanniKouloumbis / next-js-window-ai
View on GitHub
A Next.js chatbot app demonstrating seamless integration with window.ai.
☆15Jun 25, 2023Updated 3 years ago
turboderp-org / exui
View on GitHub
Web UI for ExLlamaV2
☆512Feb 5, 2025Updated last year
thomasgauthier / LoRD
View on GitHub
Low-Rank adapter extraction for fine-tuned transformers models
☆181May 2, 2024Updated 2 years ago
axolotl-ai-cloud / axolotl
View on GitHub
Go ahead and axolotl questions
☆12,219Updated this week
turboderp / exllama
View on GitHub
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆2,931Sep 30, 2023Updated 2 years ago
AblateIt / finetune-study
View on GitHub
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Sep 10, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
FartyPants / the_muse
View on GitHub
oobabooga extension - Experimental sampler to make LLMs more creative
☆23Aug 2, 2023Updated 2 years ago
Aratako / Task-Vector-Merge-Optimzier
View on GitHub
☆16Apr 11, 2024Updated 2 years ago
kaiokendev / cutoff-len-is-context-len
View on GitHub
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Jun 21, 2023Updated 3 years ago
p-e-w / chatbot_clinic
View on GitHub
Science-driven chatbot development
☆66May 5, 2024Updated 2 years ago
dphnAI / sonar
View on GitHub
Large-scale LLM inference engine
☆1,803Updated this week
IST-DASLab / qmoe
View on GitHub
Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".
☆278Nov 3, 2023Updated 2 years ago
jondurbin / bagel
View on GitHub
A bagel, with everything.
☆326Apr 11, 2024Updated 2 years ago
ThereforeGames / echoproof
View on GitHub
Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …
☆32Nov 20, 2023Updated 2 years ago
teknium1 / ShareGPT-Builder
View on GitHub
☆126Dec 18, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jquesnelle / literAI
View on GitHub
Generate visual podcasts about novels using open source models
☆33Feb 15, 2023Updated 3 years ago
brucepro / Memoir
View on GitHub
Memoir+ a persona memory extension for Text Gen Web UI.
☆226Feb 5, 2026Updated 5 months ago
arcee-ai / PruneMe
View on GitHub
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
☆266Apr 23, 2024Updated 2 years ago
fishiatee / Tumera
View on GitHub
Yet another frontend for LLM, written using .NET and WinUI 3
☆11Sep 14, 2025Updated 10 months ago
prateeky2806 / ties-merging
View on GitHub
☆215Feb 3, 2024Updated 2 years ago
AutoGPTQ / AutoGPTQ
View on GitHub
An easy-to-use LLMs quantization package with user-friendly apis, based on GPTQ algorithm.
☆5,073Apr 11, 2025Updated last year
hadasah / btm
View on GitHub
☆79Apr 29, 2024Updated 2 years ago