TehVenomm/LM_Transformers_BlockMerge

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TehVenomm/LM_Transformers_BlockMerge)

TehVenomm / LM_Transformers_BlockMerge

Image Diffusion block merging technique applied to transformers based Language Models.

☆55

Alternatives and similar repositories for LM_Transformers_BlockMerge

Users that are interested in LM_Transformers_BlockMerge are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Digitous / ModelREVOLVER
View on GitHub
Model REVOLVER, a human in the loop model mixing system.
☆33Aug 2, 2023Updated 2 years ago
zarakiquemparte / zaraki-tools
View on GitHub
☆28Aug 30, 2023Updated 2 years ago
Gryphe / BlockMerge_Gradient
View on GitHub
Merge Transformers language models by use of gradient parameters.
☆215Aug 8, 2024Updated last year
practical-dreamer / vicuna_to_alpacan
View on GitHub
Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer
☆12Jun 21, 2023Updated 3 years ago
Digitous / LLM-SLERP-Merge
View on GitHub
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆153Sep 10, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
paniphons / open-textbot-datasets
View on GitHub
Collection of various text datasets to assist ML researchers in training or fine-tuning their models
☆21Apr 1, 2023Updated 3 years ago
jb-01 / LoRA-TLE
View on GitHub
Token-level adaptation of LoRA matrices for downstream task generalization.
☆15Apr 14, 2024Updated 2 years ago
aiczk / sound-helper
View on GitHub
Python scripts for AI voice changers
☆14Apr 25, 2023Updated 3 years ago
AblateIt / finetune-study
View on GitHub
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆82Sep 10, 2023Updated 2 years ago
kaiokendev / cutoff-len-is-context-len
View on GitHub
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Jun 21, 2023Updated 3 years ago
CoffeeVampir3 / ez-trainer
View on GitHub
Train Llama Loras Easily
☆30Aug 3, 2023Updated 2 years ago
huu4ontocord / MDEL
View on GitHub
Multi-Domain Expert Learning
☆66Jan 23, 2024Updated 2 years ago
bloomberg / MixCE-acl2023
View on GitHub
Implementation of MixCE method described in ACL 2023 paper by Zhang et al.
☆20May 29, 2023Updated 3 years ago
argilla-io / distilabel-spin-dibt
View on GitHub
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Mar 12, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
vaguenebula / AlpacaDataReflect
View on GitHub
An experiment to see if chatgpt can improve the output of the stanford alpaca dataset
☆12Mar 29, 2023Updated 3 years ago
emrgnt-cmplxty / SmolTrainer
View on GitHub
☆21Oct 6, 2023Updated 2 years ago
Gryphe / MergeMonster
View on GitHub
An unsupervised model merging algorithm for Transformers-based language models.
☆107Apr 29, 2024Updated 2 years ago
AdjointOperator / Augmented-DDTagger
View on GitHub
Multi-backend (WD taggers, deepdanbooru) fast automatic tagging utility
☆26Feb 3, 2023Updated 3 years ago
Aemon-Algiz / LoRAExamples
View on GitHub
Small repository for my video on LoRA
☆16May 14, 2023Updated 3 years ago
CarperAI / squeakily
View on GitHub
A library for squeakily cleaning and filtering language datasets.
☆50Jul 10, 2023Updated 3 years ago
OpenAccess-AI-Collective / ggml-webui
View on GitHub
Deploy your GGML models to HuggingFace Spaces with Docker and gradio
☆37Jun 6, 2023Updated 3 years ago
PygmalionAI / logbooks
View on GitHub
Where we keep our notes about model training runs.
☆16Mar 12, 2023Updated 3 years ago
iwalton3 / mpt-lora-patch
View on GitHub
Patch for MPT-7B which allows using and training a LoRA
☆57May 20, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HypherX / Evolution-Analysis
View on GitHub
☆25Dec 13, 2024Updated last year
official-elinas / zeus-llm-trainer
View on GitHub
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Aug 27, 2023Updated 2 years ago
rosmineb / unit_test_rl
View on GitHub
Project code for training LLMs to write better unit tests + code
☆22May 19, 2025Updated last year
reactorsh / ambrosia
View on GitHub
clean up your LLM datasets
☆113May 30, 2023Updated 3 years ago
prateeky2806 / ties-merging
View on GitHub
☆216Feb 3, 2024Updated 2 years ago
AlpinDale / RPTQ-for-LLaMA
View on GitHub
Efficient 3bit/4bit quantization of LLaMA models
☆18May 18, 2023Updated 3 years ago
r-three / mats
View on GitHub
☆33Jul 8, 2024Updated 2 years ago
AlpinDale / sparsegpt-for-LLaMA
View on GitHub
Code for the paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot" with LLaMA implementation.
☆71Mar 30, 2023Updated 3 years ago
recoilme / losslessmix
View on GitHub
Mixing models of stable diffusion without weights loss
☆68Apr 19, 2023Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
crosstyan / generate-forever
View on GitHub
a userscript to generate forever for a novel site
☆12Jun 30, 2025Updated last year
lablab-ai / webgpu-llm-chrome-extension-starter
View on GitHub
WebLLM Chrome Extension Starter Pack.
☆12Aug 10, 2023Updated 2 years ago
dashends / CodeSyntax
View on GitHub
Code and dataset for EMNLP 2022 Findings paper "Benchmarking Language Models for Code Syntax Understanding"
☆16Oct 24, 2022Updated 3 years ago
zhangir-azerbayev / proof-pile
View on GitHub
Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.
☆22Nov 26, 2022Updated 3 years ago
kwea123 / awesome-NeRF
View on GitHub
A curated list of awesome neural radiance fields papers
☆13Mar 11, 2021Updated 5 years ago
euclaise / SlimTrainer
View on GitHub
Full finetuning of large language models without large memory requirements
☆92Sep 22, 2025Updated 10 months ago
andersonbcdefg / dpo-lora
View on GitHub
direct preference optimization with only 1 model copy :)
☆14Oct 2, 2023Updated 2 years ago