donaldafeith / Pytorch_MergeLinks

Merge LLM that are split in to parts

☆27

Alternatives and similar repositories for Pytorch_Merge

Users that are interested in Pytorch_Merge are comparing it to the libraries listed below

Sorting:

huu4ontocord / MDEL
Multi-Domain Expert Learning
☆67Updated last year
zarakiquemparte / zaraki-tools
☆27Updated 2 years ago
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37Updated 2 years ago
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago
rmihaylov / mpttune
Tune MPTs
☆84Updated 2 years ago
emrgnt-cmplxty / zero-shot-replication
☆74Updated 2 years ago
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆104Updated 6 months ago
thooton / muse
Let's create synthetic textbooks together :)
☆75Updated last year
euclaise / supertrainer2000
☆50Updated last year
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated 2 months ago
jondurbin / qlora
QLoRA: Efficient Finetuning of Quantized LLMs
☆79Updated last year
ConiferLabsWA / flan-ul2-alpaca
☆33Updated 2 years ago
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆70Updated 2 years ago
uukuguy / multi_loras
Load multiple LoRA modules simultaneously and automatically switch the appropriate combination of LoRA modules to generate the best answe…
☆157Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆35Updated last year
CERC-AAI / Robin
☆63Updated last year
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆116Updated 2 years ago
TehVenomm / LM_Transformers_BlockMerge
Image Diffusion block merging technique applied to transformers based Language Models.
☆56Updated 2 years ago
qwopqwop200 / gptqlora
GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ
☆102Updated 2 years ago
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆56Updated last year
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆179Updated last year
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31Updated 2 years ago
CarperAI / treasure_trove
☆22Updated 2 years ago
AblateIt / finetune-study
Comprehensive analysis of difference in performance of QLora, Lora, and Full Finetunes.
☆83Updated 2 years ago
Zyphra / Zyda_processing
☆39Updated last year
arcee-ai / DAM
☆55Updated last year
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆170Updated last year
Digitous / ModelREVOLVER
Model REVOLVER, a human in the loop model mixing system.
☆33Updated 2 years ago
kyegomez / Finetuning-Suite
Finetune any model on HF in less than 30 seconds
☆56Updated last month
VatsaDev / NanoPhi-alpha
GPT-2 small trained on phi-like data
☆67Updated last year