rasbt / gradient-accumulation-blog

Finetuning BLOOM on a single GPU using gradient-accumulation

☆31

Alternatives and similar repositories for gradient-accumulation-blog

Users who are interested in gradient-accumulation-blog are comparing it to the repositories listed below.

kyegomez / Finetuning-Suite
Finetune any model on HF in less than 30 seconds
☆57 Updated 3 months ago
deep-diver / LLM-Pref-Mark-UI
☆37 Updated 2 years ago
geronimi73 / 3090_shorts
minimal scripts for 24GB VRAM GPUs. training, inference, whatever
☆41 Updated last month
Upaya07 / NeurIPS-llm-efficiency-challenge
Code for NeurIPS LLM Efficiency Challenge
☆59 Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69 Updated last year
v-prgmr / mergekit
Tools for merging pretrained large language models.
☆19 Updated last year
dmarx / zero-shot-intent-classifier
Minimal zero-shot intent classifier for arbitrary intent slot filling, via LLM prompting with LangChain.
☆33 Updated 2 years ago
leehanchung / lora-instruct
Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA
☆104 Updated last month
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.
☆33 Updated last year
pacman100 / peft-codegen-25
☆23 Updated 2 years ago
austinsilveria / tricksy
Fast approximate inference on a single GPU with sparsity aware offloading
☆38 Updated last year
philschmid / optimum-static-quantization
☆28 Updated 2 years ago
lamini-ai / lamini-earnings-calls
☆41 Updated last year
camenduru / nvidia-llm-colab
☆14 Updated last year
kyegomez / Kosmos-X
The Next Generation Multi-Modality Superintelligence
☆70 Updated 10 months ago
tanyuqian / cappy
NeurIPS 2023 - Cappy: Outperforming and Boosting Large Multi-Task LMs with a Small Scorer
☆43 Updated last year
lilakk / BLEUBERI
Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"
☆25 Updated last month
Percent-BFD / neurips_submission
☆16 Updated last year
edumunozsala / llama-2-7B-4bit-python-coder
Fine-tune and quantize Llama-2-like models to generate Python code using QLoRA, Axolotl, ...
☆64 Updated last year
johnrobinsn / redpajama
Training and Inference Notebooks for the RedPajama (OpenLlama) models
☆18 Updated 2 years ago
google-research-datasets / QAmeleon
QAmeleon introduces synthetic multilingual QA data using PaLM, a 540B large language model. This dataset was generated by prompt tuning P…
☆34 Updated last year
boschresearch / switchprompt
Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain…
☆52 Updated 2 years ago
ibm-granite / granite-embedding-models
☆29 Updated 3 weeks ago
mzbac / qlora-inference-multi-gpu
☆12 Updated 2 years ago
padas-lab-de / ir-rag-sigir24-persona-rag
☆47 Updated 9 months ago
haotian-liu / transformers_llava
☆14 Updated 2 years ago
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40 Updated 8 months ago
geronimi73 / phi2-finetune
☆87 Updated last year
Eureka6174 / LearnNLPlan
Learning to Program with Natural Language
☆6 Updated last year
ConiferLabsWA / flan-ul2-alpaca
☆32 Updated 2 years ago