kuleshov-group / llmtoolsLinks

Finetuning Large Language Models on One Consumer GPU in 2 Bits

☆732

Alternatives and similar repositories for llmtools

Users that are interested in llmtools are comparing it to the libraries listed below

Sorting:

arielnlee / Platypus
Code for fine-tuning Platypus fam LLMs using LoRA
☆629Updated last year
tomaarsen / attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆729Updated last year
rmihaylov / falcontune
Tune any FALCON in 4-bit
☆464Updated 2 years ago
mbzuai-nlp / LaMini-LM
LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions
☆822Updated 2 years ago
johnsmith0031 / alpaca_lora_4bit
☆534Updated last year
yuchenlin / LLM-Blender
[ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…
☆970Updated last year
Vahe1994 / SpQR
☆548Updated 11 months ago
zphang / minimal-llama
☆457Updated 2 years ago
OpenLMLab / LOMO
LOMO: LOw-Memory Optimization
☆990Updated last year
DachengLi1 / LongChat
Official repository for LongChat and LongEval
☆531Updated last year
SqueezeAILab / SqueezeLLM
[ICML 2024] SqueezeLLM: Dense-and-Sparse Quantization
☆708Updated last year
salesforce / xgen
Salesforce open-source LLMs with 8k sequence length.
☆722Updated 9 months ago
sabetAI / BLoRA
batched loras
☆348Updated 2 years ago
epfml / landmark-attention
Landmark Attention: Random-Access Infinite Context Length for Transformers
☆426Updated last year
jondurbin / airoboros
Customizable implementation of the self-instruct paper.
☆1,050Updated last year
SkunkworksAI / hydra-moe
☆415Updated 2 years ago
gururise / AlpacaDataCleaned
Alpaca dataset from Stanford, cleaned and curated
☆1,576Updated 2 years ago
abacusai / Long-Context
This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…
☆597Updated 2 years ago
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆714Updated 2 years ago
datamllab / LongLM
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
☆660Updated last year
jondurbin / bagel
A bagel, with everything.
☆324Updated last year
abertsch72 / unlimiformer
Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"
☆1,063Updated last year
jquesnelle / yarn
YaRN: Efficient Context Window Extension of Large Language Models
☆1,637Updated last year
dzhulgakov / llama-mistral
Inference code for Mistral and Mixtral hacked up into original Llama implementation
☆369Updated last year
IBM / Dromedary
Dromedary: towards helpful, ethical and reliable LLMs.
☆1,143Updated 2 months ago
persimmon-ai-labs / adept-inference
Inference code for Persimmon-8B
☆412Updated 2 years ago
PiotrNawrot / nanoT5
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆1,013Updated last year
declare-lab / flan-alpaca
This repository contains code for extending the Stanford Alpaca synthetic instruction tuning to existing instruction-tuned models such as…
☆356Updated 2 years ago
apoorvumang / prompt-lookup-decoding
☆577Updated last year
huggingface / llm_training_handbook
An open collection of methodologies to help with successful training of large language models.
☆539Updated last year