Manuel030 / alpaca-opt
Yet another LLM
☆10 · Updated 2 years ago
Alternatives and similar repositories for alpaca-opt
Users interested in alpaca-opt are comparing it to the libraries listed below.
- Experimental sampler to make LLMs more creative ☆31 · Updated 2 years ago
- ☆73 · Updated 2 years ago
- The Next Generation Multi-Modality Superintelligence ☆69 · Updated last year
- GPT-2 small trained on phi-like data ☆67 · Updated last year
- ☆63 · Updated last year
- ☆40 · Updated 2 years ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite. ☆35 · Updated last year
- entropix-style sampling + GUI ☆27 · Updated last year
- Modified beam search with periodic restart ☆12 · Updated last year
- A simple package for leveraging Falcon 180B and the HF ecosystem's tools, including training/inference scripts, safetensors, integrations… ☆11 · Updated last year
- Demonstration that finetuning a RoPE model on longer sequences than it was pre-trained on adapts the model's context limit ☆62 · Updated 2 years ago
- ☆33 · Updated 2 years ago
- An open-source replication of the strawberry method that leverages Monte Carlo search with PPO and/or DPO ☆29 · Updated this week
- ☆49 · Updated last year
- Implementation of the Mamba SSM with hf_integration. ☆56 · Updated last year
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 · Updated last year
- Merge LLMs that are split into parts ☆26 · Updated 3 months ago
- GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ ☆102 · Updated 2 years ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models ☆22 · Updated 11 months ago
- ☆39 · Updated 3 years ago
- SparseGPT + GPTQ compression of LLMs like LLaMA, OPT, and Pythia ☆40 · Updated 2 years ago
- Finetune any model on HF in less than 30 seconds ☆55 · Updated last week
- 4-bit quantization of SantaCoder using GPTQ ☆50 · Updated 2 years ago
- QLoRA with enhanced multi-GPU support ☆37 · Updated 2 years ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all large language models ☆69 · Updated 2 years ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆169 · Updated last year
- GoldFinch and other hybrid transformer components ☆45 · Updated last year
- ☆17 · Updated last year
- Official code for the ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L… ☆45 · Updated 2 years ago
- ☆35 · Updated 2 years ago