MDK8888 / GPTFastLinks

Accelerate your Hugging Face Transformers 7.6-9x. Native to Hugging Face and PyTorch.

☆684

Alternatives and similar repositories for GPTFast

Users that are interested in GPTFast are comparing it to the libraries listed below

Sorting:

mistralai-sf24 / hackathon
☆447Updated last year
AnswerDotAI / fsdp_qlora
Training LLMs with QLoRA + FSDP
☆1,490Updated 8 months ago
huggingface / optimum-nvidia
☆986Updated 5 months ago
Vahe1994 / AQLM
Official Pytorch repository for Extreme Compression of Large Language Models via Additive Quantization https://arxiv.org/pdf/2401.06118.p…
☆1,269Updated 2 months ago
myshell-ai / JetMoE
Reaching LLaMA2 Performance with 0.1M Dollars
☆984Updated 11 months ago
tomaarsen / attention_sinks
Extend existing LLMs way beyond the original training length with constant memory usage, without retraining
☆701Updated last year
likejazz / llama3.np
llama3.np is a pure NumPy implementation for Llama 3 model.
☆986Updated 2 months ago
abacaj / fine-tune-mistral
Fine-tune mistral-7B on 3090s, a100s, h100s
☆714Updated last year
jondurbin / bagel
A bagel, with everything.
☆322Updated last year
apoorvumang / prompt-lookup-decoding
☆546Updated 10 months ago
mlabonne / llm-autoeval
Automatically evaluate your LLMs in Google Colab
☆649Updated last year
nomic-ai / contrastors
Train Models Contrastively in Pytorch
☆727Updated 3 months ago
pbelcak / UltraFastBERT
The repository for the code of the UltraFastBERT paper
☆516Updated last year
persimmon-ai-labs / adept-inference
Inference code for Persimmon-8B
☆415Updated last year
valine / NeuralFlow
Visualize the intermediate output of Mistral 7B
☆365Updated 5 months ago
mistralai / mistral-common
Official inference library for pre-processing of Mistral models
☆755Updated this week
mistralai / megablocks-public
☆864Updated last year
lucidrains / self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,392Updated last year
mobiusml / hqq
Official implementation of Half-Quadratic Quantization (HQQ)
☆842Updated last week
SkunkworksAI / hydra-moe
☆415Updated last year
alasdairforsythe / tokenmonster
Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript
☆588Updated last year
sabetAI / BLoRA
batched loras
☆343Updated last year
uclaml / SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
☆1,172Updated last year
cognitivecomputations / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
scaleapi / llm-engine
Scale LLM Engine public repository
☆808Updated last week
Cerebras / gigaGPT
a small code base for training large models
☆304Updated 2 months ago
galatolofederico / microchain
function calling-based LLM agents
☆287Updated 9 months ago
zhudotexe / kani
kani (カニ) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)
☆583Updated last week
AI-Hypercomputer / maxtext
A simple, performant and scalable Jax LLM!
☆1,831Updated this week
microsoft / Samba
[ICLR 2025] Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling
☆888Updated 2 months ago