jorahn / llama-int8Links

Quantized inference code for LLaMA models

☆13

Alternatives and similar repositories for llama-int8

Users that are interested in llama-int8 are comparing it to the libraries listed below

Sorting:

kir-gadjello / zipslicer
A library for incremental loading of large PyTorch checkpoints
☆56Updated 2 years ago
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14Updated 2 years ago
zarakiquemparte / zaraki-tools
☆27Updated last year
Hellisotherpeople / llm_steer-oobabooga
Steer LLM outputs towards a certain topic/subject and enhance response capabilities using activation engineering by adding steering vecto…
☆43Updated last year
finetunej / transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
☆56Updated 3 years ago
Birch-san / falcon-play
Command-line script for inferencing from models such as falcon-7b-instruct
☆75Updated 2 years ago
Netwrck / stable-diffusion-server
Image Generation API Server - Similar to https://text-generator.io but for images
☆50Updated last week
trevbook / sd-prompt-graph
A curve-editor for Stable Diffusion prompt interpolation
☆21Updated 2 years ago
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31Updated last year
lachlansneff / sparsellama
☆40Updated 2 years ago
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago
DOUDOU0314 / GPT-J-hf
GPT-jax based on the official huggingface library
☆13Updated 4 years ago
PuchToTalk / DOOM-MistralAI
Mistral7B playing DOOM
☆28Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated last year
camenduru / stable-fast-colab
☆28Updated last year
jina-ai / big_creative_ai
BIG: Back In the Game of Creative AI
☆27Updated 2 years ago
mrcolo / longboii
☆19Updated 2 years ago
CoffeeVampir3 / ez-trainer
Train Llama Loras Easily
☆31Updated last year
jllllll / exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
☆64Updated last year
LAION-AI / opendream
Frontend (and soon also midleware and backend) for a new, opensource image generation platform.
☆14Updated 2 years ago
ahmed-moubtahij / TokenHealer
☆22Updated last year
AlpinDale / LLM-Shearing
Preprint: Sheared LLaMA: Accelerating Language Model Pre-training via Structured Pruning
☆28Updated last year
huggingface / discord-bots
☆50Updated last year
jags111 / floral-diffusion
Floral Diffusion is a custom diffusion model trained by jags using a DD 5.6 version
☆26Updated 2 years ago
Birch-san / diffusers-play
Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested.
☆53Updated last year
ConiferLabsWA / flan-ul2-alpaca
☆32Updated 2 years ago
silphendio / sliced_llama
Simple LLM inference server
☆20Updated last year
simonw / laion-aesthetic-datasette
Use Datasette to explore LAION improved_aesthetics_6plus training data used by Stable DIffusion
☆58Updated last year
reka-ai / rekaquant
☆49Updated last week
ethansmith2000 / QuickEmbedding
☆1Updated 4 months ago