AlexBuz / llama-zipLinks

LLM-powered lossless compression tool

☆283

Alternatives and similar repositories for llama-zip

Users that are interested in llama-zip are comparing it to the libraries listed below

Sorting:

epolewski / EricLLM
A fast batching API to serve LLM models
☆183Updated last year
ejones / llama-journey
Experimental adventure game with AI-generated content
☆111Updated 3 months ago
matt-c1 / llama-3-quant-comparison
Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.
☆156Updated last year
leafspark / AutoGGUF
automatically quant GGUF models
☆187Updated this week
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆173Updated last year
sam-paech / antislop-sampler
☆308Updated 3 months ago
turboderp-org / exui
Web UI for ExLlamaV2
☆503Updated 5 months ago
moritztng / fltr
Like grep but for natural language questions. Based on Mistral 7B or Mixtral 8x7B.
☆381Updated last year
av / klmbr
klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs
☆78Updated 9 months ago
itsme2417 / PolyMind
A multimodal, function calling powered LLM webui.
☆214Updated 9 months ago
jd-3d / SOLOBench
☆130Updated 2 months ago
TheProxyCompany / proxy-structuring-engine
Guaranteed Structured Output from any Language Model via Hierarchical State Machines
☆140Updated last month
fairydreaming / farel-bench
Testing LLM reasoning abilities with family relationship quizzes.
☆62Updated 5 months ago
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆239Updated last year
EGjoni / DRUGS
Stop messing around with finicky sampling parameters and just use DRµGS!
☆349Updated last year
pranavjad / tinyllama-bitnet
Train your own small bitnet model
☆74Updated 8 months ago
TC-Zheng / ActuosusAI
AI management tool
☆118Updated 8 months ago
theroyallab / YALS
☆80Updated this week
chigkim / Ollama-MMLU-Pro
☆95Updated 6 months ago
kroggen / mamba.c
Inference of Mamba models in pure C
☆188Updated last year
cognitivecomputations / OpenChatML
☆157Updated last year
matatonic / openedai-vision
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
☆257Updated 4 months ago
rafacelente / bllama
1.58-bit LLaMa model
☆81Updated last year
avarayr / suaveui
Open source LLM UI, compatible with all local LLM providers.
☆175Updated 9 months ago
Maximilian-Winter / llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …
☆578Updated 5 months ago
galatolofederico / microchain
function calling-based LLM agents
☆287Updated 10 months ago
LostRuins / datasetexplorer
Easily view and modify JSON datasets for large language models
☆78Updated 2 months ago
exo-explore / mlx-bitnet
1.58 Bit LLM on Apple Silicon using MLX
☆214Updated last year
wavify-labs / wavify-sdks
fast state-of-the-art speech models and a runtime that runs anywhere 💥
☆55Updated last month
distantmagic / paddler
Stateful load balancer custom-tailored for llama.cpp 🏓🦙
☆792Updated this week