matt-c1 / llama-3-quant-comparisonLinks

Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.

☆165

Alternatives and similar repositories for llama-3-quant-comparison

Users that are interested in llama-3-quant-comparison are comparing it to the libraries listed below

Sorting:

epolewski / EricLLM
A fast batching API to serve LLM models
☆189Updated last year
sam-paech / antislop-sampler
☆330Updated 4 months ago
chigkim / Ollama-MMLU-Pro
☆108Updated 3 months ago
rafacelente / bllama
1.58-bit LLaMa model
☆83Updated last year
QuixiAI / laserRMT
This is our own implementation of 'Layer Selective Rank Reduction'
☆240Updated last year
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆179Updated last year
turboderp-org / exllamav3
An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs
☆586Updated last week
itsme2417 / PolyMind
A multimodal, function calling powered LLM webui.
☆217Updated last year
CerebrasResearch / reap
REAP: Router-weighted Expert Activation Pruning for SMoE compression
☆129Updated 3 weeks ago
theroyallab / YALS
☆86Updated 2 weeks ago
LostRuins / datasetexplorer
Easily view and modify JSON datasets for large language models
☆84Updated 6 months ago
severian42 / Vodalus-Expert-LLM-Forge
Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …
☆190Updated last year
arcee-ai / PruneMe
Automated Identification of Redundant Layer Blocks for Pruning in Large Language Models
☆256Updated last year
SicariusSicariiStuff / SLOP_Detector
SLOP Detector and analyzer based on dictionary for shareGPT JSON and text
☆79Updated last week
turboderp-org / exui
Web UI for ExLlamaV2
☆514Updated 10 months ago
jukofyork / transplant-vocab
Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.
☆47Updated last month
jd-3d / SOLOBench
☆135Updated 7 months ago
fairydreaming / farel-bench
Testing LLM reasoning abilities with family relationship quizzes.
☆63Updated 10 months ago
EQ-bench / EQ-Bench
A benchmark for emotional intelligence in large language models
☆389Updated last year
AndrewVeee / nucleo-ai
An AI assistant beyond the chat box.
☆328Updated last year
FailSpy / abliterator
Simple Python library/structure to ablate features in LLMs which are supported by TransformerLens
☆537Updated last year
av / klmbr
klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs
☆86Updated last year
TheProxyCompany / proxy-structuring-engine
Guaranteed Structured Output from any Language Model via Hierarchical State Machines
☆145Updated last month
QuixiAI / OpenChatML
☆164Updated 3 months ago
matatonic / openedai-vision
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
☆266Updated 9 months ago
AlpinDale / gptslop
A community list of common phrases generated by GPT and Claude models
☆79Updated 2 years ago
the-crypt-keeper / LLooM
Experimental LLM Inference UX to aid in creative writing
☆127Updated 11 months ago
Gryphe / BlockMerge_Gradient
Merge Transformers language models by use of gradient parameters.
☆209Updated last year
willccbb / mlx_parallm
Fast parallel LLM inference for MLX
☆234Updated last year
Maximilian-Winter / llama-cpp-agent
The llama-cpp-agent framework is a tool designed for easy interaction with Large Language Models (LLMs). Allowing users to chat with LLM …
☆611Updated 9 months ago