matt-c1 / llama-3-quant-comparisonView external linksLinks
Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.
☆165May 16, 2024Updated last year
Alternatives and similar repositories for llama-3-quant-comparison
Users that are interested in llama-3-quant-comparison are comparing it to the libraries listed below
Sorting:
- an auto-sleeping and -waking framework around llama.cpp☆12Feb 8, 2025Updated last year
- Attend - to what matters.☆17Feb 22, 2025Updated 11 months ago
- Experimental LLM Inference UX to aid in creative writing☆128Dec 14, 2024Updated last year
- AirLLM 70B inference with single 4GB GPU☆17Jun 27, 2025Updated 7 months ago
- Web Interface for Vision Language Models Including InternVLM2☆25Jul 29, 2024Updated last year
- Web UI for ExLlamaV2☆513Feb 5, 2025Updated last year
- The official API server for Exllama. OAI compatible, lightweight, and fast.☆1,129Updated this week
- entropix style sampling + GUI☆27Oct 30, 2024Updated last year
- a browser gui for nvidia smi☆20Mar 17, 2025Updated 10 months ago
- Mixture-of-Ollamas☆30Aug 12, 2024Updated last year
- Experimental sampler to make LLMs more creative☆31Aug 2, 2023Updated 2 years ago
- This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?☆1,450Nov 13, 2025Updated 3 months ago
- private-machine is an AI companion system with emotion, needs and goals simulation. Very silly, not based on real science.☆28Nov 13, 2025Updated 3 months ago
- Practical and advanced guide to LLMOps. It provides a solid understanding of large language models’ general concepts, deployment techniqu…☆79Aug 16, 2024Updated last year
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated last month
- A stock market bot that automatically, once a day, rebalances your Robinhood portfolio by gathering information about each ticker in the …☆60Feb 25, 2025Updated 11 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- A very simple interactive demo to understand the common LLM samplers.☆40Jul 9, 2024Updated last year
- Local first human friendly agents toolkit for the browser and Nodejs☆45Feb 2, 2026Updated last week
- Who needs o1 anyways. Add CoT to any OpenAI compatible endpoint.☆44Sep 17, 2024Updated last year
- Copilot with deepseek and more...☆13Mar 7, 2025Updated 11 months ago
- Photo Tinder - Desktop app for image triage and ranking (Tauri)☆20Dec 18, 2025Updated last month
- ☆24Jan 22, 2025Updated last year
- ☆23Jun 4, 2024Updated last year
- Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc☆2,374Feb 8, 2026Updated last week
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆49Oct 29, 2025Updated 3 months ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆82Feb 7, 2026Updated last week
- CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…☆67Nov 5, 2024Updated last year
- Docker images and configuration to run text-generation-webui with GPU or CPU support☆32Mar 19, 2024Updated last year
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 9 months ago
- ☆109Aug 21, 2025Updated 5 months ago
- Docker image for AI Horde dreamer (Stable Diffusion)☆12Sep 13, 2023Updated 2 years ago
- Modified Beam Search with periodical restart☆12Sep 12, 2024Updated last year
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Oct 9, 2024Updated last year
- ☆71Jun 20, 2025Updated 7 months ago
- Large-scale LLM inference engine☆1,651Jan 21, 2026Updated 3 weeks ago
- Powerful LLM Query Framework with YAML Prompt Templates. Made for Automation☆34Sep 20, 2025Updated 4 months ago
- Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claud…☆30Mar 20, 2025Updated 10 months ago
- Agent framework for generating a synthetic dataset. This will be raw CoT and Reflection output to be cleaned up by a later step.☆15Apr 11, 2025Updated 10 months ago