Mixed-precision quantization for LLMs. Every layer refracts into a different format based on its sensitivity. Native compressed-tensors export, validated on Qwen3.6-35B-A3B MoE with MTP speculative decoding.
☆80 · Jun 2, 2026 · Updated this week
Alternatives and similar repositories for prismaquant
Users who are interested in prismaquant are comparing it to the libraries listed below.
- sparkrun - launch, manage, and stop LLM inference workloads on NVIDIA DGX Spark systems ☆301 · Updated this week
- Local AI photo scoring, culling, and gallery: score, organise, and explore your library with face recognition and semantic search. No cl… ☆106 · Updated this week
- Learn faster with the power of AI ☆17 · Updated this week
- A dynamic multi-expert AI architecture running on a single consumer GPU (RTX 3060). ☆36 · Dec 2, 2025 · Updated 6 months ago
- Kiwix ZIM-to-vector RAG system for local, offline LLM knowledge retrieval ☆21 · Mar 24, 2026 · Updated 2 months ago
- ☆25 · Oct 13, 2025 · Updated 7 months ago
- Voice Assistant using Google's Gemini API Key ☆21 · Jul 25, 2024 · Updated last year
- Ultimate Persona is an all-in-one persona generator and plot hook creator for SillyTavern. It uses pre-existing character cards to shape … ☆40 · Dec 30, 2025 · Updated 5 months ago
- Professional desktop app for converting text to audiobooks with local TTS ☆33 · Oct 6, 2025 · Updated 8 months ago
- ☆108 · Mar 6, 2026 · Updated 3 months ago
- ☆35 · Jan 2, 2026 · Updated 5 months ago
- A tool-call based memory system for SillyTavern ☆37 · Dec 30, 2025 · Updated 5 months ago
- Tries to UI development. Clone of https://www.perplexity.ai/ ☆11 · Sep 30, 2023 · Updated 2 years ago
- Implements harmful/harmless refusal removal using pure HF Transformers ☆21 · May 8, 2025 · Updated last year
- 🧬 Viral genome reference alignment ☆12 · Jan 26, 2021 · Updated 5 years ago
- xLSTMAD - Powerful xLSTM based Method for Anomaly Detection ☆18 · Apr 27, 2026 · Updated last month
- ☆32 · Feb 18, 2025 · Updated last year
- ☆12 · Nov 21, 2024 · Updated last year
- Rewritten frontend for SillyTavern ☆75 · Feb 28, 2026 · Updated 3 months ago
- Automate things, visualize your flows. ☆39 · Jan 16, 2026 · Updated 4 months ago
- A SillyTavern extension that fixes schizo markdown. Also some HTML/JS stuff. ☆46 · Oct 17, 2025 · Updated 7 months ago
- ☆15 · Feb 23, 2026 · Updated 3 months ago
- ☆11 · Jun 21, 2023 · Updated 2 years ago
- Open-source AI-powered video sequence platform built with TanStack Start ☆111 · Updated this week
- ☆16 · Feb 9, 2024 · Updated 2 years ago
- Talon docs. ☆13 · Mar 2, 2020 · Updated 6 years ago
- High-Resolution Differential Z-Belt Mod for V0 (with optional Kirigami support) ☆12 · May 22, 2022 · Updated 4 years ago
- 152 open-source tools to run LLMs 100% locally – no cloud, no API keys, no censorship ☆77 · Nov 30, 2025 · Updated 6 months ago
- Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parall… ☆82 · Apr 22, 2026 · Updated last month
- MCP server that gives any LLM its own computer: managed Docker workspaces with live browser, terminal, code execution, document skills, … ☆88 · Updated this week
- ☆10 · Jan 22, 2023 · Updated 3 years ago
- Structured, temporal memory for AI agents. ☆82 · May 18, 2026 · Updated 3 weeks ago
- ☆57 · Oct 10, 2025 · Updated 7 months ago
- Automate AI testing & ensure model trustworthiness ☆13 · Nov 16, 2024 · Updated last year
- Computer-Use Agents as Judges for Generative UI ☆45 · Nov 27, 2025 · Updated 6 months ago
- LLM inference in C/C++ ☆21 · May 28, 2026 · Updated last week
- A mount for a standard 3x15mm cartridge thermistor on 1515 T-Slot Extrusion ☆10 · Feb 23, 2023 · Updated 3 years ago
- ☆13 · Aug 10, 2021 · Updated 4 years ago
- A tool for the Jubilee printer using a Voron afterburner extruder ☆10 · Jul 1, 2022 · Updated 3 years ago