Mixed-precision quantization for LLMs. Every layer refracts into a different format based on its sensitivity. Native compressed-tensors export, validated on Qwen3.6-35B-A3B MoE with MTP speculative decoding.
☆82Jun 22, 2026Updated this week
Alternatives and similar repositories for prismaquant
Users that are interested in prismaquant are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆72Feb 27, 2026Updated 4 months ago
- Thank you LenAnderson I am yoinking this!☆28May 3, 2026Updated last month
- Learn faster with the power of AI☆17Jun 14, 2026Updated 2 weeks ago
- Kiwix ZIM-to-vector RAG system for local, offline LLM knowledge retrieval☆25Mar 24, 2026Updated 3 months ago
- Voice Assistant using Google's Gemini API Key☆22Jul 25, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Professional desktop app for converting text to audiobooks with local TTS☆33Oct 6, 2025Updated 8 months ago
- ☆108Mar 6, 2026Updated 3 months ago
- ☆14Sep 4, 2024Updated last year
- Tool-calling quality benchmark for LLM serving stacks. 80+ deterministic scenarios testing multi-turn orchestration, safety boundaries, a…☆134Updated this week
- ☆41Jan 2, 2026Updated 5 months ago
- DevSecOps tools container, for use in local development and as a builder/runner in CI/CD pipelines. Not to be used to run production work…☆13Oct 21, 2022Updated 3 years ago
- Loader extension for tabbyAPI in SillyTavern☆27Jun 30, 2025Updated 11 months ago
- A dedicated effort to make an optimized, bleeding edge vLLM image using Docker to support DGX comprehensively☆121Feb 22, 2026Updated 4 months ago
- Tries to UI development. Clone of https://www.perplexity.ai/☆11Sep 30, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Implements harmful/harmless refusal removal using pure HF Transformers☆21May 8, 2025Updated last year
- ☆31Jan 13, 2026Updated 5 months ago
- 🧬 Viral genome reference alignment☆12Jan 26, 2021Updated 5 years ago
- xLSTMAD - Powerful xLSTM based Method for Anomaly Detection☆19Apr 27, 2026Updated 2 months ago
- ☆32Feb 18, 2025Updated last year
- ☆12Nov 21, 2024Updated last year
- AI coder powered by open source LLMs☆11Nov 28, 2024Updated last year
- OpenClaw Operator gives coding agents like Codex and Claude Code the context and playbooks needed to set up, validate, and troubleshoot a…☆20Mar 7, 2026Updated 3 months ago
- Rewritten frontend for SillyTavern☆81Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Automate things, visualize your flows.☆39Jun 9, 2026Updated 2 weeks ago
- ☆37Mar 12, 2026Updated 3 months ago
- A SillyTavern extension that fixes schizo markdown. Also some HTML/JS stuff.☆47Jun 9, 2026Updated 2 weeks ago
- ☆17Feb 23, 2026Updated 4 months ago
- ☆11Jun 21, 2023Updated 3 years ago
- CPU/GPU Implicit & Explicit Finite Element Solver for Large Strains☆26Feb 20, 2026Updated 4 months ago
- ☆16Feb 9, 2024Updated 2 years ago
- High-Resolution Differential Z-Belt Mod for V0 (with optional Kirigami support)☆12May 22, 2022Updated 4 years ago
- Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and in…☆18Nov 11, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 152 open-source tools to run LLMs 100% locally – no cloud, no API keys, no censorship☆83Nov 30, 2025Updated 6 months ago
- Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parall…☆83Updated this week
- ☆10Jan 22, 2023Updated 3 years ago
- Open-source AI-powered video sequence platform built with TanStack Start☆126Updated this week
- Talon docs.☆14Mar 2, 2020Updated 6 years ago
- Structured, temporal memory for AI agents.☆85May 18, 2026Updated last month
- ☆17Feb 12, 2025Updated last year