Mixed-precision quantization for LLMs. Every layer refracts into a different format based on its sensitivity. Native compressed-tensors export, validated on Qwen3.6-35B-A3B MoE with MTP speculative decoding.
☆69May 16, 2026Updated this week
Alternatives and similar repositories for prismaquant
Users that are interested in prismaquant are comparing it to the libraries listed below.
- ☆68Feb 27, 2026Updated 2 months ago
- sparkrun - launch, manage, and stop LLM inference workloads on NVIDIA DGX Spark systems☆203May 9, 2026Updated last week
- Get progress information for an ffmpeg process.☆18Apr 26, 2026Updated 3 weeks ago
- A Gentle Introduction to ROS Examples in Rospy☆15Aug 28, 2014Updated 11 years ago
- A fork which adds a UI to the original deep-research tool☆11Feb 8, 2025Updated last year
- Iterator, Result and Option written in Rust, for Python☆49Updated this week
- Your CLAUDE.md stopped working at 200 lines. Generate scoped skill files from your import graph, auto-sync on every commit. Claude Code a…☆81May 11, 2026Updated last week
- Thank you LenAnderson I am yoinking this!☆27May 3, 2026Updated 2 weeks ago
- Voice Assistant using Google's Gemini API Key☆22Jul 25, 2024Updated last year
- Kiwix ZIM-to-vector RAG system for local, offline LLM knowledge retrieval☆19Mar 24, 2026Updated last month
- Swift binding for nanomsg☆14Jan 16, 2018Updated 8 years ago
- The complete Neo ecosystem for macOS/Linux☆11Feb 20, 2019Updated 7 years ago
- Learn faster with the power of AI☆17Updated this week
- A dynamic multi-expert AI architecture running on a single consumer GPU (RTX 3060).☆36Dec 2, 2025Updated 5 months ago
- ZeroMQ Swift bindings☆13Oct 13, 2016Updated 9 years ago
- ☆25Oct 13, 2025Updated 7 months ago
- ☆31Jan 2, 2026Updated 4 months ago
- Ultimate Persona is an all-in-one persona generator and plot hook creator for SillyTavern. It uses pre-existing character cards to shape …☆38Dec 30, 2025Updated 4 months ago
- LZW compression algorithm in Verilog☆17Dec 19, 2013Updated 12 years ago
- Professional desktop app for converting text to audiobooks with local TTS☆33Oct 6, 2025Updated 7 months ago
- ☆45Jan 26, 2026Updated 3 months ago
- ☆107Mar 6, 2026Updated 2 months ago
- A senior microsoldering technician, available to every repair shop — from the seasoned pro to the apprentice. Powered by Claude Opus 4.7.☆132May 4, 2026Updated 2 weeks ago
- ☆14Sep 4, 2024Updated last year
- A tool-call based memory system for SillyTavern☆36Dec 30, 2025Updated 4 months ago
- DevSecOps tools container, for use in local development and as a builder/runner in CI/CD pipelines. Not to be used to run production work…☆13Oct 21, 2022Updated 3 years ago
- Loader extension for tabbyAPI in SillyTavern☆26Jun 30, 2025Updated 10 months ago
- USB interface for FPGA using the Cypress FX3☆18Mar 24, 2020Updated 6 years ago
- A dedicated effort to make an optimized, bleeding edge vLLM image using Docker to support DGX comprehensively☆112Feb 22, 2026Updated 2 months ago
- An attempt at UI development. Clone of https://www.perplexity.ai/☆11Sep 30, 2023Updated 2 years ago
- AIMNet2: Fast and accurate machine-learned interatomic potential for molecular dynamics simulations☆86May 3, 2026Updated 2 weeks ago
- An Easy-to-Use RPI Pico PIO Emulator☆27May 8, 2023Updated 3 years ago
- A 4-DOF serial robotic arm designed for pick-and-place☆24Oct 14, 2023Updated 2 years ago
- Swift server side - push notification. APNS and GCM☆20Nov 24, 2018Updated 7 years ago
- Implements harmful/harmless refusal removal using pure HF Transformers☆21May 8, 2025Updated last year
- ☆31Jan 13, 2026Updated 4 months ago
- 🧬 Viral genome reference alignment☆12Jan 26, 2021Updated 5 years ago
- xLSTMAD - Powerful xLSTM based Method for Anomaly Detection☆17Apr 27, 2026Updated 3 weeks ago
- ☆30Feb 18, 2025Updated last year