huawei-csl / SINQ
Welcome to the official repository of SINQ, a novel, fast, and high-quality quantization method designed to make any Large Language Model smaller while preserving accuracy.
☆595Updated this week
Alternatives and similar repositories for SINQ
Users that are interested in SINQ are comparing it to the libraries listed below
- ☆22Aug 9, 2024Updated last year
- ☆21Jan 25, 2025Updated last year
- Lightweight Neural Architecture Search for Temporal Convolutional Networks at the Edge☆10Mar 6, 2023Updated 2 years ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆45Jun 13, 2023Updated 2 years ago
- Single-file, pure CUDA C implementation for running inference on Qwen3 0.6B GGUF. No Dependencies.☆22Nov 26, 2025Updated 2 months ago
- EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆27Jul 30, 2025Updated 6 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- ☆11Feb 20, 2025Updated 11 months ago
- Makes llama.cpp easy to use.☆12May 14, 2025Updated 9 months ago
- AI in A Box☆25Jan 20, 2026Updated 3 weeks ago
- With dri3 we can configure in ~/.drirc which GPU a program with a given name should be rendered on. This is a small utility to make this p…☆10Oct 21, 2016Updated 9 years ago
- A chat UI for Llama.cpp☆15Dec 2, 2025Updated 2 months ago
- Home of ALP/GraphBLAS and ALP/Pregel, featuring shared- and distributed-memory auto-parallelisation of linear algebraic and vertex-centri…☆33Updated this week
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆36Jul 2, 2025Updated 7 months ago
- This Streamlit application allows users to upload images and engage in interactive conversations about them using the Ollama Vision Model…☆15Nov 11, 2024Updated last year
- See vLLM official support: https://github.com/vllm-project/vllm-ascend☆11Feb 5, 2025Updated last year
- ☆20Sep 20, 2025Updated 4 months ago
- 🤖 AI-powered CLI for file reorganization. Runs fully locally — no data leaves your machine.☆19Jul 2, 2025Updated 7 months ago
- Watch for file changes and auto restart an application using fork checkpoints to continue the process (for quick live development)☆13Dec 30, 2021Updated 4 years ago
- This repository contains the training code of ParetoQ introduced in our work "ParetoQ Scaling Laws in Extremely Low-bit LLM Quantization"☆118Oct 15, 2025Updated 4 months ago
- Run Ollama LLM models in Google Colab for free☆37Nov 24, 2024Updated last year
- MLIR tools and dialect for GraphBLAS☆18Mar 30, 2022Updated 3 years ago
- A C++ framework for efficient training & fine-tuning of LLMs☆27Updated this week
- Hierarchical roles add-on plugin for Members.☆15Feb 11, 2020Updated 6 years ago
- Lightning Training strategy for HiveMind☆18Jan 20, 2026Updated 3 weeks ago
- Make new tmux windows and panes inherit the currently active conda environment.☆18Dec 22, 2025Updated last month
- World's most accurate password guessing AI tool. A PyTorch implementation of PassLLM (USENIX 2025) that leverages PII and LoRA fine-tunin…☆38Feb 4, 2026Updated last week
- LLM FX: An LLM Server Desktop Client free for everyone!☆33Dec 19, 2025Updated last month
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆30Jan 23, 2026Updated 3 weeks ago
- Append-only key-value database on a distributed shared-log☆52Aug 14, 2024Updated last year
- ☆19Nov 28, 2024Updated last year
- A fully autonomous agent that accesses the browser and performs tasks.☆17Apr 25, 2025Updated 9 months ago
- A Plug-and-play Lightweight tool for the Inference Optimization of Deep Neural networks☆47Oct 27, 2025Updated 3 months ago
- ☆20Sep 28, 2024Updated last year
- Cross-Platform High-Level LLM Library☆43Feb 5, 2026Updated last week
- A Go wrapper around the rwkv.cpp library☆20Mar 4, 2024Updated last year
- Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"☆20Jun 11, 2025Updated 8 months ago
- A WordPress plugin that adds a button in the editor sidebar to show the raw post data as well as taxonomy and custom field data☆20Nov 19, 2023Updated 2 years ago
- Control your computer with a voice interface☆28Nov 12, 2025Updated 3 months ago