☆121Feb 19, 2026Updated last month
Alternatives and similar repositories for Quark
Users that are interested in Quark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repository contains low-bit quantization papers from 2020 to 2025 on top conference.☆126Mar 5, 2026Updated 3 weeks ago
- Awesome Quantization Paper lists with Codes☆10Feb 24, 2021Updated 5 years ago
- AiTer Optimized Model☆49Updated this week
- High Performance FP8 GEMM Kernels for SM89 and later GPUs.☆20Jan 24, 2025Updated last year
- EDA toolchain for processing-in-memory architectures, including an architecture synthesizer, a compiler, and a simulator☆19Jun 12, 2025Updated 9 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A mobile app for controlling Pulsetto devices☆21Jan 9, 2026Updated 2 months ago
- ☆20Aug 21, 2025Updated 7 months ago
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…☆51Oct 21, 2023Updated 2 years ago
- HALO: Hadamard-Assisted Low-Precision Optimization and Training method for finetuning LLMs. 🚀 The official implementation of https://arx…☆29Feb 17, 2025Updated last year
- List of papers related to neural network quantization in recent AI conferences and journals.☆813Mar 27, 2025Updated last year
- Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling”☆140Mar 7, 2026Updated 2 weeks ago
- Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers".☆11Feb 5, 2024Updated 2 years ago
- Code Implementation for "NASH: A Simple Unified Framework of Structured Pruning for Accelerating Encoder-Decoder Language Models" (EMNLP …☆17Oct 17, 2023Updated 2 years ago
- Bjontegaard metric calculation. Include BD-PSNR and BD-rate☆13Sep 4, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- [DEPRECATED] Moved to ROCm/rocm-libraries repo☆139Updated this week
- ☆20Nov 26, 2025Updated 4 months ago
- Development repository for the Triton language and compiler☆143Mar 19, 2026Updated last week
- Fast low-bit matmul kernels in Triton☆438Feb 1, 2026Updated last month
- Official Code of The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks[ICML2022]☆17Sep 20, 2022Updated 3 years ago
- Github Repo for OATS: Outlier-Aware Pruning through Sparse and Low Rank Decomposition☆18Apr 16, 2025Updated 11 months ago
- O'Reilly Course, In-Memory Computing Essentials☆10Oct 16, 2020Updated 5 years ago
- ☆11Apr 5, 2023Updated 2 years ago
- Research about dataflow architecture☆12Nov 30, 2023Updated 2 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 360zhiano2☆11Dec 3, 2024Updated last year
- ☆169Mar 9, 2023Updated 3 years ago
- ☆10Nov 16, 2024Updated last year
- ☆27Mar 29, 2025Updated 11 months ago
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…☆95Feb 20, 2026Updated last month
- NTAPI系统关键进程视频源代码☆11Aug 23, 2022Updated 3 years ago
- Training with Block Minifloat number representation☆18May 2, 2021Updated 4 years ago
- This is the official pytorch implementation for the paper: Towards Accurate Post-training Quantization for Diffusion Models.(CVPR24 Poste…☆38Jun 4, 2024Updated last year
- Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.☆18Feb 9, 2026Updated last month
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Persistent dense gemm for Hopper in `CuTeDSL`☆15Aug 9, 2025Updated 7 months ago
- [ICML2025] LoRA fine-tune directly on the quantized models.☆39Nov 25, 2024Updated last year
- 2D and 3D Graph Calculator☆33Mar 13, 2026Updated 2 weeks ago
- Official Repo for SparseLLM: Global Pruning of LLMs (NeurIPS 2024)☆67Mar 27, 2025Updated last year
- Frequently updated list of dLLM (Diffusion Large Language Models) papers, models, and other resources☆24Jan 30, 2026Updated last month
- 一个基于AXI接口的PL端卷积加速器,可由PS端调用☆12Apr 15, 2023Updated 2 years ago
- [ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.☆134May 16, 2024Updated last year