☆114 · Feb 19, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for Quark
Users that are interested in Quark are comparing it to the libraries listed below
- Awesome quantization paper lists with code ☆10 · Feb 24, 2021 · Updated 5 years ago
- This repository contains low-bit quantization papers from 2020 to 2025 published at top conferences. ☆103 · Feb 13, 2026 · Updated 3 weeks ago
- EDA toolchain for processing-in-memory architectures, including an architecture synthesizer, a compiler, and a simulator ☆18 · Jun 12, 2025 · Updated 8 months ago
- High Performance FP8 GEMM Kernels for SM89 and later GPUs. ☆20 · Jan 24, 2025 · Updated last year
- Code for the paper “Four Over Six: More Accurate NVFP4 Quantization with Adaptive Block Scaling” ☆130 · Updated this week
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti… ☆50 · Oct 21, 2023 · Updated 2 years ago
- List of papers related to neural network quantization in recent AI conferences and journals. ☆806 · Mar 27, 2025 · Updated 11 months ago
- Spotify modded APK repo before Revanced or XManager do them. ☆19 · Jun 27, 2025 · Updated 8 months ago
- Development repository for the Triton language and compiler ☆141 · Feb 27, 2026 · Updated last week
- All homework for Peking University's Embedded Microprocessor System course ☆10 · Dec 28, 2021 · Updated 4 years ago
- This is the official pytorch implementation for the paper: Towards Accurate Post-training Quantization for Diffusion Models. (CVPR24 Poste… ☆38 · Jun 4, 2024 · Updated last year
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation ☆26 · Jun 16, 2025 · Updated 8 months ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization" ☆11 · Mar 31, 2024 · Updated last year
- Demo for Self-contained Systems (SCS) using jQuery for Frontend Integration ☆10 · May 26, 2023 · Updated 2 years ago
- ☆97 · Feb 26, 2026 · Updated last week
- ☆12 · Jan 6, 2023 · Updated 3 years ago
- ☆169 · Mar 9, 2023 · Updated 2 years ago
- List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals. ☆102 · Jun 2, 2024 · Updated last year
- ☆10 · Nov 16, 2024 · Updated last year
- A simple Windows program to put monitors into sleep mode ☆11 · Jan 22, 2025 · Updated last year
- Parse a `git log` output of a repository into an object with useful commit data. Supports plugins, streaming, promises and callback APIs. ☆12 · Jan 12, 2023 · Updated 3 years ago
- Automatic login for the Beihang University (BUAA) campus network gateway ☆10 · Nov 8, 2021 · Updated 4 years ago
- Stochastic Machines for Unsupervised Learning implemented in Pytorch. ☆10 · Sep 3, 2017 · Updated 8 years ago
- ☆11 · Apr 16, 2023 · Updated 2 years ago
- Express DLA implementation for FPGA, revised based on NVDLA. ☆11 · Oct 17, 2019 · Updated 6 years ago
- BFloat16 Fused Adam Operator for PyTorch ☆16 · Nov 16, 2024 · Updated last year
- A PyTorch Lightning template to try out a wide range of ideas on the Ubiquant Market Prediction competition without modifying any code! ☆12 · Mar 24, 2022 · Updated 3 years ago
- Training Quantized Neural Networks with a Full-precision Auxiliary Module ☆13 · Jun 19, 2020 · Updated 5 years ago
- OpenAI GPT model to build your personal assistant in IoT devices. Just like Alexa, Google Assistant, Siri, etc. but with your own skills,… ☆12 · Aug 7, 2023 · Updated 2 years ago
- Self-contained, zero-dependency Python lib that gives you unified device properties for GPU, CPU, and NPU. No more calling separat… ☆15 · Dec 12, 2025 · Updated 2 months ago
- The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize" ☆10 · Mar 22, 2023 · Updated 2 years ago
- Persistent dense gemm for Hopper in `CuTeDSL` ☆15 · Aug 9, 2025 · Updated 6 months ago
- Learning programs with the Exploration-Compression algorithm ☆10 · May 17, 2018 · Updated 7 years ago
- Codes for our paper "Exploring Bit-Slice Sparsity in Deep Neural Networks for Efficient ReRAM-Based Deployment" [NeurIPS'19 EMC2 workshop]… ☆10 · Oct 12, 2020 · Updated 5 years ago
- ☆11 · Apr 5, 2023 · Updated 2 years ago
- ☆12 · Feb 5, 2024 · Updated 2 years ago
- 360zhiano2 ☆11 · Dec 3, 2024 · Updated last year
- Implementation of NM sparsity recipe presented in the paper "Progressive Gradient Flow for Robust N:M Sparsity Training in Transformers". ☆11 · Feb 5, 2024 · Updated 2 years ago
- O'Reilly Course, In-Memory Computing Essentials ☆10 · Oct 16, 2020 · Updated 5 years ago