Efficient GPU kernels for mixed-precision Vision Transformers in Triton
☆18Sep 18, 2025Updated 5 months ago
Alternatives and similar repositories for qattn
Users that are interested in qattn are comparing it to the libraries listed below
Sorting:
- ☆28Nov 5, 2021Updated 4 years ago
- This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is…☆24Aug 17, 2021Updated 4 years ago
- BitSplit Post-trining Quantization☆50Dec 20, 2021Updated 4 years ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…☆31Mar 12, 2024Updated last year
- Speedup the attention computation of Swin Transformer☆31Jun 14, 2025Updated 8 months ago
- DAC System Design Contest 2020☆29Jun 11, 2020Updated 5 years ago
- [CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric☆60Mar 23, 2023Updated 2 years ago
- [TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"☆37Aug 20, 2024Updated last year
- code for the paper "A Statistical Framework for Low-bitwidth Training of Deep Neural Networks"☆29Oct 31, 2020Updated 5 years ago
- Pytorch implementation for FAT: learning low-bitwidth parametric representation via frequency-aware transformation☆27May 2, 2021Updated 4 years ago
- The official implementation of the ICML 2023 paper OFQ-ViT☆39Oct 3, 2023Updated 2 years ago
- CAVEDU出版之Jetson Nano書籍範例☆10Jul 12, 2021Updated 4 years ago
- source code of the paper: Robust Quantization: One Model to Rule Them All☆41Mar 24, 2023Updated 2 years ago
- ☆35Mar 4, 2020Updated 5 years ago
- A simple set of commands to manage three levels of modified context and link world info where necessary.☆10May 16, 2021Updated 4 years ago
- Weighted Masks Fusion (WMF) - Ensembling for Instance Segmentation.☆11Aug 1, 2023Updated 2 years ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆26Jun 16, 2025Updated 8 months ago
- A MATLAB tool for material decomposition based on dual energy microCT scanning☆10Jul 6, 2017Updated 8 years ago
- PyTorch Quantization Framework For OCP MX Datatypes.☆16May 30, 2025Updated 9 months ago
- An active RFID system for locating free roaming pets / wildlife.☆10Jul 22, 2023Updated 2 years ago
- A collection python tools used to create gguf files and upload to huggingface☆17Feb 20, 2026Updated last week
- An Extension for Automatic1111 Webui that makes the interface easier to use on mobile (portrait)☆16Apr 16, 2024Updated last year
- A flexible utility for converting tensor precision in PyTorch models and safetensors files, enabling efficient deployment across various …☆11Aug 24, 2023Updated 2 years ago
- An Tensorflow.keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio…☆10Dec 18, 2019Updated 6 years ago
- Recyclable Robotic Sorting with Custom Synthetic Dataset and Annotations☆12Apr 9, 2021Updated 4 years ago
- Automated creation of Native Boot (VHDx) installations, using just a Windows Setup image (.iso).☆14Apr 2, 2017Updated 8 years ago
- Aggregation framework for annotating datasets in computer vision tasks (detection, segmentation, video captioning etc.)☆11Nov 6, 2024Updated last year
- ODLabel is a powerful tool for zero-shot object detection, labeling and visualization. It provides an intuitive graphical user interface …☆10May 19, 2024Updated last year
- simple UCI chess engine written by self learner from scratch☆11Sep 16, 2020Updated 5 years ago
- A curated collection of AI coding tools, tutorials, and resources to get started with AI assisted coding.☆16Feb 15, 2025Updated last year
- This repository teaches how to train, evaluate and deploy ML models using MLFlow☆13Oct 23, 2024Updated last year
- Multi-turn dataset management tool for LLM trainers☆12Mar 31, 2025Updated 11 months ago
- Make windows installer 🪟 for flutter powered apps💻.☆13Jul 6, 2024Updated last year
- A common protocol for AI agent tools☆10Oct 21, 2024Updated last year
- 北航校园网网关自动登录☆10Nov 8, 2021Updated 4 years ago
- Express DLA implementation for FPGA, revised based on NVDLA.☆11Oct 17, 2019Updated 6 years ago
- PyTorch code for full quantization of DNN using BCGD☆14Jul 24, 2019Updated 6 years ago
- Faster and cheaper! parallel processing of Anthropic API requests, optimizing for speed, cost-efficiency, and rate limit compliance.☆15Oct 1, 2024Updated last year
- CLI interface to the TGBOX☆11Feb 24, 2026Updated last week