sophgo/tpu-mq

Model Quantization Benchmark

☆20

Alternatives and similar repositories for tpu-mq

Users that are interested in tpu-mq are comparing it to the libraries listed below.

GATECH-EIC / ShiftAddViT
[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
☆30 · Dec 6, 2023 · Updated 2 years ago
insuhan / calibquant
☆21 · Apr 3, 2025 · Updated last year
ModelTC / quant_horizon
☆11 · Jan 10, 2025 · Updated last year
IST-DASLab / gemm-fp8
High Performance FP8 GEMM Kernels for SM89 and later GPUs.
☆21 · Jan 24, 2025 · Updated last year
sophgo / tpu-mlir
Machine learning compiler based on MLIR for Sophgo TPU.
☆949 · Updated this week
megvii-research / IntLLaMA
IntLLaMA: A fast and light quantization solution for LLaMA
☆19 · Jul 21, 2023 · Updated 3 years ago
ZillaRU / SD-lcm-tpu
Stable Diffusion + LCM on SG2300X: silky-smooth image generation in one second
☆17 · Nov 29, 2024 · Updated last year
PannenetsF / TQT
TQT's pytorch implementation.
☆22 · Dec 17, 2021 · Updated 4 years ago
BillAmihom / RAPQ
Pytorch implementation of RAPQ, IJCAI 2022
☆23 · Jul 19, 2023 · Updated 3 years ago
PolyArch / dsagen2
Domain-Specific Architecture Generator 2
☆26 · Oct 2, 2022 · Updated 3 years ago
jakc4103 / piecewise-quantization
PyTorch implementation of Near-Lossless Post-Training Quantization of Deep Neural Networks via a Piecewise Linear Approximation
☆23 · Feb 17, 2020 · Updated 6 years ago
syshensyshen / pva-mobilenet-v2
Training mobilenet-v2 with the pvanet framework for object detection; paper: https://arxiv.org/abs/1611.08588
☆13 · Feb 13, 2019 · Updated 7 years ago
imyhxy / ccocotools
This is a C++ implementation of cocoapi bbox evaluation code.
☆11 · Dec 9, 2021 · Updated 4 years ago
kimp01 / compression-and-regularisation
☆12 · Dec 23, 2019 · Updated 6 years ago
luckynote / ibug_300W_make_landmarks_tools
Make new landmarks, or landmarks with fewer points, from the dlib ibug_300W_large_face_landmark_dataset.
☆10 · Nov 11, 2018 · Updated 7 years ago
XJTUWYD / DoReFa_Cifar10
Implementation of DoReFaNet in TensorFlow, based on the CIFAR-10 dataset
☆28 · Nov 8, 2017 · Updated 8 years ago
ModelTC / Dipoorlet
Offline Quantization Tools for Deploy.
☆143 · Dec 28, 2023 · Updated 2 years ago
JasonPlawinski / SuperResolution
PyTorch implementation of an SRGAN with a regularization loss to stabilize GAN training. Work presented at the Japanese conference MIRU.
☆12 · Oct 17, 2018 · Updated 7 years ago
FrancescoB-Vintra / fp16tensorRT
TensorRT half precision inference routine on an API-based TensorRT model
☆12 · Jul 3, 2018 · Updated 8 years ago
rog93 / PS-PreciseRoIPooling
Position sensitive PreciseRoIPooling without roi coordinates gradient backward
☆16 · Aug 2, 2018 · Updated 7 years ago
Ther-nullptr / circult-eda-mlsys-tinyml-arxiv-daily
🎓 Automatically update circult-eda-mlsys-tinyml papers daily using GitHub Actions (updates every 8 hours)
☆10 · Jul 13, 2026 · Updated last week
GoatWu / AdaLog
[ECCV 2024] AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer
☆43 · Dec 9, 2024 · Updated last year
DensoITLab / bitprune
☆11 · Apr 5, 2023 · Updated 3 years ago
Cloveryww / make-augmented-PascalVOC-semantic-segmentation-dataset-tool
☆10 · Apr 10, 2019 · Updated 7 years ago
HuangOwen / QAT-ACS
[TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"
☆38 · Aug 20, 2024 · Updated last year
chinthysl / AdderNetTensorRT
Nvidia TensorRT implementation of AdderNet for edge deployment
☆10 · Nov 19, 2020 · Updated 5 years ago
VITA-Group / AutoPose
Code for "AutoPose: Searching Multi-Scale Branch Aggregation for Pose Estimation"
☆10 · Dec 30, 2021 · Updated 4 years ago
Sabokrou / NRE
☆11 · Aug 16, 2019 · Updated 6 years ago
tobna / TaylorShift
This repository contains the code for the paper "TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back)…
☆15 · Feb 25, 2026 · Updated 4 months ago
ZouJiu1 / numpy_transformer
A transformer implemented in numpy: vision transformer (ViT), MNIST test-set precision = 97.2%, multi-attention, patch embed, position embed, fu…
☆12 · Mar 4, 2026 · Updated 4 months ago
ray-project / serve_config_examples
☆12 · Mar 16, 2026 · Updated 4 months ago
JKay0327 / whisper-TPU_py
A whisper repo for TPU
☆11 · Jun 4, 2024 · Updated 2 years ago
GATECH-EIC / HALO
The official code for [ECCV2020] "HALO: Hardware-aware Learning to Optimize"
☆10 · Mar 22, 2023 · Updated 3 years ago
SeuTao / MsCelebGAN
Training multi resolution GAN using MsCeleb datasets.
☆12 · May 2, 2018 · Updated 8 years ago
enjoy-digital / litex_verilog_axi_test
Integration test of Verilog AXI modules (https://github.com/alexforencich/verilog-axi) with LiteX.
☆17 · Dec 19, 2022 · Updated 3 years ago
DoctorKey / Practise
[CVPR2023] Practical Network Acceleration with Tiny Sets
☆13 · Jul 28, 2023 · Updated 2 years ago
mostafaelhoushi / DeepShift
Implementation of "DeepShift: Towards Multiplication-Less Neural Networks" https://arxiv.org/abs/1905.13298
☆113 · Nov 22, 2021 · Updated 4 years ago
lucia123 / lapa-dataset
A large-scale dataset for face parsing (AAAI2020)
☆13 · Jan 4, 2021 · Updated 5 years ago
chenjun2hao / Self_cross_entropy
Write a cross_entropy function in pytorch to remove the abnormal nan value
☆10 · Aug 22, 2019 · Updated 6 years ago