Codebase for the Progressive Mixed-Precision Decoding paper.
☆19Jul 15, 2025Updated 7 months ago
Alternatives and similar repositories for PMPD
Users that are interested in PMPD are comparing it to the libraries listed below
Sorting:
- Tutorial of how to deloy DNN on android device using TFLite☆12Aug 23, 2019Updated 6 years ago
- The implementation of E3DNet☆17Jun 4, 2019Updated 6 years ago
- [ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression☆34Aug 7, 2025Updated 6 months ago
- Static Block Floating Point Quantization for CNN☆37Jun 9, 2021Updated 4 years ago
- This repository contains the code used in a publication 'Active Learning for Decision-Making from Imbalanced Observational Data', Iiris S…☆11May 14, 2019Updated 6 years ago
- Improved the performance of 8-bit PTQ4DM expecially on FID.☆11Aug 30, 2023Updated 2 years ago
- Compress BiSeNet with Structure Knowledge Distillation for Real-time image segmentation on wali-TX2☆11Jul 29, 2020Updated 5 years ago
- 机器学习实验 - 线性回归 - 预测连续值☆11Aug 11, 2017Updated 8 years ago
- Information Bottleneck in DNN with PyTorch☆15Jul 6, 2023Updated 2 years ago
- draw object rect and add some properties☆11May 28, 2018Updated 7 years ago
- ☆14Nov 30, 2023Updated 2 years ago
- ☆10Jan 21, 2018Updated 8 years ago
- Practical example using python to train a decision tree☆11Jul 27, 2016Updated 9 years ago
- An accelerator to which you can offload RE matching☆14Dec 22, 2024Updated last year
- ☆10Nov 27, 2024Updated last year
- A notebook showing how to easily convert a current notebook you have to a notebook that can be run on Kubeflow Pipelines.☆15Jul 15, 2020Updated 5 years ago
- ☆27Jan 12, 2026Updated last month
- ☆14Feb 14, 2022Updated 4 years ago
- Userspace DMA library for Zynq-based SoCs☆16Jan 22, 2019Updated 7 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆13Feb 11, 2026Updated 2 weeks ago
- ☆17Nov 18, 2025Updated 3 months ago
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs☆123Jul 4, 2025Updated 7 months ago
- kmeans for YOLO anchors☆13Jun 15, 2018Updated 7 years ago
- ☆15Jun 14, 2022Updated 3 years ago
- Python utility to convert PyTorch model weights from '.bin' to '.safetensors' format.☆17Sep 19, 2025Updated 5 months ago
- Chisel Project for Integrating RTL code into SDAccel☆17Jan 12, 2018Updated 8 years ago
- Public repostory for the DAC 2021 paper "Scaling up HBM Efficiency of Top-K SpMV forApproximate Embedding Similarity on FPGAs"☆16Aug 29, 2021Updated 4 years ago
- Artifacts for ATC '22 paper "Faster Software Packet Processing on FPGA NICs with eBPF Program Warping"☆17May 20, 2022Updated 3 years ago
- Detecting, tracking, and count the number of objects.☆15Jun 24, 2018Updated 7 years ago
- Resource Utilization and Latency Estimation for ML on FPGA.☆18Feb 4, 2026Updated 3 weeks ago
- Wrapper shells enabling designs generated by rocket-chip to map onto certain FPGA boards☆20Nov 27, 2024Updated last year
- ☆16Apr 27, 2025Updated 10 months ago
- ☆16Nov 25, 2022Updated 3 years ago
- Generate versal system design from ONNX model. AI engine kernels. Sub-microsecond speeds for autoencoders.☆16Dec 29, 2024Updated last year
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆14Nov 27, 2024Updated last year
- 浙江大学2018年冬季机器学习课程☆12Mar 21, 2019Updated 6 years ago
- Realtime Multi-Person Pose Estimation data server. Used as a training and validation data provider in training process.☆14Nov 14, 2017Updated 8 years ago
- Cost-Effective Object Detection: Active Sample Mining with Switchable Selection Criteria☆12Dec 1, 2018Updated 7 years ago
- Multi-Target Adversarial Frameworks for Domain Adaptation in Semantic Segmentation☆24Jul 12, 2022Updated 3 years ago