A tool for model sparse based on torch.fx
★13 · Jun 3, 2024 · Updated last year
Alternatives and similar repositories for msbench
Users that are interested in msbench are comparing it to the libraries listed below.
- [ICML 2025] This is the official PyTorch implementation of "🎵 HarmoniCa: Harmonizing Training and Inference for Better Feature Caching i… ★45 · Jul 10, 2025 · Updated 8 months ago
- EXL2 quantization generalized to other models. ★10 · Mar 17, 2024 · Updated 2 years ago
- The code for Joint Neural Architecture Search and Quantization ★14 · Apr 10, 2019 · Updated 6 years ago
- ★11 · Jan 10, 2025 · Updated last year
- Offline Quantization Tools for Deploy. ★144 · Dec 28, 2023 · Updated 2 years ago
- [CVPR 2025] DreamRelation: Bridging Customization and Relation Generation ★19 · Dec 17, 2025 · Updated 3 months ago
- [EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models. ★696 · Apr 1, 2026 · Updated last week
- ★21 · Feb 11, 2022 · Updated 4 years ago
- [CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for… ★109 · Sep 29, 2025 · Updated 6 months ago
- ★12 · Sep 20, 2018 · Updated 7 years ago
- This repository contains code and diagram for human following robot project ★12 · Nov 1, 2021 · Updated 4 years ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod… ★39 · Mar 11, 2024 · Updated 2 years ago
- ★12 · Jan 10, 2023 · Updated 3 years ago
- TOKEN-IMPORTANCE GUIDED DIRECT PREFERENCE OPTIMIZATION ★25 · Jan 26, 2026 · Updated 2 months ago
- Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM ★14 · Dec 27, 2023 · Updated 2 years ago
- A collection of research papers on low-precision training methods ★65 · May 10, 2025 · Updated 10 months ago
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/ ★11 · Oct 26, 2025 · Updated 5 months ago
- NART = NART is not A RunTime, a deep learning inference framework. ★37 · Mar 2, 2023 · Updated 3 years ago
- QuIP quantization ★64 · Mar 17, 2024 · Updated 2 years ago
- Implementing LRP (Layer-wise Relevance Propagation) for a sequence-to-sequence model with GRU layers. ★12 · Sep 8, 2023 · Updated 2 years ago
- [PR 2024] HTQ: Exploring the High-Dimensional Trade-Off of Mixed-Precision Quantization ★12 · Jul 16, 2024 · Updated last year
- Read-only mirror of https://github.com/openjdk/jdk17u/ ★12 · Apr 1, 2026 · Updated last week
- demo about the usage of tvm. ★12 · Jan 31, 2019 · Updated 7 years ago
- DL Backtrace is a new explainability technique for deep learning models that works for any modality and model type. ★25 · Updated this week
- ★13 · Jun 16, 2024 · Updated last year
- ★13 · Feb 16, 2022 · Updated 4 years ago
- My first project: a smart robot based on ROS with 2D lidar sensors and RGB-D camera ★12 · Jul 7, 2019 · Updated 6 years ago
- Spring Petclinic Microservices with AI on Azure Container Apps ★13 · Jan 26, 2026 · Updated 2 months ago
- hisi3519v101,fast-mtcnn,opencv,face detection ★16 · Oct 10, 2018 · Updated 7 years ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation. ★11 · Aug 20, 2024 · Updated last year
- A practical example showing how to develop your own custom Spring Cloud Stream Binder ★10 · May 22, 2022 · Updated 3 years ago
- a linux OS on MIPS r3000 chip ★10 · Jul 5, 2019 · Updated 6 years ago
- A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag… ★205 · Feb 10, 2025 · Updated last year
- Summary of system papers/frameworks/codes/tools on training or serving large model ★57 · Dec 17, 2023 · Updated 2 years ago
- An implementation of DSOD in PyTorch ★15 · Jul 13, 2018 · Updated 7 years ago
- this is an integration of all the Spring Native hints that don't yet have another home ★14 · Jan 11, 2026 · Updated 2 months ago
- Multi target people tracker for mobile robots. It uses multiple detector modalities, is based on particle filtering and outputs a set of … ★16 · Nov 2, 2016 · Updated 9 years ago
- All-in-One Safety Evaluation Framework ★47 · Mar 4, 2026 · Updated last month
- A buildpack for translating a Procfile into Process Types ★21 · Mar 13, 2026 · Updated 3 weeks ago