A tool for model sparse based on torch.fx
β13Jun 3, 2024Updated last year
Alternatives and similar repositories for msbench
Users that are interested in msbench are comparing it to the libraries listed below
Sorting:
- [ICML 2025] This is the official PyTorch implementation of "π΅ HarmoniCa: Harmonizing Training and Inference for Better Feature Caching iβ¦β45Jul 10, 2025Updated 8 months ago
- EXL2 quantization generalized to other models.β10Mar 17, 2024Updated 2 years ago
- The code for Joint Neural Architecture Search and Quantizationβ14Apr 10, 2019Updated 6 years ago
- β11Jan 10, 2025Updated last year
- Offline Quantization Tools for Deploy.β142Dec 28, 2023Updated 2 years ago
- [CVPR 2025] DreamRelation: Bridging Customization and Relation Generationβ19Dec 17, 2025Updated 3 months ago
- [EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.β688Mar 11, 2026Updated last week
- β21Feb 11, 2022Updated 4 years ago
- [CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization forβ¦β108Sep 29, 2025Updated 5 months ago
- β12Sep 20, 2018Updated 7 years ago
- This repository contains code and diagram for human following robot projectβ11Nov 1, 2021Updated 4 years ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Modβ¦β39Mar 11, 2024Updated 2 years ago
- β12Jan 10, 2023Updated 3 years ago
- Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLMβ14Dec 27, 2023Updated 2 years ago
- A collection of research papers on low-precision training methodsβ64May 10, 2025Updated 10 months ago
- NART = NART is not A RunTime, a deep learning inference framework.β37Mar 2, 2023Updated 3 years ago
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/β11Oct 26, 2025Updated 4 months ago
- QuIP quantizationβ62Mar 17, 2024Updated 2 years ago
- Implementing LRP (Layer-wise Relevance Propagation) for a sequence-to-sequence model with GRU layers.β12Sep 8, 2023Updated 2 years ago
- [PR 2024] HTQ: Exploring the High-Dimensional Trade-Off of Mixed-Precision Quantizationβ12Jul 16, 2024Updated last year
- Read-only mirror of https://github.com/openjdk/jdk17u/β12Updated this week
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.β24Feb 16, 2026Updated last month
- demo about the usage of tvm.β12Jan 31, 2019Updated 7 years ago
- β13Jun 16, 2024Updated last year
- β13Feb 16, 2022Updated 4 years ago
- My first project: a smart robot based on ROS with 2D lidar sensors and RGB-D cameraβ12Jul 7, 2019Updated 6 years ago
- Spring Petclinic Microservices with AI on Azure Container Appsβ13Jan 26, 2026Updated last month
- hisi3519v101,fast-mtcnn,opencv,face detectionβ16Oct 10, 2018Updated 7 years ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.β11Aug 20, 2024Updated last year
- A practical example showing how to develop your own custom Spring Cloud Stream Binderβ10May 22, 2022Updated 3 years ago
- a linux OS on MIPS r3000 chipβ10Jul 5, 2019Updated 6 years ago
- PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Modelsβ44Updated this week
- A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languagβ¦β203Feb 10, 2025Updated last year
- Summary of system papers/frameworks/codes/tools on training or serving large modelβ57Dec 17, 2023Updated 2 years ago
- An implementation of DSOD in Pytonchβ15Jul 13, 2018Updated 7 years ago
- this is an integration of all the Spring Native hints that don't yet have another homeβ14Jan 11, 2026Updated 2 months ago
- Multi target people tracker for mobile robots. It uses multiple detector modalities, is based on particle filtering and outputs a set of β¦β16Nov 2, 2016Updated 9 years ago
- All-in-One Safety Evaluation Framworkβ44Mar 4, 2026Updated 2 weeks ago
- A buildpack for translating a Procfile into Process Typesβ21Updated this week