A tool for model sparse based on torch.fx
β13Jun 3, 2024Updated last year
Alternatives and similar repositories for msbench
Users that are interested in msbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2025] This is the official PyTorch implementation of "π΅ HarmoniCa: Harmonizing Training and Inference for Better Feature Caching iβ¦β45Jul 10, 2025Updated 9 months ago
- EXL2 quantization generalized to other models.β10Mar 17, 2024Updated 2 years ago
- The code for Joint Neural Architecture Search and Quantizationβ14Apr 10, 2019Updated 7 years ago
- β11Jan 10, 2025Updated last year
- Offline Quantization Tools for Deploy.β144Dec 28, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [CVPR 2025] DreamRelation: Bridging Customization and Relation Generationβ19Dec 17, 2025Updated 4 months ago
- [EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.β710Apr 1, 2026Updated 3 weeks ago
- β21Feb 11, 2022Updated 4 years ago
- [CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization forβ¦β108Sep 29, 2025Updated 7 months ago
- β12Sep 20, 2018Updated 7 years ago
- This repository contains code and diagram for human following robot projectβ12Nov 1, 2021Updated 4 years ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Modβ¦β39Mar 11, 2024Updated 2 years ago
- β12Jan 10, 2023Updated 3 years ago
- Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLMβ14Dec 27, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A collection of research papers on low-precision training methodsβ66May 10, 2025Updated 11 months ago
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/β11Oct 26, 2025Updated 6 months ago
- NART = NART is not A RunTime, a deep learning inference framework.β37Mar 2, 2023Updated 3 years ago
- QuIP quantizationβ64Mar 17, 2024Updated 2 years ago
- Implementing LRP (Layer-wise Relevance Propagation) for a sequence-to-sequence model with GRU layers.β12Sep 8, 2023Updated 2 years ago
- Read-only mirror of https://github.com/openjdk/jdk17u/β12Apr 22, 2026Updated last week
- [PR 2024] HTQ: Exploring the High-Dimensional Trade-Off of Mixed-Precision Quantizationβ12Jul 16, 2024Updated last year
- demo about the usage of tvm.β12Jan 31, 2019Updated 7 years ago
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.β25Apr 21, 2026Updated last week
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- β13Jun 16, 2024Updated last year
- β13Feb 16, 2022Updated 4 years ago
- My first project: a smart robot based on ROS with 2D lidar sensors and RGB-D cameraβ12Jul 7, 2019Updated 6 years ago
- Spring Petclinic Microservices with AI on Azure Container Appsβ13Jan 26, 2026Updated 3 months ago
- hisi3519v101,fast-mtcnn,opencv,face detectionβ16Oct 10, 2018Updated 7 years ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.β11Aug 20, 2024Updated last year
- A practical example showing how to develop your own custom Spring Cloud Stream Binderβ11May 22, 2022Updated 3 years ago
- a linux OS on MIPS r3000 chipβ10Jul 5, 2019Updated 6 years ago
- A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languagβ¦β205Feb 10, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Summary of system papers/frameworks/codes/tools on training or serving large modelβ57Dec 17, 2023Updated 2 years ago
- An implementation of DSOD in Pytonchβ15Jul 13, 2018Updated 7 years ago
- this is an integration of all the Spring Native hints that don't yet have another homeβ14Jan 11, 2026Updated 3 months ago
- Multi target people tracker for mobile robots. It uses multiple detector modalities, is based on particle filtering and outputs a set of β¦β16Nov 2, 2016Updated 9 years ago
- All-in-One Safety Evaluation Framworkβ48Apr 21, 2026Updated last week
- Dynamic, ORB-SLAM, MaskRCNNβ16Jun 7, 2020Updated 5 years ago
- A buildpack for translating a Procfile into Process Typesβ21Updated this week