A tool for model sparse based on torch.fx
β13Jun 3, 2024Updated 2 years ago
Alternatives and similar repositories for msbench
Users that are interested in msbench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICML 2025] This is the official PyTorch implementation of "π΅ HarmoniCa: Harmonizing Training and Inference for Better Feature Caching iβ¦β45Jul 10, 2025Updated 10 months ago
- EXL2 quantization generalized to other models.β10Mar 17, 2024Updated 2 years ago
- The code for Joint Neural Architecture Search and Quantizationβ14Apr 10, 2019Updated 7 years ago
- β11Jan 10, 2025Updated last year
- Offline Quantization Tools for Deploy.β143Dec 28, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean β’ AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [CVPR 2025] DreamRelation: Bridging Customization and Relation Generationβ19Dec 17, 2025Updated 5 months ago
- [EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.β721May 14, 2026Updated 3 weeks ago
- β22Feb 11, 2022Updated 4 years ago
- [CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization forβ¦β110Sep 29, 2025Updated 8 months ago
- Convert KITTI labels for yolo training.β10Nov 20, 2022Updated 3 years ago
- β12Sep 20, 2018Updated 7 years ago
- This repository contains code and diagram for human following robot projectβ13Nov 1, 2021Updated 4 years ago
- [ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Modβ¦β39Mar 11, 2024Updated 2 years ago
- Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLMβ14Dec 27, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- β12Jan 10, 2023Updated 3 years ago
- A collection of research papers on low-precision training methodsβ68May 10, 2025Updated last year
- Leaderboard of Frontier Models for Program Repair https://repairbench.github.io/β11Oct 26, 2025Updated 7 months ago
- NART = NART is not A RunTime, a deep learning inference framework.β37Mar 2, 2023Updated 3 years ago
- QuIP quantizationβ66Mar 17, 2024Updated 2 years ago
- Implementing LRP (Layer-wise Relevance Propagation) for a sequence-to-sequence model with GRU layers.β12Sep 8, 2023Updated 2 years ago
- Read-only mirror of https://github.com/openjdk/jdk17u/β12Updated this week
- [PR 2024] HTQ: Exploring the High-Dimensional Trade-Off of Mixed-Precision Quantizationβ12Jul 16, 2024Updated last year
- demo about the usage of tvm.β12Jan 31, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer β’ AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- β13Jun 16, 2024Updated last year
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.β26May 13, 2026Updated 3 weeks ago
- β13Feb 16, 2022Updated 4 years ago
- My first project: a smart robot based on ROS with 2D lidar sensors and RGB-D cameraβ12Jul 7, 2019Updated 6 years ago
- ADAG: Transluce's MLP neuron-level circuit tracing libraryβ28Apr 10, 2026Updated last month
- Spring Petclinic Microservices with AI on Azure Container Appsβ14Jan 26, 2026Updated 4 months ago
- hisi3519v101,fast-mtcnn,opencv,face detectionβ16Oct 10, 2018Updated 7 years ago
- Explainability of Deep RL algorithms using graph networks and layer-wise relevance propagation.β12Aug 20, 2024Updated last year
- A practical example showing how to develop your own custom Spring Cloud Stream Binderβ11May 22, 2022Updated 4 years ago
- GPUs on demand by Runpod - Special Offer Available β’ AdRun AI, ML, and HPC workloads on powerful cloud GPUsβwithout limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- a linux OS on MIPS r3000 chipβ10Jul 5, 2019Updated 6 years ago
- A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languagβ¦β206Feb 10, 2025Updated last year
- Summary of system papers/frameworks/codes/tools on training or serving large modelβ57Dec 17, 2023Updated 2 years ago
- An implementation of DSOD in Pytonchβ15Jul 13, 2018Updated 7 years ago
- this is an integration of all the Spring Native hints that don't yet have another homeβ14Jan 11, 2026Updated 4 months ago
- Multi target people tracker for mobile robots. It uses multiple detector modalities, is based on particle filtering and outputs a set of β¦β16Nov 2, 2016Updated 9 years ago
- All-in-One Safety Evaluation Framworkβ50Apr 21, 2026Updated last month