[ICCAD 2025] Squant
☆15Jul 3, 2025Updated 8 months ago
Alternatives and similar repositories for squant
Users that are interested in squant are comparing it to the libraries listed below
Sorting:
- [ICLR 2026] FastCar☆16May 22, 2025Updated 9 months ago
- [CVPR 2025] QuartDepth☆17Mar 24, 2025Updated 11 months ago
- FireQ: Fast INT4-FP8 Kernel and RoPE-aware Quantization for LLM Inference Acceleration☆20Jun 27, 2025Updated 8 months ago
- It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher [CVPR 2022 Oral]☆29Sep 15, 2022Updated 3 years ago
- Single-Cell Multimodal Prediction via Transformer☆28Feb 7, 2024Updated 2 years ago
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆26Apr 15, 2025Updated 10 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆13Apr 29, 2025Updated 10 months ago
- Code for "APTBench: Benchmarking Agentic Potential of Base LLMs During Pre-Training"☆38Dec 23, 2025Updated 2 months ago
- ☆11May 3, 2022Updated 3 years ago
- ☆39Jun 9, 2025Updated 8 months ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆26Jun 16, 2025Updated 8 months ago
- ☆10Apr 24, 2024Updated last year
- Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".☆21May 23, 2025Updated 9 months ago
- ☆14Nov 12, 2025Updated 3 months ago
- Official repository for Activation-Informed Merging (AIM) of Large Language Models☆21Feb 10, 2025Updated last year
- A method to generate counterfactuals☆12Feb 24, 2026Updated last week
- ☆15Jan 12, 2026Updated last month
- Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution"☆12Apr 14, 2023Updated 2 years ago
- ☆11Apr 5, 2023Updated 2 years ago
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…☆14Dec 7, 2024Updated last year
- Codes for "Learning bounds for risk-sensitive learning," NeurIPS 2020 (or see arXiv 2006.08138)☆11Oct 15, 2020Updated 5 years ago
- ☆16Jul 29, 2025Updated 7 months ago
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"☆14Nov 25, 2025Updated 3 months ago
- monae: multi-modal single-cell integration and imputation☆13Sep 13, 2024Updated last year
- Heterogeneous Causal Metapath Graph Neural Network for Gene-Microbe-Disease Association Prediction☆12Aug 19, 2024Updated last year
- Efficient LLM Inference Acceleration using Prompting☆51Oct 22, 2024Updated last year
- [ICML 2025] MxMoE: Mixed-precision Quantization for MoE with Accuracy and Performance Co-Design☆22Jul 4, 2025Updated 8 months ago
- scAce: an adaptive embedding and clustering method for scRNA-seq data☆12Sep 8, 2023Updated 2 years ago
- ☆13Jul 14, 2025Updated 7 months ago
- Official Implementation of Robustifying and Boosting Training-Free Neural Architecture Search☆10Mar 12, 2024Updated last year
- Codebase for training the SubCell models☆18Updated this week
- Biological Information Extraction from Large Language Models (LLMs) (Journal of Computational Biology 2025)☆12Jun 18, 2025Updated 8 months ago
- MSTI☆16Mar 6, 2024Updated 2 years ago
- Reproducible analyses for the NicheCompass manuscript☆13Jul 3, 2025Updated 8 months ago
- Evaluating GPT-OSS on BrowseComp-Plus with Native Browsering Tools☆17Oct 17, 2025Updated 4 months ago
- Bencharking pipeline for evaluating Transcriptomic representations for perturbation tasks☆12Nov 5, 2024Updated last year
- Code for Heima☆59Apr 21, 2025Updated 10 months ago
- ☆17Mar 10, 2025Updated 11 months ago
- [COLM 2025] Official PyTorch implementation of "Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models"☆71Jul 8, 2025Updated 7 months ago