The-Inscrutable-X/TACQ

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/The-Inscrutable-X/TACQ)

The-Inscrutable-X / TACQ

Official Repository for Task-Circuit Quantization

☆28

Alternatives and similar repositories for TACQ

Users that are interested in TACQ are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

kaistAI / InstructIR
View on GitHub
IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc…
☆32Jun 13, 2024Updated 2 years ago
sunblaze-ucb / reasoning_ladder
View on GitHub
☆35May 16, 2025Updated last year
ModelTC / QVGen
View on GitHub
[ICLR 2026] This is the official PyTorch implementation of "QVGen: Pushing the Limit of Quantized Video Generative Models".
☆32Feb 11, 2026Updated 5 months ago
shalomma / PytorchBottleneck
View on GitHub
Information Bottleneck in DNN with PyTorch
☆15Jul 6, 2023Updated 3 years ago
ZhangShiyue / extractive_is_not_faithful
View on GitHub
☆17May 19, 2023Updated 3 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
krafton-ai / lexico
View on GitHub
KV cache compression via sparse coding
☆17Oct 26, 2025Updated 8 months ago
AXERA-TECH / Qwen2.5-VL-3B-Instruct.axera
View on GitHub
Demo for Qwen2.5-VL-3B-Instruct on Axera device.
☆16Sep 3, 2025Updated 10 months ago
josejg / instruction_following_eval
View on GitHub
Instruction Following Eval
☆18Jan 16, 2025Updated last year
thu-coai / Backdoor-Data-Extraction
View on GitHub
☆33May 22, 2025Updated last year
A-suozhang / MixDQ
View on GitHub
[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
☆14Nov 27, 2024Updated last year
EfficientLLMSys / MuxServe
View on GitHub
☆15Jun 26, 2024Updated 2 years ago
SuDIS-ZJU / llm-inference-all-in-one
View on GitHub
☆19Feb 18, 2025Updated last year
StiphyJay / MQuant
View on GitHub
[ACM MM2025]: MQuant: Unleashing the Inference Potential of Multimodal Large Language Models via Full Static Quantization
☆44Aug 13, 2025Updated 11 months ago
yuntian-group / interactive-training
View on GitHub
https://interactivetraining.ai/
☆18Jul 11, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
IST-DASLab / QuEST
View on GitHub
Work in progress.
☆80Nov 25, 2025Updated 7 months ago
dmis-lab / CompAct
View on GitHub
[EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answering
☆37Sep 20, 2024Updated last year
elsatch / daily_hf_papers_abstracts
View on GitHub
This repository includes the code to download the curated HuggingFace papers into a single markdown formatted file
☆16Jul 26, 2024Updated last year
agentic-learning-ai-lab / anticipatory-recovery
View on GitHub
Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"
☆11Oct 27, 2025Updated 8 months ago
xpan413 / FSMoE
View on GitHub
☆16Jan 14, 2025Updated last year
SteveTsui / ReBNN
View on GitHub
☆12Nov 17, 2023Updated 2 years ago
zaydzuhri / flame
View on GitHub
Fork of Flame repo for training of some new stuff in development
☆20Updated this week
SteveTsui / RBONN
View on GitHub
☆16Nov 25, 2022Updated 3 years ago
casys-kaist / oaken
View on GitHub
Artifact for Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization
☆17May 9, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
EEESlab / CMSIS_NN-INTQ
View on GitHub
INT-Q Extension of the CMSIS-NN library for ARM Cortex-M target
☆18Jan 10, 2020Updated 6 years ago
Cornell-RelaxML / qtip
View on GitHub
☆180Jun 22, 2025Updated last year
ndexter / MLFA
View on GitHub
Machine Learning Function Approximation: This code implements the fully-connected Deep Neural Network (DNN) architectures considered in t…
☆20Oct 27, 2020Updated 5 years ago
TianheWu / IQA-Paperlist
View on GitHub
Image Quality Assessment Paper Reading
☆15Sep 11, 2022Updated 3 years ago
awai54st / Enabling-Binary-Neural-Network-Training-on-the-Edge
View on GitHub
☆20Mar 6, 2022Updated 4 years ago
KaiLv69 / DuoDecoding
View on GitHub
DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting
☆19Mar 4, 2025Updated last year
duykhuongnguyen / MAT-Steer
View on GitHub
☆21Aug 19, 2025Updated 11 months ago
AiArt-Gao / SAGE
View on GitHub
[IJCAI'23] Semantic-aware Generation of Multi-view Portrait Drawings (SAGE)
☆10Feb 25, 2024Updated 2 years ago
Juanerx / Q-DiT
View on GitHub
[CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers
☆79Sep 3, 2024Updated last year
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
Xingyu-Zheng / FOEM
View on GitHub
(AAAI 2026) First-Order Error Matters: Accurate Compensation for Quantized Large Language Models
☆16Apr 16, 2026Updated 3 months ago
AminKaramlou / QNLG
View on GitHub
Contains the codebase for Quantum Natural Language Generation project
☆23Nov 2, 2022Updated 3 years ago
Adlik / smoothquantplus
View on GitHub
[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
☆23Mar 15, 2024Updated 2 years ago
garipovroma / autojudge
View on GitHub
[NeurIPS 2025] Official PyTorch implementation for the paper AutoJudge: Judge Decoding Without Manual Annotation
☆21Dec 22, 2025Updated 6 months ago
pulp-platform / gvsoc
View on GitHub
Pulp virtual platform
☆24Jul 16, 2025Updated last year
anakin-skywalker-Joseph / Folder
View on GitHub
Official Implementation of Paper FOLDER (ICCV2025) and Turbo (ECCV2024)
☆15Jun 27, 2025Updated last year
jiangycTarheel-zz / TPT-Summ
View on GitHub
☆11Jun 24, 2021Updated 5 years ago