ylsung/rsq

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ylsung/rsq)

ylsung / rsq

Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"

☆23

Alternatives and similar repositories for rsq

Users that are interested in rsq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

JingyangXiang / DFRot
View on GitHub
[COLM 2025] DFRot: Achieving Outlier-Free and Massive Activation-Free for Rotated LLMs with Refined Rotation; 知乎：https://zhuanlan.zhihu.c…
☆30Mar 5, 2025Updated last year
Intelligent-Computing-Lab-Panda / TesseraQ
View on GitHub
☆25Oct 31, 2024Updated last year
LinkAnonymous / BESA
View on GitHub
☆12Oct 9, 2023Updated 2 years ago
zhangsichengsjtu / AFPQ
View on GitHub
AFPQ code implementation
☆23Nov 6, 2023Updated 2 years ago
BrotherHappy / OSTQuant
View on GitHub
[ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitt…
☆94Apr 8, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
jaehong31 / RACCooN
View on GitHub
(EMNLP 2025 Main) RACCooN: A Versatile Instructional Video Editing Framework with Auto-Generated Narratives
☆37Dec 20, 2025Updated 7 months ago
dinobby / MAgICoRE
View on GitHub
☆23Sep 19, 2024Updated last year
daeunni / VideoRepair
View on GitHub
Code for "VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement [ACL 2026 Findings]"
☆52Apr 7, 2026Updated 3 months ago
ChenMnZ / PrefixQuant
View on GitHub
An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization
☆176Nov 26, 2025Updated 7 months ago
ruikangliu / FlatQuant
View on GitHub
[ICML 2025] Official PyTorch implementation of "FlatQuant: Flatness Matters for LLM Quantization"
☆223Nov 25, 2025Updated 8 months ago
arsenal9971 / shearlet_semantic_edge
View on GitHub
This repository contains the entire pipline (including data preprocessing, training, testing, evaluation and visualization) for the Shear…
☆11Dec 3, 2019Updated 6 years ago
66RING / CritiPrefill
View on GitHub
Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".
☆17Sep 15, 2024Updated last year
TL-UESTC / DAPM
View on GitHub
A Pytorch implementation of Diffusion-Based Probabilistic Uncertainty Estimation for Active Domain Adaptation
☆16Nov 28, 2023Updated 2 years ago
StarDewXXX / AdaR1
View on GitHub
The official repository of NeurIPS'25 paper "Ada-R1: From Long-Cot to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization"
☆24May 6, 2026Updated 2 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
dinobby / MAGDi
View on GitHub
The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…
☆40Feb 5, 2024Updated 2 years ago
song-wx / SIFT
View on GitHub
[ICML2024 Spotlight] Fine-Tuning Pre-trained Large Language Models Sparsely
☆24Jun 26, 2024Updated 2 years ago
TianchunH97 / fairseq-rl
View on GitHub
Modified version of fairseq, including new implementations for criterions using reinforcement learning methods.
☆11Aug 14, 2019Updated 6 years ago
SkyRiverMoon / PSDLO
View on GitHub
☆12Aug 10, 2023Updated 2 years ago
Improbable-AI / orso
View on GitHub
☆18Feb 22, 2025Updated last year
moranshkolnik / RobustQuantization
View on GitHub
source code of the paper: Robust Quantization: One Model to Rule Them All
☆42Mar 24, 2023Updated 3 years ago
daniel-cores / tvbench
View on GitHub
TVBench: Redesigning Video-Language Evaluation
☆15Jun 9, 2025Updated last year
philschmid / optimum-static-quantization
View on GitHub
☆28May 3, 2023Updated 3 years ago
ZGC-EmbodyAI / TwinBrainVLA
View on GitHub
☆29May 22, 2026Updated 2 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
gmlwns2000 / sea-attention
View on GitHub
Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)
☆12Jun 20, 2025Updated last year
Mia-Cong / SWIFT
View on GitHub
Official implementation of "Can Test-Time Scaling Improve World Foundation Model?"
☆15Jul 12, 2025Updated last year
lhxcs / DVD-Quant
View on GitHub
☆17Oct 5, 2025Updated 9 months ago
penhunt / full-quantization-DNN
View on GitHub
PyTorch code for full quantization of DNN using BCGD
☆14Jul 24, 2019Updated 7 years ago
A-suozhang / ViDiT-Q
View on GitHub
☆15Mar 21, 2025Updated last year
Yui010206 / MEXA
View on GitHub
[EMNLP 2025 Findings] MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
☆15Aug 22, 2025Updated 11 months ago
UtkarshSaxena1 / EigenAttn
View on GitHub
☆20Oct 13, 2024Updated last year
klauscc / DAM
View on GitHub
Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning
☆15Apr 25, 2024Updated 2 years ago
michiganleon / ReCLIP_WACV
View on GitHub
☆18Mar 4, 2024Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
ScalingIntelligence / CATS
View on GitHub
☆33Nov 11, 2024Updated last year
CeeZh / SILVR
View on GitHub
Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"
☆19Jan 18, 2026Updated 6 months ago
AgenticIR-Lab / OThink-R1
View on GitHub
This is the official code for OThink-R1 project.
☆21Jun 19, 2025Updated last year
BAI-LAB / MoE-CL
View on GitHub
[WWW 2026 Oral] MoE-CL:Self-Evolving LLMs via Continual Instruction Tuning
☆21Dec 1, 2025Updated 7 months ago
SimonAytes / SoT
View on GitHub
Official code repository for Sketch-of-Thought (SoT)
☆138May 8, 2025Updated last year
Adamdad / Samesame
View on GitHub
An Tensorflow.keras implementation of Same, Same But Different - Recovering Neural Network Quantization Error Through Weight Factorizatio…
☆10Dec 18, 2019Updated 6 years ago
goodevening13 / aquakv
View on GitHub
☆21Apr 27, 2026Updated 2 months ago