Qualcomm-AI-research/outlier-free-transformers

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Qualcomm-AI-research/outlier-free-transformers)

Qualcomm-AI-research / outlier-free-transformers

☆46

Alternatives and similar repositories for outlier-free-transformers

Users that are interested in outlier-free-transformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Qualcomm-AI-research / pruning-vs-quantization
View on GitHub
☆26Mar 1, 2024Updated 2 years ago
AIS-SNU / GraNNDis_Artifact
View on GitHub
[PACT'24] GraNNDis. A fast and unified distributed graph neural network (GNN) training framework for both full-batch (full-graph) and min…
☆10Aug 13, 2024Updated last year
Qualcomm-AI-research / oscillations-qat
View on GitHub
☆81Jul 21, 2022Updated 4 years ago
epfml / pam
View on GitHub
☆16Dec 9, 2023Updated 2 years ago
hongsunjang / pipe-bd
View on GitHub
[DATE 2023] Pipe-BD: Pipelined Parallel Blockwise Distillation
☆12Jul 13, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
HabanaAI / Megatron-DeepSpeed
View on GitHub
Intel Gaudi's Megatron DeepSpeed Large Language Models for training
☆18Dec 19, 2024Updated last year
csguoh / IntLoRA
View on GitHub
[ICML2025] LoRA fine-tune directly on the INT4 models.
☆41Nov 25, 2024Updated last year
ashafahi / RobustTransferLWF
View on GitHub
Adversarially Robust Transfer Learning with LWF loss applied to the deep feature representation (penultimate) layer
☆19Feb 9, 2020Updated 6 years ago
htqin / QuantSR
View on GitHub
[NeurIPS 2023 Spotlight] This project is the official implementation of our accepted NeurIPS 2023 (spotlight) paper QuantSR: Accurate Low…
☆53May 13, 2024Updated 2 years ago
zysxmu / DDTB
View on GitHub
Pytorch implementation of our paper accepted by ECCV2022 -- Dynamic Dual Trainable Bounds for Ultra-low Precision Super-Resolution Networ…
☆30Sep 13, 2022Updated 3 years ago
thu-nics / ViDiT-Q
View on GitHub
[ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
☆163Mar 21, 2025Updated last year
Blaok / soda
View on GitHub
Stencil with Optimized Dataflow Architecture
☆12Feb 27, 2024Updated 2 years ago
kssteven418 / I-BERT
View on GitHub
[ICML'21 Oral] I-BERT: Integer-only BERT Quantization
☆269Jan 29, 2023Updated 3 years ago
Cheeun / DAQ-pytorch
View on GitHub
[WACV2022] Official Code for the "DAQ: Channel-Wise Distribution-Aware Quantization for Deep Image Super-Resolution Networks"
☆27Feb 19, 2024Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
yonsei-hpcp / pid-join
View on GitHub
☆12May 8, 2025Updated last year
xuchennlp / S2T
View on GitHub
The project for speech translation
☆12Sep 28, 2023Updated 2 years ago
Qualcomm-AI-research / transformer-quantization
View on GitHub
☆211Nov 9, 2021Updated 4 years ago
iamkanghyunchoi / falqon
View on GitHub
Official repository of paper [FALQON: Accelerating LoRA Fine-tuning with Low-Bit Floating-Point Arithmetic, NeurIPS 2025]
☆21Dec 2, 2025Updated 7 months ago
hongsunjang / HILOS
View on GitHub
[ASPLOS'26] HILOS: A Cost-Effective Near-Storage Processing Solution for Offline Inference of Long-Context LLMs
☆20Jan 18, 2026Updated 6 months ago
SII-pbguo / QueryCDR
View on GitHub
[ECCV 2024] QueryCDR: Query-based Controllable Distortion Rectification Network for Fisheye Images
☆11Feb 14, 2025Updated last year
ldynx / SAVE
View on GitHub
☆25Nov 22, 2024Updated last year
graphcore-research / pytorch-tensor-tracker
View on GitHub
Flexibly track outputs and grad-outputs of torch.nn.Module.
☆13Oct 6, 2023Updated 2 years ago
Qualcomm-AI-research / gptvq
View on GitHub
☆42Mar 28, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
iamkanghyunchoi / ait
View on GitHub
It's All In the Teacher: Zero-Shot Quantization Brought Closer to the Teacher [CVPR 2022 Oral]
☆29Sep 15, 2022Updated 3 years ago
AIS-SNU / PathWeaver
View on GitHub
A High-Throughput Multi-GPU System for Graph-Based Approximate Nearest Neighbor Search
☆21Jul 22, 2025Updated 11 months ago
cornell-zhang / llm-datatypes
View on GitHub
Codebase for ICML'24 paper: Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs
☆27Jun 25, 2024Updated 2 years ago
nbasyl / LLM-FP4
View on GitHub
The official implementation of the EMNLP 2023 paper LLM-FP4
☆225Dec 15, 2023Updated 2 years ago
iamkanghyunchoi / qimera
View on GitHub
Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples [NeurIPS 2021]
☆34Dec 12, 2021Updated 4 years ago
ModelTC / L2_Compression
View on GitHub
☆13Jun 16, 2024Updated 2 years ago
DCGM / SoftCTC
View on GitHub
This repository contains source codes for SoftCTC. Original paper can be found here: https://arxiv.org/abs/2212.02135
☆19Mar 7, 2023Updated 3 years ago
gshstexsociety / gshs-format
View on GitHub
LaTeX 양식 : R&E, 졸업논문, beamer 등등 - 컴파일된 결과 pdf파일 미포함
☆62Mar 11, 2025Updated last year
HabanaAI / Gaudi-tutorials
View on GitHub
Tutorials for running models on First-gen Gaudi and Gaudi2 for Training and Inference. The source files for the tutorials on https://dev…
☆65Sep 18, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cooelf / CompassMTL
View on GitHub
Task Compass: Scaling Multi-task Pre-training with Task Prefix (EMNLP 2022: Findings) (stay tuned & more will be updated)
☆22Oct 17, 2022Updated 3 years ago
ModelTC / TFMQ-DM
View on GitHub
[CVPR 2024 Highlight & TPAMI 2025] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for…
☆110Sep 29, 2025Updated 9 months ago
C-Fun / Self-Attentive-Pooling-for-Efficient-Deep-Learning
View on GitHub
Official PyTorch implementation of the paper entitled 'Self Attentive Pooling for Efficient Deep Learning'.
☆13May 3, 2024Updated 2 years ago
jaewonalive / PeerAiD
View on GitHub
☆21Jun 6, 2024Updated 2 years ago
joshyZhou / ASTv2
View on GitHub
Learning An Adaptive Sparse Transformer for Efficient Image Restoration
☆15Aug 3, 2025Updated 11 months ago
enyac-group / Quamba
View on GitHub
The official repository of Quamba1 [ICLR 2025] & Quamba2 [ICML 2025]
☆70Jun 19, 2025Updated last year
tonyzhao-jt / LLM-PQ
View on GitHub
Official Repo for "SplitQuant / LLM-PQ: Resource-Efficient LLM Offline Serving on Heterogeneous GPUs via Phase-Aware Model Partition and …
☆39Aug 29, 2025Updated 10 months ago