AozhongZhang/MagR

☆16

Alternatives and similar repositories for MagR

Users who are interested in MagR are comparing it to the libraries listed below.

ziplab / QLLM
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆31 · Mar 12, 2024 · Updated 2 years ago
ModelTC / Outlier_Suppression_Plus
Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti…
☆52 · Oct 21, 2023 · Updated 2 years ago
ilur98 / DGQ
Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM
☆14 · Dec 27, 2023 · Updated 2 years ago
thepowerfuldeez / OLMo
My fork of Allen AI's OLMo for educational purposes.
☆28 · Dec 5, 2024 · Updated last year
SonicCodes / subcloning
implementation of https://arxiv.org/pdf/2312.09299
☆21 · Jul 3, 2024 · Updated 2 years ago
Intelligent-Computing-Lab-Panda / TesseraQ
☆25 · Oct 31, 2024 · Updated last year
Qualcomm-AI-research / lr-qat
☆54 · Nov 5, 2024 · Updated last year
Sike-Wang / low-bit-Shampoo
4-bit Shampoo for Memory-Efficient Network Training (NeurIPS 2024)
☆13 · Feb 13, 2025 · Updated last year
xvyaward / owq
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…
☆72 · Mar 7, 2024 · Updated 2 years ago
Kai-Liu001 / CondiQuant
☆12 · Feb 24, 2025 · Updated last year
shahdharam7 / MGSEE
In the hyperspectral unmixing literature, endmember extraction is addressed majorly using three methods i.e. Statistical, Sparse-regressi…
☆11 · Nov 30, 2020 · Updated 5 years ago
ACondaway / Ego-Exo4D_bodypose_challenge_code_base
A code base for the third-place solution to the Ego-Exo4D body pose challenge at the CVPR 2024 workshop
☆12 · Jun 16, 2024 · Updated 2 years ago
pharaouk / dharma
☆13 · Apr 25, 2024 · Updated 2 years ago
ziplab / EcoFormer
[NeurIPS 2022 Spotlight] This is the official PyTorch implementation of "EcoFormer: Energy-Saving Attention with Linear Complexity"
☆74 · Nov 15, 2022 · Updated 3 years ago
thomasahle / kanmlps
KANs and MLPs
☆12 · Jun 7, 2024 · Updated 2 years ago
evanatyourservice / llm-jax
Train a SmolLM-style LLM on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19 · Jul 24, 2025 · Updated last year
ucsb-seclab / BullseyePoison
Bullseye Polytope Clean-Label Poisoning Attack
☆18 · Nov 5, 2020 · Updated 5 years ago
KellerJordan / hlb-CIFAR10
Train to 94% on CIFAR-10 in 4.4 seconds on a single A100
☆12 · Dec 30, 2023 · Updated 2 years ago
IST-DASLab / QIGen
Repository for CPU Kernel Generation for LLM Inference
☆28 · Jul 13, 2023 · Updated 3 years ago
bhneo / decorrelated_bn
An implementation of DecorrelatedBN in TensorFlow
☆13 · Jun 30, 2022 · Updated 4 years ago
cfmata / CoPT
[ECCV 2024] Official Implementation of CoPT: Unsupervised Domain Adaptive Segmentation using Domain-Agnostic Text Embeddings
☆10 · Feb 24, 2025 · Updated last year
AlpinDale / QuIP-for-Llama
Code for paper: "QuIP: 2-Bit Quantization of Large Language Models With Guarantees" adapted for Llama models
☆41 · Aug 4, 2023 · Updated 2 years ago
xiaofeng1990 / tensorrt-tutorial
TensorRT deployment tutorial
☆11 · Aug 1, 2025 · Updated 11 months ago
google / drjax
☆19 · Jul 8, 2026 · Updated 3 weeks ago
siyan-zhao / prepacking
The source code of our work "Prepacking: A Simple Method for Fast Prefilling and Increased Throughput in Large Language Models" [AISTATS …
☆62 · Oct 11, 2024 · Updated last year
NUS-HPC-AI-Lab / Dynamic-Tuning
The official implementation of "Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation" (NeurIPS 2024)
☆54 · Dec 30, 2024 · Updated last year
iankur / vqllm
Residual vector quantization for KV cache compression in large language models
☆12 · Oct 22, 2024 · Updated last year
gmlwns2000 / sea-attention
Official Implementation of SEA: Sparse Linear Attention with Estimated Attention Mask (ICLR 2024)
☆12 · Jun 20, 2025 · Updated last year
nreimers / se-pytorch-xla
☆21 · Sep 6, 2021 · Updated 4 years ago
ETHRuiGong / TADA
TACS: Taxonomy Adaptive Cross-Domain Semantic Segmentation
☆12 · Jul 14, 2022 · Updated 4 years ago
HuangOwen / RoLoRA
[EMNLP 2024] RoLoRA: Fine-tuning Rotated Outlier-free LLMs for Effective Weight-Activation Quantization
☆41 · Sep 24, 2024 · Updated last year
lixilinx / IVA4Cocktail
Neural network density models for speech separation.
☆20 · Nov 26, 2020 · Updated 5 years ago
mtenenholtz / lmsys-chatbot-arena-solution
☆15 · Aug 26, 2024 · Updated last year
ariellubonja / orthogonal-matching-pursuit-gpu
Orthogonal Matching Pursuit, parallelized on both CPU and GPU. 100x+ Speedup
☆17 · Apr 24, 2026 · Updated 3 months ago
XIANGLONGYAN / PBS2P
PyTorch code for our paper "Progressive Binarization with Semi-Structured Pruning for LLMs"
☆13 · Jul 11, 2026 · Updated 2 weeks ago
haizhongzheng / LTE
☆13 · Oct 13, 2025 · Updated 9 months ago
ElvisCheny / CUDA_C-Code
Example code for the book "CUDA C编程权威指南" (an authoritative guide to CUDA C programming)
☆13 · Mar 22, 2023 · Updated 3 years ago
Beckschen / spatialcode
Open studio for "Thinking with Spatial Code" (https://arxiv.org/pdf/2603.05591)
☆20 · Mar 18, 2026 · Updated 4 months ago
SimarKareer / UnifiedVideoDA
We're Not Using Videos Effectively (TMLR 2024)
☆17 · Feb 4, 2024 · Updated 2 years ago