snudm-starlab / SynQ
SynQ: Accurate Zero-shot Quantization by Synthesis-aware Fine-tuning (ICLR 2025)
☆27Updated 11 months ago
Alternatives and similar repositories for SynQ
Users who are interested in SynQ are comparing it to the repositories listed below.
- SensiMix: Sensitivity-Aware 8-bit Index & 1-bit Value Mixed Precision Quantization for BERT Compression (PLOS One)☆34Updated 5 months ago
- Structured pruning algorithm for pruning Transformer☆31Updated 2 years ago
- Pea-KD: Parameter-efficient and accurate knowledge distillation on BERT (PLOS One)☆35Updated 5 months ago
- ☆33Updated 3 years ago
- Flexible Convolutional Neural Network☆23Updated 2 years ago
- PET: Parameter-efficient Knowledge Distillation on Transformer (PLOS One)☆15Updated 5 months ago
- Falcon: Lightweight and Accurate Convolution Based on Depthwise Separable Convolution (KAIS)☆45Updated 5 months ago
- Edge-guided Model Inversion for Accurate Data-Free Applications☆22Updated 2 months ago
- Vector multiplication on Low-rank Matrix Factorization☆46Updated 2 years ago
- ☆90Updated last year
- Accurate Retraining-free Pruning for Pretrained Encoder-based Language Models (ICLR 2024)☆14Updated 8 months ago
- Review papers of NLP, mainly LLM.☆33Updated last year
- ☆56Updated 3 years ago
- AAAI 2022 accepted paper, NaturalInversion: Data-Free Image Synthesis Improving Real-World Consistency☆10Updated 3 years ago
- The official NetsPresso Python package.☆47Updated 2 months ago
- LaTeX templates: R&E, graduation thesis, beamer, etc. (compiled PDF output not included)☆63Updated 10 months ago
- [AAAI 2025] SMMF: Square-Matricized Momentum Factorization for Memory-Efficient Optimization☆20Updated 8 months ago
- ☆28Updated 11 months ago
- PyTorch CoreSIG☆57Updated last year
- Official Implementation of LANTERN (ICLR'25) and LANTERN++ (ICLRW-SCOPE'25)☆19Updated 10 months ago
- A collection of materials on recommender systems☆88Updated last year
- ☆56Updated last year
- This project aims to automatically translate and summarize Huggingface's daily papers into Korean using ChatGPT.☆52Updated 8 months ago
- Repository of code implemented from recommender system papers☆66Updated 2 years ago
- A performance library for machine learning applications.☆184Updated 2 years ago
- Repository for the paper, "Exploiting Representation Curvature for Boundary Detection in Time Series" accepted at NeurIPS 2024☆19Updated last year
- [Zoom & Facebook Live] Weekly AI Arxiv Season 2☆962Updated 2 years ago
- OwLite is a low-code compression toolkit for AI models.☆52Updated 2 months ago
- 4th AI × Bookathon, Excellence Award☆14Updated 3 years ago
- ☆22Updated 3 years ago