☆10 · Jul 14, 2022 · Updated 3 years ago
Alternatives and similar repositories for powerlevel-10k
Users who are interested in powerlevel-10k are comparing it to the repositories listed below
- Testing paligemma2 finetuning on reasoning dataset ☆18 · Dec 28, 2024 · Updated last year
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023… ☆13 · Dec 5, 2023 · Updated 2 years ago
- ☆12 · Aug 26, 2022 · Updated 3 years ago
- Quantization in the Jagged Loss Landscape of Vision Transformers ☆13 · Oct 22, 2023 · Updated 2 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging" ☆32 · Nov 4, 2024 · Updated last year
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices ☆12 · Jul 1, 2021 · Updated 4 years ago
- [CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation ☆11 · Jan 6, 2023 · Updated 3 years ago
- Minimal PyTorch implementation of TP, SP, FSDP and sharded-EMA ☆32 · Nov 27, 2025 · Updated 3 months ago
- Open Source Projects from Pallas Lab ☆21 · Oct 10, 2021 · Updated 4 years ago
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models ☆20 · Oct 2, 2024 · Updated last year
- ☆13 · Jun 16, 2024 · Updated last year
- Generator for VHDL regular expression matchers ☆15 · Jan 11, 2021 · Updated 5 years ago
- ☆15 · Sep 24, 2023 · Updated 2 years ago
- ☆45 · Dec 6, 2025 · Updated 3 months ago
- Model deployment, Yolov4, xilinx, Ultra96_v2. ☆15 · Nov 4, 2021 · Updated 4 years ago
- Reading notes on Speculative Decoding papers ☆26 · Feb 24, 2026 · Updated 3 weeks ago
- Scaling Sparse Fine-Tuning to Large Language Models ☆18 · Jan 31, 2024 · Updated 2 years ago
- ☆22 · Oct 26, 2022 · Updated 3 years ago
- ☆25 · Oct 31, 2024 · Updated last year
- AFPQ code implementation ☆23 · Nov 6, 2023 · Updated 2 years ago
- [COLM 2024] SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models ☆24 · Oct 5, 2024 · Updated last year
- ☆28 · Nov 5, 2021 · Updated 4 years ago
- The official repo of continuous speculative decoding ☆32 · Mar 28, 2025 · Updated 11 months ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models ☆23 · Mar 15, 2024 · Updated 2 years ago
- Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark" ☆30 · Jun 30, 2025 · Updated 8 months ago
- mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless inte… ☆25 · Nov 28, 2024 · Updated last year
- 2020 xilinx summer school ☆19 · Aug 13, 2020 · Updated 5 years ago
- official code for GliDe with a CaPE ☆20 · Aug 13, 2024 · Updated last year
- DAC System Design Contest 2020 ☆29 · Jun 11, 2020 · Updated 5 years ago
- ☆34 · Mar 28, 2025 · Updated 11 months ago
- ☆25 · Dec 11, 2021 · Updated 4 years ago
- Dynamic image recognition implemented on an FPGA ☆23 · Jul 31, 2020 · Updated 5 years ago
- ☆30 · Jul 22, 2024 · Updated last year
- Pytorch code for paper QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models ☆25 · Sep 27, 2023 · Updated 2 years ago
- [TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection" ☆38 · Aug 20, 2024 · Updated last year
- ACL 2023 ☆39 · Jun 6, 2023 · Updated 2 years ago
- Official PyTorch implementation of "GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance" (ICML 2025) ☆51 · Jul 6, 2025 · Updated 8 months ago
- Official implementation of the ICLR 2024 paper AffineQuant ☆28 · Mar 30, 2024 · Updated last year
- Data-Free Neural Architecture Search via Recursive Label Calibration. ECCV 2022. ☆33 · Sep 13, 2022 · Updated 3 years ago