☆11Jul 14, 2022Updated 3 years ago
Alternatives and similar repositories for powerlevel-10k
Users that are interested in powerlevel-10k are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Testing paligemma2 finetuning on reasoning dataset☆18Dec 28, 2024Updated last year
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…☆13Dec 5, 2023Updated 2 years ago
- ☆12Aug 26, 2022Updated 3 years ago
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆33Nov 4, 2024Updated last year
- Quantization in the Jagged Loss Landscape of Vision Transformers☆13Oct 22, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Binary Neural Network-based COVID-19 Face-Mask Wear and Positioning Predictor on Edge Devices☆12Jul 1, 2021Updated 4 years ago
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference☆45Mar 28, 2026Updated 2 months ago
- [CVPR 2022] AlignQ: Alignment Quantization with ADMM-based Correlation Preservation☆11Jan 6, 2023Updated 3 years ago
- Minimal PyTorch implementation of TP, SP, FSDP and sharded-EMA☆32Nov 27, 2025Updated 6 months ago
- Open Source Projects from Pallas Lab☆21Oct 10, 2021Updated 4 years ago
- [Findings of EMNLP 2024] AdaMoE: Token-Adaptive Routing with Null Experts for Mixture-of-Experts Language Models☆20Oct 2, 2024Updated last year
- ☆13Jun 16, 2024Updated last year
- IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse☆106Mar 14, 2026Updated 2 months ago
- Generator for VHDL regular expression matchers☆15Jan 11, 2021Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆15Sep 24, 2023Updated 2 years ago
- Model deployment, Yolov4, xilinx, Ultra96_v2.☆16Nov 4, 2021Updated 4 years ago
- Reading notes on Speculative Decoding papers☆36Jun 2, 2026Updated last week
- Scaling Sparse Fine-Tuning to Large Language Models☆19Jan 31, 2024Updated 2 years ago
- ☆79Apr 18, 2026Updated last month
- ☆22Oct 26, 2022Updated 3 years ago
- ☆25Oct 31, 2024Updated last year
- AFPQ code implementation☆23Nov 6, 2023Updated 2 years ago
- [COLM 2024] SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models☆24Oct 5, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆28Nov 5, 2021Updated 4 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated 2 years ago
- Official code for the paper "Examining Post-Training Quantization for Mixture-of-Experts: A Benchmark"☆30Jun 30, 2025Updated 11 months ago
- mi-optimize is a versatile tool designed for the quantization and evaluation of large language models (LLMs). The library's seamless inte …☆25Nov 28, 2024Updated last year
- 2020 xilinx summer school☆20Aug 13, 2020Updated 5 years ago
- The official repo of continuous speculative decoding☆34Mar 28, 2025Updated last year
- official code for GliDe with a CaPE☆22Aug 13, 2024Updated last year
- DAC System Design Contest 2020☆29Jun 11, 2020Updated 6 years ago
- ☆34Mar 28, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆25Dec 11, 2021Updated 4 years ago
- FPGA实现动态图像识别☆24Jul 31, 2020Updated 5 years ago
- ☆30Jul 22, 2024Updated last year
- Pytorch code for paper QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models☆25Sep 27, 2023Updated 2 years ago
- [TMLR] Official PyTorch implementation of paper "Efficient Quantization-aware Training with Adaptive Coreset Selection"☆39Aug 20, 2024Updated last year
- ACL 2023☆39Jun 6, 2023Updated 3 years ago
- Official PyTorch implementation of "GuidedQuant: Large Language Model Quantization via Exploiting End Loss Guidance" (ICML 2025)☆51Apr 13, 2026Updated last month