LLM Quantization toolkit
☆20Jun 9, 2026Updated this week
Alternatives and similar repositories for lm-quant-toolkit
Users that are interested in lm-quant-toolkit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆47Apr 21, 2026Updated last month
- boilerplate for using create react app and ant design☆11Mar 15, 2019Updated 7 years ago
- stm32f103 usb keyboard cmake project☆10Aug 7, 2016Updated 9 years ago
- leadigital-agent☆13Mar 14, 2024Updated 2 years ago
- Understanding gRPC - go. The implementation details.☆13Apr 7, 2021Updated 5 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- IOT-python☆21Feb 16, 2026Updated 4 months ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆11Dec 13, 2023Updated 2 years ago
- 基于graaljs的js脚本引擎,api快速开发平台☆19Nov 6, 2021Updated 4 years ago
- React Native for Web (with server render)☆19Oct 22, 2018Updated 7 years ago
- ☆50May 9, 2026Updated last month
- 初创企业微服务化过程中,为了协作更顺畅,统一代码结构风格。尝试整理一套适合企业的应用架构。该工程为模板工程。依托于DDD 开源框架COLA 4.0应用架构。技术栈基于spring-cloud-alibaba、SpringBoot、Alibaba RocketMQ、mybat…☆10Jun 3, 2024Updated 2 years ago
- Yet another musicxml render written in JS☆19Feb 24, 2022Updated 4 years ago
- Multichannel Looper/Feedback System for Riffusion☆14May 6, 2023Updated 3 years ago
- 网络爬虫,动态网页爬取,多线程,并行☆19Dec 21, 2017Updated 8 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- An insanely secure password manager.☆17Mar 10, 2026Updated 3 months ago
- Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks☆15Feb 17, 2025Updated last year
- something new☆13Nov 25, 2023Updated 2 years ago
- OnePlus 8T Param Read/Write☆14Dec 4, 2020Updated 5 years ago
- Pytorch implementation of our paper accepted by ICML 2023 -- "Bi-directional Masks for Efficient N:M Sparse Training"☆13Jun 7, 2023Updated 3 years ago
- Machine Bullshit: Characterizing the Emergent Disregard for Truth in Large Language Models☆26Sep 14, 2025Updated 9 months ago
- spark,NLP,新词发现,自然语言处理☆23Mar 16, 2018Updated 8 years ago
- ☆12Apr 4, 2024Updated 2 years ago
- ☆16Nov 22, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- 🎓Automatically Update CV Papers Daily using Github Actions (Update Every 12th hours)☆12May 17, 2026Updated 3 weeks ago
- [ACL'26 Findings] Steering LLM Thinking with Budget Guidance☆31Feb 19, 2026Updated 3 months ago
- MTTN: Multi-Pair Text to Text Narratives for Prompt Generation☆11Feb 4, 2023Updated 3 years ago
- BESA is a differentiable weight pruning technique for large language models.☆17Mar 4, 2024Updated 2 years ago
- ☆17May 2, 2024Updated 2 years ago
- ☆16Feb 10, 2023Updated 3 years ago
- Pytorch code of [CVPR 2023] "NAR-Former: Neural Architecture Representation Learning towards Holistic Attributes Prediction".☆11Mar 14, 2023Updated 3 years ago
- ☆21Feb 5, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- channel pruning for accelerating very deep neural networks☆13Mar 8, 2021Updated 5 years ago
- [NAACL 2025] MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning☆20May 31, 2025Updated last year
- A curated list of my GitHub stars!☆23Updated this week
- Pytorch implementation of our paper accepted by NeurIPS 2022 -- Learning Best Combination for Efficient N:M Sparsity☆22Jan 13, 2023Updated 3 years ago
- ☆12Apr 23, 2025Updated last year
- Stalker Checker MAC Generator Portal IPTV FREE☆18Nov 18, 2024Updated last year
- Scripts/tools engineers uses to build and release Rocky Linux. Note: Peridot source code is at https://github.com/rocky-linux/peridot☆16Nov 3, 2022Updated 3 years ago