[ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
☆11Dec 13, 2023Updated 2 years ago
Alternatives and similar repositories for smoothquant
Users that are interested in smoothquant are comparing it to the libraries listed below
Sorting:
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆23Mar 15, 2024Updated last year
- ☆44Nov 29, 2025Updated 3 months ago
- ☆27Jan 8, 2024Updated 2 years ago
- Machine Learning Explorations - A list of machine learning resources☆33May 9, 2023Updated 2 years ago
- AWS 中文教程 | 读英文文档太久,读中文文档都是机翻,谷歌搜中文博客内容也不多,干脆做个列表列一下,方便找☆12May 23, 2021Updated 4 years ago
- Repo for ResNet-101 model☆13Oct 10, 2019Updated 6 years ago
- origin docs --> https://github.com/schrodingercatss/tuning_playbook_zh_cn☆10Feb 22, 2023Updated 3 years ago
- ☆16Nov 26, 2024Updated last year
- EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆27Jul 30, 2025Updated 7 months ago
- Artifact of the ICSE 2020 paper: "ReluDiff: Differential Verification of Deep Neural Networks"☆11Feb 1, 2022Updated 4 years ago
- Smart spinner component for Qwik, to manage the duration of loading states.☆13Sep 25, 2023Updated 2 years ago
- [Corca / OR] Solver for Multi-dimensional Multi-demand Quadratic Knapsack Problems☆12Mar 22, 2022Updated 3 years ago
- Demo of fine-tuning QA models for answering FAQ of cloud providers documentation☆11Mar 7, 2023Updated 2 years ago
- Twitch Stream Analysis with Apache Spark and Apache Zeppelin☆12Aug 8, 2016Updated 9 years ago
- ☆11Dec 11, 2024Updated last year
- ☆10Aug 3, 2022Updated 3 years ago
- ☆16Jan 14, 2025Updated last year
- Write events for TensorBoard☆11Jun 27, 2024Updated last year
- 适合于开发人员的运维管理平台(基于ASP.NET Core Blazor 5语言编写)☆11Feb 18, 2024Updated 2 years ago
- DALLE-tools provided useful dataset utilities to improve you workflow with WebDatasets.☆14Mar 9, 2022Updated 3 years ago
- Code for COMET: Cardinality Constrained Mixture of Experts with Trees and Local Search☆11Jun 21, 2023Updated 2 years ago
- cursor logs with gpt-4o using litellm proxy☆14Sep 9, 2025Updated 5 months ago
- Public codebase for ECONET: EMNLP'21☆12Mar 11, 2022Updated 3 years ago
- My Gen AI research☆11Jun 3, 2024Updated last year
- ☆15Feb 18, 2026Updated last week
- Generating Sentences from Disentangled Syntactic and Semantic Spaces☆11Jun 24, 2019Updated 6 years ago
- Mobile OTP based authentication using golang☆12Aug 5, 2023Updated 2 years ago
- Fast and memory-efficient exact attention☆18Updated this week
- An experimental library for HTML generation in Mojo☆13May 8, 2024Updated last year
- Evaluation Code repository for the paper "ModuLoRA: Finetuning 3-Bit LLMs on Consumer GPUs by Integrating with Modular Quantizers". (2023…☆13Dec 5, 2023Updated 2 years ago
- Locally run an Instruction-Tuned Chat-Style LLM☆13Mar 30, 2023Updated 2 years ago
- A few demos of deep learning☆13Jun 16, 2023Updated 2 years ago
- Idea based off of 12ft.io, this works for new york times!☆12Oct 30, 2023Updated 2 years ago
- [TSE 2024] APPT: Boosting Automated Patch Correctness Prediction via Fine-tuning Pre-trained Models☆15Jan 29, 2024Updated 2 years ago
- ☆51May 31, 2024Updated last year
- Phi-2 Colab Notebook☆14Dec 14, 2023Updated 2 years ago
- ☆17Feb 2, 2024Updated 2 years ago
- Yet another LLM☆10Apr 6, 2023Updated 2 years ago
- Node.js binding for SORT: Simple, online, and real-time tracking of multiple objects in a video sequence.☆12Apr 26, 2021Updated 4 years ago