Introduction to Quantization
☆20Mar 3, 2024Updated 2 years ago
Alternatives and similar repositories for quantization-intro
Users that are interested in quantization-intro are comparing it to the libraries listed below
Sorting:
- ☆10Sep 10, 2023Updated 2 years ago
- Official implementation of the ICML 2024 paper RoSA (Robust Adaptation)☆44Feb 13, 2024Updated 2 years ago
- An sbt plugin for adding sounds to task completions☆28May 5, 2018Updated 7 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year
- Lightweight Neural Architecture Search for Temporal Convolutional Networks at the Edge☆10Mar 6, 2023Updated 2 years ago
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Fastai+PyTorch implementation of sparse model training methods (SET, SNFS, RigL) + customize-your-own.☆10Oct 20, 2022Updated 3 years ago
- Distributed Time Series Analysis on Spark☆15Jun 22, 2017Updated 8 years ago
- Implementation of a Hierarchical Mamba as described in the paper: "Hierarchical State Space Models for Continuous Sequence-to-Sequence Mo…☆15Nov 11, 2024Updated last year
- Symbolic Graphics Programming with Large Language Models☆37Sep 14, 2025Updated 5 months ago
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 11 months ago
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients☆10May 27, 2021Updated 4 years ago
- Pytorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference☆47Jun 19, 2024Updated last year
- ChineseCLIP using online learning☆13Nov 7, 2022Updated 3 years ago
- Nano vLLM☆12Jun 26, 2025Updated 8 months ago
- 🎯 Speech Recognition Challenge by Speech Lab - IIT Madras☆11Nov 5, 2020Updated 5 years ago
- For the session notes of the MSD class!☆12Mar 25, 2021Updated 4 years ago
- [NAACL'25 🏆 SAC Award] Official code for "Advancing MoE Efficiency: A Collaboration-Constrained Routing (C2R) Strategy for Better Expert…☆15Feb 4, 2025Updated last year
- ⚓️ Interactive playground for the "Thought Anchors: Which LLM Reasoning Steps Matter?" paper.☆18Dec 20, 2025Updated 2 months ago
- [ACL'22] Training-free Neural Architecture Search for RNNs and Transformers☆14May 26, 2024Updated last year
- A Keras-based and TensorFlow-backend NLP Models Toolkit.☆12Jul 7, 2022Updated 3 years ago
- Benchmark Python and Cython code☆13Jun 13, 2014Updated 11 years ago
- ☆11Jan 10, 2025Updated last year
- Simple i2c example for STM32F4: scan the i2c bus for connected devices☆11Jul 15, 2015Updated 10 years ago
- Sthaan uses AI to create digital addresses with local language support in voice/text, making it easier for people to find and reach locat…☆12Nov 17, 2024Updated last year
- My runthrough of karpathy's lectures (with notes), building NN's from scratch, simple autoregressive language models, GPT models and lear…☆10Sep 11, 2023Updated 2 years ago
- A modular implementation of product of experts VAEs for multimodal data☆13Nov 15, 2021Updated 4 years ago
- The Modern Data Stack in a (Smaller) Box☆12Jan 28, 2023Updated 3 years ago
- Code to reproduce the experiments of the ICLR24-paper: "Sparse Model Soups: A Recipe for Improved Pruning via Model Averaging"☆12Oct 14, 2025Updated 4 months ago
- ☆11Feb 10, 2018Updated 8 years ago
- PyTorch implementation of "Learning from Students: Online Contrastive Distillation Network for General Continual Learning" (IJCAI 2022)☆11Dec 29, 2022Updated 3 years ago
- [ICLR 2025] Linear Combination of Saved Checkpoints Makes Consistency and Diffusion Models Better☆16Feb 15, 2025Updated last year
- ☆12Sep 1, 2023Updated 2 years ago
- Code Repository for the NeurIPS 2024 Paper "Toward Efficient Inference for Mixture of Experts".☆19Oct 30, 2024Updated last year
- PyTorch code for our paper "Progressive Binarization with Semi-Structured Pruning for LLMs"☆13Sep 28, 2025Updated 5 months ago
- Official code of "NAS acceleration via proxy data", IJCAI21☆10May 29, 2022Updated 3 years ago
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair☆52Jan 29, 2024Updated 2 years ago
- ScienceMeter: Tracking Scientific Knowledge Updates in Language Models☆17Jun 28, 2025Updated 8 months ago