Introduction to Quantization
☆20Mar 3, 2024Updated 2 years ago
Alternatives and similar repositories for quantization-intro
Users that are interested in quantization-intro are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lightweight Neural Architecture Search for Temporal Convolutional Networks at the Edge☆10Mar 6, 2023Updated 3 years ago
- ☆10Aug 22, 2023Updated 2 years ago
- ☆14May 4, 2024Updated 2 years ago
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated last year
- Pytorch implementation of our paper accepted by ICML 2024 -- CaM: Cache Merging for Memory-efficient LLMs Inference☆49Jun 19, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models☆35Jun 12, 2024Updated last year
- Software abstractions for the analog signal exploration tools.☆34Sep 3, 2024Updated last year
- Designs for finalist teams of the DAC System Design Contest☆37Jul 8, 2020Updated 5 years ago
- Official Implementation of "LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference"☆25Nov 12, 2023Updated 2 years ago
- Python scripts for WIDER FACE Evaluation☆10May 25, 2019Updated 6 years ago
- A TensorFlow implementation of Google's BlazeFace☆11Nov 9, 2021Updated 4 years ago
- Android app for the Hole in your Palm project, making LLMs accessible on-device!☆19May 3, 2024Updated 2 years ago
- Dirve human model in Unity through 3D keypoints☆11Sep 30, 2019Updated 6 years ago
- Collection of Common Machine Translation Tools☆11Jul 26, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- The Modern Data Stack in a (Smaller) Box☆12Jan 28, 2023Updated 3 years ago
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Oct 12, 2022Updated 3 years ago
- This repo has been migrated to https://code.larus.se/lmas/Damerau-Levenshtein☆11Jul 21, 2023Updated 2 years ago
- tdd skill for coding agents☆139Feb 24, 2026Updated 2 months ago
- ☆19Jan 8, 2020Updated 6 years ago
- My solutions for Advanced Python Mastery (course by @dabeaz)☆11Jan 29, 2024Updated 2 years ago
- A Lua extension library that allows to run various scripting languages from within Lua scripts, cross-language require scripts and load 3…☆18Nov 23, 2020Updated 5 years ago
- ACL Paper Lists(machine translation)☆13Mar 23, 2022Updated 4 years ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Sampling-Based Minimum Bayes-Risk Decoding for Neural Machine Translation☆16Oct 14, 2022Updated 3 years ago
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- Video object detection benchmark.☆19Jan 24, 2019Updated 7 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last month
- My learnings (publicly) on RAG systems☆14Jan 2, 2024Updated 2 years ago
- A Dataset for Direct Quotation Extraction and Attribution in News Articles.☆14Sep 28, 2021Updated 4 years ago
- For the session notes of the MSD class!☆12Mar 25, 2021Updated 5 years ago
- ☆20Oct 25, 2025Updated 6 months ago
- ☆14Sep 10, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Project code for training LLMs to write better unit tests + code☆21May 19, 2025Updated 11 months ago
- ☆10Oct 7, 2019Updated 6 years ago
- Fastai+PyTorch implementation of sparse model training methods (SET, SNFS, RigL) + customize-your-own.☆10Oct 20, 2022Updated 3 years ago
- Official Implementation of "Accel-GNN: High-Performance GPU Accelerator Design for Graph Neural Networks"☆52Mar 20, 2025Updated last year
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair☆53Jan 29, 2024Updated 2 years ago
- ☆15May 29, 2022Updated 3 years ago
- ☆20Dec 16, 2024Updated last year