Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"
☆11Mar 31, 2024Updated last year
Alternatives and similar repositories for apple-silicon-4bit-quant
Users that are interested in apple-silicon-4bit-quant are comparing it to the libraries listed below
Sorting:
- ModernBERT model optimized for Apple Neural Engine.☆31Jan 10, 2025Updated last year
- Tool for visual profiling Core ML models, compatible with both package and compiled versions, including reasons for unsupported operation…☆38Jun 18, 2024Updated last year
- Train small sequence models in your browser with WebGPU.☆32Dec 3, 2025Updated 3 months ago
- Code for my workshop "Production-ready WebAssembly with Rust" presented at RustLab 2023 in Florence☆15Nov 23, 2023Updated 2 years ago
- Tool for exporting Apple Neural Engine-accelerated versions of transformers models on HuggingFace Hub.☆13Updated this week
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated 10 months ago
- See the device (CPU/GPU/ANE) and estimated cost for every layer in your CoreML model.☆25Oct 23, 2025Updated 4 months ago
- Find out why your CoreML model isn't running on the Neural Engine!☆30Jun 18, 2024Updated last year
- Run transformers (incl. LLMs) on the Apple Neural Engine.☆63Nov 22, 2023Updated 2 years ago
- Profile your CoreML models directly from Python 🐍☆30Sep 8, 2025Updated 6 months ago
- CLI to demonstrate running a large language model (LLM) on Apple Neural Engine.☆124Dec 27, 2024Updated last year
- EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive Pruning (ACL 2023)☆33Jul 18, 2023Updated 2 years ago
- A minimalistic Swift implementation of the Jinja templating engine, specifically designed for parsing and rendering ML chat templates.☆119Feb 19, 2026Updated 2 weeks ago
- CodePath Slackbot (Fred)☆11Mar 26, 2021Updated 4 years ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 4 months ago
- ☆11Jan 7, 2023Updated 3 years ago
- Example iOS app using the open-source combustion-ios-ble framework.☆11Aug 2, 2023Updated 2 years ago
- Qwen3-TTS, Apple MLX, WebUI, API Server☆34Feb 12, 2026Updated 3 weeks ago
- import documents for LLMs☆46Jan 19, 2025Updated last year
- This Elgg plugin lets users preview MS Office files (doc, docx, xls, xlsx, ppt, pptx), Apple iWork pages, Adobe eps, and zip files using …☆12Aug 28, 2015Updated 10 years ago
- Continuous Benchmark for cache libraries written in golang.☆12Mar 26, 2023Updated 2 years ago
- Fastai+PyTorch implementation of sparse model training methods (SET, SNFS, RigL) + customize-your-own.☆10Oct 20, 2022Updated 3 years ago
- Vectorgraph Image Painter☆12Mar 24, 2019Updated 6 years ago
- ☆10May 2, 2023Updated 2 years ago
- O'Reilly Course, In-Memory Computing Essentials☆10Oct 16, 2020Updated 5 years ago
- ☆13Nov 27, 2025Updated 3 months ago
- ☆11Apr 5, 2023Updated 2 years ago
- Symbolic Graphics Programming with Large Language Models☆37Sep 14, 2025Updated 5 months ago
- codes for ICML2021 paper iDARTS: Differentiable Architecture Search with Stochastic Implicit Gradients☆10May 27, 2021Updated 4 years ago
- ☆10Nov 16, 2024Updated last year
- Creole Network Monorepo☆11Dec 2, 2024Updated last year
- 知乎图片选择框架的优化版本,增加是否选择原图功能,可显示原图大小,状态栏颜色自适应;解决无法显示某些大图的bug。☆11Sep 11, 2019Updated 6 years ago
- An example of distributed tracing an MCP enabled agent☆15Feb 14, 2026Updated 3 weeks ago
- a tiny, portable, stackless coroutine in C++11☆11May 17, 2023Updated 2 years ago
- Try to export the ONNX QDQ model that conforms to the AXERA NPU quantization specification. Currently, only w8a8 is supported.☆10Sep 10, 2024Updated last year
- Alias mutliple derives as one.☆11Nov 30, 2024Updated last year
- ChineseCLIP using online learning☆13Nov 7, 2022Updated 3 years ago
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 29, 2025Updated 11 months ago