openvinotoolkit / mlasLinks
☆11Updated 5 months ago
Alternatives and similar repositories for mlas
Users that are interested in mlas are comparing it to the libraries listed below
Sorting:
- Loop Nest - Linear algebra compiler and code generator.☆22Updated 2 years ago
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆53Updated last week
- MozoLM: A language model (LM) serving library☆45Updated this week
- Training hybrid models for dummies.☆25Updated 6 months ago
- Notes and artifacts from the ONNX steering committee☆26Updated last week
- JAX bindings for the flash-attention3 kernels☆11Updated 11 months ago
- Acoustic Neighbor Embeddings☆24Updated 7 months ago
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆56Updated this week
- Experiments with BitNet inference on CPU☆54Updated last year
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆21Updated 8 months ago
- ONNX Script editor & visualiser running completely in the browser thanks to Pyodide and Netron☆20Updated 2 years ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated 2 weeks ago
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.☆18Updated last year
- Inference TinyLlama models on ncnn☆24Updated last year
- OpenVINO Tokenizers extension☆37Updated this week
- Unit Scaling demo and experimentation code☆16Updated last year
- Model compression for ONNX☆96Updated 7 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated last week
- Lyra V2 (SoundStream) running in the browser☆19Updated last year
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆17Updated last year
- 3rd party dependencies for DALI project☆10Updated this week
- ☆124Updated last year
- FlexAttention w/ FlashAttention3 Support☆26Updated 9 months ago
- Survey of available speech datasets for Polish ASR development☆16Updated 6 months ago
- Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…☆34Updated 3 years ago
- MLPerf™ Mobile models☆26Updated 9 months ago
- AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for Pytorch☆2Updated last year
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆51Updated last week
- Port of Facebook's LLaMA model in C/C++☆22Updated last year
- APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…☆24Updated 3 weeks ago