openvinotoolkit / mlasLinks

☆11

Alternatives and similar repositories for mlas

Users that are interested in mlas are comparing it to the libraries listed below

Sorting:

facebookresearch / loop_nest
Loop Nest - Linear algebra compiler and code generator.
☆22Updated 2 years ago
google-ai-edge / ai-edge-quantizer
AI Edge Quantizer: flexible post training quantization for LiteRT models.
☆53Updated last week
google-research / mozolm
MozoLM: A language model (LM) serving library
☆45Updated this week
Zyphra / zcookbook
Training hybrid models for dummies.
☆25Updated 6 months ago
onnx / steering-committee
Notes and artifacts from the ONNX steering committee
☆26Updated last week
kyutai-labs / jax-flash-attn3
JAX bindings for the flash-attention3 kernels
☆11Updated 11 months ago
apple / ml-acn-embed
Acoustic Neighbor Embeddings
☆24Updated 7 months ago
ARM-software / kleidiai
This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai
☆56Updated this week
catid / bitnet_cpu
Experiments with BitNet inference on CPU
☆54Updated last year
lucasnewman / vocos-mlx
Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX
☆21Updated 8 months ago
josephrocca / onnxscript-editor
ONNX Script editor & visualiser running completely in the browser thanks to Pyodide and Netron
☆20Updated 2 years ago
kyegomez / MM1
PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"
☆24Updated 2 weeks ago
facebookresearch / llama-hd-dataset
This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.
☆18Updated last year
lrw04 / tinyllamas-ncnn
Inference TinyLlama models on ncnn
☆24Updated last year
openvinotoolkit / openvino_tokenizers
OpenVINO Tokenizers extension
☆37Updated this week
graphcore-research / unit-scaling-demo
Unit Scaling demo and experimentation code
☆16Updated last year
onnx / neural-compressor
Model compression for ONNX
☆96Updated 7 months ago
tile-ai / tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
☆19Updated last week
josephrocca / lyra-v2-soundstream-web
Lyra V2 (SoundStream) running in the browser
☆19Updated last year
PINTO0309 / sne4onnx
A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…
☆17Updated last year
NVIDIA / DALI_deps
3rd party dependencies for DALI project
☆10Updated this week
daquexian / faster-rwkv
☆124Updated last year
GindaChen / FlexFlashAttention3
FlexAttention w/ FlashAttention3 Support
☆26Updated 9 months ago
goodmike31 / pl-asr-speech-data-survey
Survey of available speech datasets for Polish ASR development
☆16Updated 6 months ago
hisrg / SNPE
Snapdragon Neural Processing Engine (SNPE) SDKThe Snapdragon Neural Processing Engine (SNPE) is a Qualcomm Snapdragon software accelerate…
☆34Updated 3 years ago
mlcommons / mobile_models
MLPerf™ Mobile models
☆26Updated 9 months ago
oneapi-src / voice-data-generation
AI Starter Kit for Synthetic Voice and Audio Generation using Intel® Extension for Pytorch
☆2Updated last year
morousg / cvGPUSpeedup
A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!
☆51Updated last week
xaedes / llama.cpp
Port of Facebook's LLaMA model in C/C++
☆22Updated last year
habanero-lab / APPy
APPy (Annotated Parallelism for Python) enables users to annotate loops and tensor expressions in Python with compiler directives akin to…
☆24Updated 3 weeks ago