openvinotoolkit / mlasLinks
☆11Updated 4 months ago
Alternatives and similar repositories for mlas
Users that are interested in mlas are comparing it to the libraries listed below
Sorting:
- ONNX Script editor & visualiser running completely in the browser thanks to Pyodide and Netron☆20Updated 2 years ago
- Inference TinyLlama models on ncnn☆24Updated last year
- JAX bindings for the flash-attention3 kernels☆11Updated 10 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆34Updated 2 years ago
- Acoustic Neighbor Embeddings☆24Updated 6 months ago
- the C++ version of Seq2Seq with ncnn☆23Updated 4 years ago
- ☆32Updated 11 months ago
- Whisper in TensorRT-LLM☆16Updated last year
- MozoLM: A language model (LM) serving library☆45Updated 4 months ago
- Estimating hardware and cloud costs of LLMs and transformer projects☆17Updated this week
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆49Updated this week
- Experiments with BitNet inference on CPU☆54Updated last year
- C++ implementations for various tokenizers (sentencepiece, tiktoken etc).☆31Updated last week
- FlexAttention w/ FlashAttention3 Support☆26Updated 8 months ago
- PyTorch Implementation of the paper "MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training"☆24Updated last month
- ☆11Updated 4 years ago
- Unit Scaling demo and experimentation code☆16Updated last year
- ☆29Updated 4 months ago
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆19Updated this week
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.☆18Updated last year
- UIE(Universal Information Extraction) infer by ncnn☆12Updated 9 months ago
- ☆21Updated last year
- Cantonese Grapheme-to-Phoneme Converter based on GitYCC/g2pW☆13Updated 6 months ago
- GGML implementation of BERT model with Python bindings and quantization.☆26Updated last year
- ☆10Updated 7 months ago
- ☆124Updated last year
- Using OpenVINO to speed up MeloTTS inference☆11Updated 7 months ago
- This repository is a read-only mirror of https://gitlab.arm.com/kleidi/kleidiai☆51Updated last week
- A faster implementation of OpenCV-CUDA that uses OpenCV objects, and more!☆51Updated this week
- ☆74Updated 7 months ago