openvinotoolkit / mlas
☆10Updated 11 months ago
Alternatives and similar repositories for mlas:
Users that are interested in mlas are comparing it to the libraries listed below
- ☆9Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆27Updated 11 months ago
- Experiments with BitNet inference on CPU☆52Updated 9 months ago
- ☆18Updated this week
- Loop Nest - Linear algebra compiler and code generator.☆22Updated 2 years ago
- Inference TinyLlama models on ncnn☆24Updated last year
- 3rd party dependencies for DALI project☆10Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆34Updated 2 years ago
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆22Updated last month
- Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX☆15Updated 2 months ago
- ☆22Updated 9 months ago
- Github repo for Peifeng's internship project☆12Updated last year
- StyleTTS 2 Optimized Training Fork☆15Updated this week
- A TensorFlow Extension: GPU performance tools for TensorFlow.☆25Updated last year
- ONNX Script editor & visualiser running completely in the browser thanks to Pyodide and Netron☆20Updated last year
- Course Project for COMP4471 on RWKV☆16Updated 11 months ago
- ☆12Updated last year
- Ubuntu kernels which are optimized for NVIDIA server systems☆30Updated this week
- Download full or partial git-lfs repos without temporarily using 2x disk space☆30Updated last year
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆15Updated 2 months ago
- SynapseAI Core is a reference implementation of the SynapseAI API running on Habana Gaudi☆38Updated 2 years ago
- ☆28Updated last week
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆14Updated 3 months ago
- CHAI is a library for dynamic pruning of attention heads for efficient LLM inference.☆11Updated last month
- the C++ version of Seq2Seq with ncnn☆23Updated 3 years ago
- Notes and artifacts from the ONNX steering committee☆25Updated last week
- This is a balanced dataset for English homograph disambiguation (HD), generated with Meta's Llama 2-Chat 70B model.☆12Updated 11 months ago
- ☆11Updated last month
- TAO Toolkit deep learning networks with TensorFlow 1.x backend☆13Updated 11 months ago