Linaro / tinyBLASLinks
A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support
☆17Updated 5 years ago
Alternatives and similar repositories for tinyBLAS
Users that are interested in tinyBLAS are comparing it to the libraries listed below
Sorting:
- tiny code to access tenstorrent blackhole☆57Updated 2 months ago
- Editor with LLM generation tree exploration☆73Updated 5 months ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆52Updated 5 months ago
- GGUF implementation in C as a library and a tools CLI program☆280Updated 7 months ago
- The Finite Field Assembly Programming Language☆36Updated 2 months ago
- Inference of Mamba models in pure C☆190Updated last year
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆73Updated 6 months ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated 2 years ago
- ☆59Updated last year
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆67Updated last month
- Inference Llama/Llama2/Llama3 Modes in NumPy☆21Updated last year
- 1.58 Bit LLM on Apple Silicon using MLX☆217Updated last year
- Lightweight Llama 3 8B Inference Engine in CUDA C☆47Updated 4 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates☆164Updated this week
- noise_step: Training in 1.58b With No Gradient Memory☆220Updated 7 months ago
- Train your own small bitnet model☆75Updated 9 months ago
- Mistral7B playing DOOM☆133Updated last year
- Inference RWKV v7 in pure C.☆37Updated 2 weeks ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13Updated last year
- A massively parallel, optimal functional runtime in Rust☆31Updated last year
- A JPEG Image Compression Service using Part Homomorphic Encryption.☆31Updated 5 months ago
- asynchronous/distributed speculative evaluation for llama3☆39Updated last year
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆211Updated last year
- Tensor library & inference framework for machine learning☆106Updated 3 weeks ago
- Tenstorrent console based hardware information program☆49Updated this week
- ☆61Updated 11 months ago
- ☆392Updated this week
- ☆196Updated 3 months ago
- A tiny version of GPT fully implemented in Python with zero dependencies☆72Updated 8 months ago
- ☆188Updated 11 months ago