Linaro / tinyBLAS
A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support
☆17 Updated 4 years ago
Alternatives and similar repositories for tinyBLAS:
Users who are interested in tinyBLAS compare it to the libraries listed below:
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU ☆13 Updated 11 months ago
- Editor with LLM generation tree exploration ☆66 Updated 2 months ago
- Lightweight Llama 3 8B Inference Engine in CUDA C ☆47 Updated last month
- ☆54 Updated 8 months ago
- ☆23 Updated 5 months ago
- A faithful clone of Karpathy's llama2.c (one file inference, zero dependency) but fully functional with LLaMA 3 8B base and instruct mode… ☆125 Updated 9 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆56 Updated 2 months ago
- The Finite Field Assembly Programming Language ☆36 Updated 2 weeks ago
- A library for incremental loading of large PyTorch checkpoints ☆56 Updated 2 years ago
- ☆19 Updated last month
- Framework for specifying and proving properties—such as robustness, fairness, and interpretability—of machine learning models using Lean … ☆58 Updated last month
- Simple LLM inference server ☆20 Updated 10 months ago
- A JPEG Image Compression Service using Part Homomorphic Encryption. ☆30 Updated last month
- look how they massacred my boy ☆63 Updated 6 months ago
- Code sample showing how to run and benchmark models on Qualcomm's Windows PCs ☆96 Updated 6 months ago
- Schola is a plugin for enabling Reinforcement Learning (RL) in Unreal Engine. It provides tools to help developers create environments, d… ☆34 Updated 3 weeks ago
- Because it's there. ☆16 Updated 7 months ago
- Lego for GRPO ☆27 Updated 3 weeks ago
- Make DSP Great Again. The Chata programming language! ☆63 Updated last month
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan… ☆18 Updated 3 weeks ago
- ☆66 Updated 11 months ago
- Open-source LLM app starter templates – easily get started with a systematic, rapid workflow for taking an LLM app from prototype to prod… ☆10 Updated 6 months ago
- Prompt-based software development ☆23 Updated 8 months ago
- Inference of Mamba models in pure C ☆187 Updated last year
- TensorRT-LLM server with Structured Outputs (JSON) built with Rust ☆49 Updated 3 weeks ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). … ☆46 Updated 2 months ago
- A minimalistic C++ Jinja templating engine for LLM chat templates ☆132 Updated last week
- Train your own small bitnet model ☆67 Updated 6 months ago
- Golf is a programming language, framework and application server for high-performance web services and web applications, with focus on … ☆44 Updated this week
- Prepare for DeepSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code. ☆71 Updated 2 months ago