Linaro / tinyBLAS
A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support
☆16 Updated 4 years ago
Alternatives and similar repositories for tinyBLAS:
Users who are interested in tinyBLAS are comparing it to the repositories listed below.
- Editor with LLM generation tree exploration ☆65 Updated last month
- Lightweight Llama 3 8B Inference Engine in CUDA C ☆47 Updated last week
- ☆53 Updated 7 months ago
- The Finite Field Assembly Programming Language ☆36 Updated this week
- A faithful clone of Karpathy's llama2.c (one file inference, zero dependency) but fully functional with LLaMA 3 8B base and instruct mode… ☆125 Updated 8 months ago
- asynchronous/distributed speculative evaluation for llama3 ☆39 Updated 7 months ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU ☆13 Updated 10 months ago
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho… ☆108 Updated 3 weeks ago
- LLM Divergent Thinking Creativity Benchmark. LLMs generate 25 unique words that start with a given letter with no connections to each oth… ☆32 Updated 2 weeks ago
- ☆64 Updated this week
- ☆17 Updated last week
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). … ☆44 Updated last month
- ☆29 Updated 3 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe… ☆55 Updated last month
- ☆66 Updated 10 months ago
- The DPAB-α Benchmark ☆19 Updated 2 months ago
- A playground to make it easy to try crazy things ☆33 Updated this week
- A massively parallel, optimal functional runtime in Rust ☆31 Updated 7 months ago
- George is an API leveraging AI to make it easy to control a computer with natural language. ☆43 Updated 3 months ago
- Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks ☆31 Updated 10 months ago
- look how they massacred my boy ☆63 Updated 5 months ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices. ☆14 Updated 2 months ago
- Fast parallel LLM inference for MLX ☆177 Updated 8 months ago
- Interpolate between embedding points with llm ☆36 Updated 8 months ago
- ☆56 Updated 8 months ago
- Fun with wgpu: Simulating slime mold ☆24 Updated 7 months ago
- Prompt-based software development ☆23 Updated 7 months ago
- GPU-targeted vendor-agnostic AI library for Windows, and Mistral model implementation. ☆55 Updated last year
- Open-source LLM app starter templates – easily get started with a systematic, rapid workflow for taking an LLM app from prototype to prod… ☆10 Updated 6 months ago
- Moxin is a family of fully open-source and reproducible LLMs ☆85 Updated 2 weeks ago