Linaro / tinyBLAS
A fork of OpenBLAS with Armv8-A SVE (Scalable Vector Extension) support
☆13Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for tinyBLAS
- Tiny ASIC implementation for "The Era of 1-bit LLMs All Large Language Models are in 1.58 Bits" matrix multiplication unit☆111Updated 7 months ago
- llama.cpp fork with additional SOTA quants and improved performance☆93Updated this week
- GPU Power and Performance Manager☆48Updated last month
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆41Updated last month
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆12Updated 6 months ago
- Run 64-bit Linux on LiteX + RocketChip☆188Updated 3 months ago
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆100Updated 3 weeks ago
- This project aims to enable language model inference on FPGAs, supporting AI applications in edge devices and environments with limited r…☆137Updated 6 months ago
- ☆179Updated 2 months ago
- Open-Source Software for Designing 3D-Printable Luneburg Lenses for RF Applications☆63Updated 3 weeks ago
- prime is a framework for efficient, globally distributed training of AI models over the internet.☆212Updated this week
- ☆45Updated this week
- new optimizer☆19Updated 3 months ago
- Documentation for the AI in a Box project☆33Updated last year
- The FPGA application for Monocle's graphics, camera and microphone accelerators☆48Updated 11 months ago
- ☆40Updated last year
- Mistral7B playing DOOM☆122Updated 4 months ago
- Verilog design examples for use with the Signaloid C0-microSD☆37Updated 2 weeks ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆43Updated 6 months ago
- Open-source LLM app starter templates – easily get started with a systematic, rapid workflow for taking an LLM app from prototype to prod…☆9Updated last month
- ☆64Updated 5 months ago
- 1.58 Bit LLM on Apple Silicon using MLX☆148Updated 6 months ago
- Distributed Inference for mlx LLm☆70Updated 3 months ago
- A library for incremental loading of large PyTorch checkpoints☆56Updated last year
- Running a LLM on the ESP32☆45Updated last month
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆207Updated 11 months ago
- a curated list of data for reasoning ai☆113Updated 3 months ago
- Large Model Proxy is designed to make it easy to run multiple resource-heavy Large Models (LM) on the same machine with limited amount of…☆47Updated last month
- Onboarding documentation source for the AMD Ryzen™ AI Software Platform. The AMD Ryzen™ AI Software Platform enables developers to take…☆47Updated this week
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆31Updated 4 months ago