mikex86 / LibreCuda
☆1,038Updated 5 months ago
Alternatives and similar repositories for LibreCuda:
Users that are interested in LibreCuda are comparing it to the libraries listed below
- ☆444Updated last month
- Apple AMX Instruction Set☆1,078Updated 4 months ago
- Docker-based inference engine for AMD GPUs☆230Updated 7 months ago
- Online compiler for HIP and NVIDIA® CUDA® code to WebGPU☆148Updated 4 months ago
- Richard is gaining power☆186Updated 5 months ago
- Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, an…☆1,362Updated 2 weeks ago
- Nvidia Instruction Set Specification Generator☆260Updated 10 months ago
- Algebraic enhancements for GEMM & AI accelerators☆275Updated 2 months ago
- NVIDIA Linux open GPU with P2P support☆1,133Updated this week
- ☆241Updated last year
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs☆349Updated 3 weeks ago
- Reverse engineered Linux driver for the Apple Neural Engine (ANE).☆410Updated last year
- ☆187Updated 8 months ago
- Exploring the scalable matrix extension of the Apple M4 processor☆173Updated 6 months ago
- General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). …☆2,204Updated last month
- Exocompilation for productive programming of hardware accelerators☆600Updated this week
- GPUOcelot: A dynamic compilation framework for PTX☆187Updated 3 months ago
- Apple GPU microarchitecture☆519Updated 7 months ago
- VS Code extension for LLM-assisted code/text completion☆705Updated this week
- HIPIFY: Convert CUDA to Portable C++ Code☆574Updated this week
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙☆753Updated last week
- ☆187Updated this week
- Felafax is building AI infra for non-NVIDIA GPUs☆560Updated 3 months ago
- throwaway GPT inference☆139Updated 11 months ago
- GGUF implementation in C as a library and a tools CLI program☆270Updated 4 months ago
- Vim plugin for LLM-assisted code/text completion☆1,400Updated this week
- JSON for Classic C++☆717Updated 5 months ago
- llama3.np is a pure NumPy implementation for Llama 3 model.☆980Updated 2 weeks ago
- A faithful clone of Karpathy's llama2.c (one file inference, zero dependency) but fully functional with LLaMA 3 8B base and instruct mode…☆126Updated 9 months ago
- Apple G13 GPU architecture docs and tools☆584Updated last month