mikex86 / LibreCuda
☆1,048 · Updated 3 months ago
Alternatives and similar repositories for LibreCuda
Users who are interested in LibreCuda are comparing it to the libraries listed below.
- NVIDIA Linux open GPU with P2P support ☆1,226 · Updated 2 months ago
- ☆449 · Updated 4 months ago
- ☆187 · Updated 11 months ago
- Up to 200x Faster Dot Products & Similarity Metrics for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, an… ☆1,452 · Updated last week
- Multi-Threaded FP32 Matrix Multiplication on x86 CPUs ☆351 · Updated 4 months ago
- ☆249 · Updated last year
- Online compiler for HIP and NVIDIA® CUDA® code to WebGPU ☆191 · Updated 7 months ago
- Exploring the scalable matrix extension of the Apple M4 processor ☆195 · Updated 9 months ago
- Nvidia Instruction Set Specification Generator ☆289 · Updated last year
- throwaway GPT inference ☆140 · Updated last year
- llama3.np is a pure NumPy implementation for Llama 3 model. ☆988 · Updated 3 months ago
- Richard is gaining power ☆194 · Updated 2 months ago
- A reimplementation of Stable Diffusion 3.5 in pure PyTorch ☆659 · Updated 2 months ago
- Felafax is building AI infra for non-NVIDIA GPUs ☆566 · Updated 7 months ago
- A modern model graph visualizer and debugger ☆1,306 · Updated this week
- Docker-based inference engine for AMD GPUs ☆231 · Updated 10 months ago
- GGUF implementation in C as a library and a tools CLI program ☆283 · Updated 7 months ago
- Solve Puzzles. Learn Metal 🤘 ☆580 · Updated 11 months ago
- Apple AMX Instruction Set ☆1,128 · Updated 8 months ago
- JSON for Classic C++ ☆747 · Updated 3 weeks ago
- ☆197 · Updated 3 months ago
- SCUDA is a GPU over IP bridge allowing GPUs on remote machines to be attached to CPU-only machines. ☆1,752 · Updated 2 months ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code. ☆605 · Updated 6 months ago
- Llama 2 Everywhere (L2E) ☆1,520 · Updated 7 months ago
- TT-NN operator library, and TT-Metalium low level kernel programming model. ☆1,090 · Updated this week
- Algebraic enhancements for GEMM & AI accelerators ☆278 · Updated 5 months ago
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator ☆211 · Updated last year
- VS Code extension for LLM-assisted code/text completion ☆917 · Updated last week
- Reverse engineered Linux driver for the Apple Neural Engine (ANE). ☆419 · Updated last year
- Open-source LLMOps platform for hosting and scaling AI in your own infrastructure 🏓🦙 ☆1,098 · Updated this week