AMD-AIG-AIMA / AMD-LLM
☆186Updated 7 months ago
Alternatives and similar repositories for AMD-LLM:
Users who are interested in AMD-LLM are comparing it to the libraries listed below:
- Docker-based inference engine for AMD GPUs☆230Updated 6 months ago
- ☆238Updated this week
- Run and explore Llama models locally with minimal dependencies on CPU☆189Updated 6 months ago
- Code sample showing how to run and benchmark models on Qualcomm's Windows PCs☆96Updated 6 months ago
- Algebraic enhancements for GEMM & AI accelerators☆275Updated last month
- Dead Simple LLM Abliteration☆210Updated last month
- Animating R1's thoughts.☆372Updated last month
- A BERT that you can train on a (gaming) laptop.☆208Updated last year
- ☆163Updated 10 months ago
- Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).☆249Updated last year
- An implementation of bucketMul LLM inference☆216Updated 9 months ago
- ☆243Updated last year
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…☆281Updated last month
- Felafax is building AI infra for non-NVIDIA GPUs☆558Updated 2 months ago
- Richard is gaining power☆184Updated 4 months ago
- Fully neural approach for text chunking☆26Updated this week
- Mistral7B playing DOOM☆130Updated 9 months ago
- Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.☆566Updated last month
- throwaway GPT inference☆138Updated 10 months ago
- See Through Your Models☆376Updated last month
- Hierarchical Navigable Small Worlds☆94Updated last week
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator☆210Updated last year
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.☆105Updated last year
- The fastest 128-bit and 256-bit hash, passes all tests, and under 140 source lines of code. API library and CLI tool in C++ and NodeJS/Wa…☆125Updated 2 months ago
- ☆1,034Updated 4 months ago
- Agent Based Model on GPU using CUDA 12.2.1 and OpenGL 4.5 (CUDA OpenGL interop) on Windows/Linux☆70Updated last month
- Online compiler for HIP and NVIDIA® CUDA® code to WebGPU☆143Updated 3 months ago
- Sequential Logic☆111Updated last week
- GGUF implementation in C as a library and a tools CLI program☆264Updated 3 months ago
- A GPU Accelerated Binary Vector Store☆47Updated last month