AMD-AIG-AIMA / AMD-LLM
☆187 · Updated 9 months ago
Alternatives and similar repositories for AMD-LLM
Users interested in AMD-LLM are comparing it to the libraries listed below.
- Code sample showing how to run and benchmark models on Qualcomm's Windows PCs ☆99 · Updated 8 months ago
- Docker-based inference engine for AMD GPUs ☆231 · Updated 8 months ago
- ☆196 · Updated last month
- Run and explore Llama models locally with minimal dependencies on CPU ☆190 · Updated 8 months ago
- Algebraic enhancements for GEMM & AI accelerators ☆277 · Updated 3 months ago
- Dead Simple LLM Abliteration ☆219 · Updated 4 months ago
- Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit … ☆351 · Updated last month
- An implementation of bucketMul LLM inference ☆217 · Updated 11 months ago
- Tensor library & inference framework for machine learning ☆77 · Updated last week
- ☆163 · Updated last year
- PyTorch script hot swap: change code without unloading your LLM from VRAM ☆126 · Updated 2 months ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per-token costs. Use our tool for efficient … ☆221 · Updated 6 months ago
- ☆121 · Updated 3 weeks ago
- GPU-targeted, vendor-agnostic AI library for Windows, and a Mistral model implementation ☆58 · Updated last year
- Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l… ☆285 · Updated 3 weeks ago
- A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more. ☆105 · Updated last year
- Mistral 7B playing DOOM ☆132 · Updated 11 months ago
- Throwaway GPT inference ☆140 · Updated last year
- Online compiler for HIP and NVIDIA® CUDA® code to WebGPU ☆181 · Updated 5 months ago
- A minimalist implementation of a GPT-like transformer using only NumPy (<650 lines) ☆252 · Updated last year
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙 ☆782 · Updated this week
- See Through Your Models ☆394 · Updated 3 months ago
- ☆340 · Updated this week
- Repository for the QUIK project, enabling the use of 4-bit kernels for generative inference (EMNLP 2024) ☆180 · Updated last year
- Fully Open Language Models with Stellar Performance ☆231 · Updated last week
- Wang Yi's GPT solution ☆141 · Updated last year
- GGUF implementation in C as a library and a CLI tool ☆273 · Updated 5 months ago
- A CLI to manage, install, and configure llama inference implementations in multiple languages ☆67 · Updated last year
- A playground to make it easy to try crazy things ☆33 · Updated last week
- Hashed Lookup Table based Matrix Multiplication (halutmatmul) - Stella Nera accelerator ☆211 · Updated last year