AMD-AGI / AMD-LLMLinks

☆191

Alternatives and similar repositories for AMD-LLM

Users that are interested in AMD-LLM are comparing it to the libraries listed below

Sorting:

slashml / amd_inference
Docker-based inference engine for AMD GPUs
☆230Updated last year
Foreseerr / TScale
☆199Updated 7 months ago
anordin95 / run-llama-locally
Run and explore Llama models locally with minimal dependencies on CPU
☆190Updated last year
trevorpogue / algebraic-nnhw
Algebraic enhancements for GEMM & AI accelerators
☆284Updated 9 months ago
moonshine-ai / qc_npu_benchmark
Code sample showing how to run and benchmark models on Qualcomm's Window PCs
☆104Updated last year
mlecauchois / micrograd-cuda
☆249Updated last year
joennlae / tensorli
Absolute minimalistic implementation of a GPT-like transformer using only numpy (<650 lines).
☆254Updated 2 years ago
robjinman / richard
Richard is gaining power
☆200Updated 6 months ago
valine / training-hot-swap
Pytorch script hot swap: Change code without unloading your LLM from VRAM
☆125Updated 8 months ago
brianmg / voynich-nlp-analysis
☆126Updated 6 months ago
rentruewang / bocoel
Bayesian Optimization as a Coverage Tool for Evaluating LLMs. Accurate evaluation (benchmarking) that's 10 times faster with just a few l…
☆287Updated 3 months ago
ross39 / new_bloom_filter_repo
This repo contains a new way to use bloom filters to do lossless video compression
☆250Updated 6 months ago
xetdata / onnx-models
A copy of ONNX models, datasets, and code all in one GitHub repository. Follow the README to learn more.
☆105Updated 2 years ago
M4THYOU / TokenDagger
High-Performance Implementation of OpenAI's TikToken.
☆465Updated 5 months ago
umuthopeyildirim / DOOM-Mistral
Mistral7B playing DOOM
☆138Updated last year
dipampaul17 / KVSplit
Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit …
☆362Updated 7 months ago
carsonpo / haystackdb
☆165Updated last year
kolinko / effort
An implementation of bucketMul LLM inference
☆222Updated last year
rodlaf / BinaryGPUIndex
A GPU Accelerated Binary Vector Store
☆47Updated 10 months ago
mikex86 / LibreCuda
☆1,072Updated 7 months ago
lights0123 / hipscript
Online compiler for HIP and NVIDIA® CUDA® code to WebGPU
☆204Updated 11 months ago
mikepapadim / llama-shepherd-cli
A CLI to manage install and configure llama inference implemenation in multiple languages
☆65Updated last year
DebarghaG / proofofthought
Proof of thought : LLM-based reasoning using Z3 theorem proving with multiple backend support (SMT2 and JSON DSL)
☆364Updated 2 months ago
samvher / bert-for-laptops
A BERT that you can train on a (gaming) laptop.
☆210Updated 2 years ago
felafax / felafax
Felafax is building AI infra for non-NVIDIA GPUs
☆568Updated 10 months ago
therealoliver / Deepdive-llama3-from-scratch
Achieve the llama3 inference step-by-step, grasp the core concepts, master the process derivation, implement the code.
☆612Updated 9 months ago
a1k0n / a1gpt
throwaway GPT inference
☆141Updated last year
neuroxhq / helm-chart-neurox-control
Neurox control helm chart details
☆30Updated 7 months ago
nirw4nna / dsc
Tensor library & inference framework for machine learning
☆115Updated 2 months ago
adenta / fire_red_agent
☆164Updated 8 months ago