scooter-lacroix / Stan-s-ML-StackLinks
A complete package that provides you with all the components needed to get started of dive deeper into Machine Learning Workloads on Consumer AMD cards, providing CUDA functionality through fully leveraging HIP and ROCm!
☆42Updated last month
Alternatives and similar repositories for Stan-s-ML-Stack
Users that are interested in Stan-s-ML-Stack are comparing it to the libraries listed below
Sorting:
- Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.☆260Updated last week
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆100Updated last month
- AMD (Radeon GPU) ROCm based setup for popular AI tools on Ubuntu 24.04.1☆216Updated last week
- Prometheus exporter for Linux based GDDR6/GDDR6X VRAM and GPU Core Hot spot temperature reader for NVIDIA RTX 3000/4000 series GPUs.☆24Updated last year
- ☆418Updated 8 months ago
- Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.☆165Updated last year
- InferX: Inference as a Service Platform☆142Updated this week
- Run LLMs on AMD Ryzen™ AI NPUs in minutes. Just like Ollama - but purpose-built and deeply optimized for the AMD NPUs.☆488Updated last week
- llama-swap + a minimal ollama compatible api☆37Updated this week
- General Site for the GFX803 ROCm Stuff☆127Updated 3 months ago
- GPU Power and Performance Manager☆62Updated last year
- An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs☆588Updated this week
- Core, Junction, and VRAM temperature reader for Linux + GDDR6/GDDR6X GPUs☆61Updated last month
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆12Updated 6 months ago
- ☆87Updated 2 weeks ago
- ☆48Updated 2 years ago
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆29Updated 6 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆84Updated last week
- ☆50Updated 2 months ago
- ☆228Updated 7 months ago
- Open source LLM UI, compatible with all local LLM providers.☆176Updated last year
- German "Who Wants To Be A Millionaire" LLM Benchmarking.☆46Updated last week
- AMD APU compatible Ollama. Get up and running with OpenAI gpt-oss, DeepSeek-R1, Gemma 3 and other models.☆133Updated last week
- Code for Papeg.ai☆227Updated 11 months ago
- Privacy-first agentic framework with powerful reasoning & task automation capabilities. Natively distributed and fully ISO 27XXX complian…☆68Updated 8 months ago
- Fresh builds of llama.cpp with AMD ROCm™ 7 acceleration☆129Updated this week
- Manifold is a platform for enabling workflow automation using AI assistants.☆468Updated last week
- A utility that uses Whisper to transcribe videos and various translation APIs to translate the transcribed text and save them as SRT (sub…☆74Updated last year
- Input your VRAM and RAM and the toolchain will produce a GGUF model tuned to your system within seconds — flexible model sizing and lowes…☆66Updated this week
- Web UI for ExLlamaV2☆514Updated 10 months ago