eliranwong / MultiAMDGPU_AIDev_Ubuntu
Multi AMD GPU Setup for AI Development on Ubuntu with ROCM
☆31Updated last month
Alternatives and similar repositories for MultiAMDGPU_AIDev_Ubuntu
Users that are interested in MultiAMDGPU_AIDev_Ubuntu are comparing it to the libraries listed below
Sorting:
- Local LLM Server with NPU Acceleration☆180Updated last week
- GPU Power and Performance Manager☆58Updated 7 months ago
- Prepare for DeekSeek R1 inference: Benchmark CPU, DRAM, SSD, iGPU, GPU, ... with efficient code.☆72Updated 3 months ago
- automatically quant GGUF models☆175Updated this week
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆61Updated this week
- Distributed Inference for mlx LLm☆91Updated 9 months ago
- ☆130Updated 2 weeks ago
- NVIDIA Linux open GPU with P2P support☆22Updated 2 weeks ago
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆115Updated 10 months ago
- vLLM: A high-throughput and memory-efficient inference and serving engine for LLMs☆86Updated this week
- LLM inference in C/C++☆77Updated this week
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆29Updated 2 months ago
- Review/Check GGUF files and estimate the memory usage and maximum tokens per second.☆161Updated this week
- LLM inference in C/C++☆21Updated last month
- Running Microsoft's BitNet via Electron, React & Astro☆38Updated 3 weeks ago
- ☆69Updated this week
- A fast batching API to serve LLM models☆182Updated last year
- Conduct in-depth research with AI-driven insights : DeepDive is a command-line tool that leverages web searches and AI models to generate…☆42Updated 8 months ago
- High-speed and easy-use LLM serving framework for local deployment☆103Updated last month
- Inference RWKV v7 in pure C.☆33Updated last month
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆39Updated 8 months ago
- GroqFlow provides an automated tool flow for compiling machine learning and linear algebra workloads into Groq programs and executing tho…☆109Updated 2 months ago
- An OpenAI API compatible images server to generate or manipulate images.☆16Updated 3 months ago
- Implementation of nougat that focuses on processing pdf locally.☆81Updated 4 months ago
- ☆94Updated this week
- ☆31Updated last year
- Controllable Language Model Interactions in TypeScript☆9Updated 11 months ago
- Experimental LLM Inference UX to aid in creative writing☆116Updated 5 months ago
- A fork of vLLM enabling Pascal architecture GPUs☆28Updated 2 months ago
- ☆202Updated 3 weeks ago