llama.cpp-gfx906
☆120Mar 22, 2026Updated 3 weeks ago
Alternatives and similar repositories for llama.cpp-gfx906
Users that are interested in llama.cpp-gfx906 are comparing it to the libraries listed below.
- triton for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆45Dec 8, 2025Updated 4 months ago
- vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆403Feb 20, 2026Updated last month
- ML software (llama.cpp, ComfyUI, vLLM) builds for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆157Updated this week
- Triton for AMD MI25/50/60. Development repository for the Triton language and compiler☆32Dec 15, 2025Updated 4 months ago
- The real GPT-4 with image access (You probably don't have access)☆12Mar 17, 2023Updated 3 years ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆117Updated this week
- Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends☆57Aug 21, 2025Updated 7 months ago
- no further ado.. just give me the workflows!☆112Mar 1, 2026Updated last month
- VHDL ieee_proposed library, imported as is. See also https://github.com/FPHDL/fphdl☆12Aug 26, 2016Updated 9 years ago
- ☆20May 8, 2012Updated 13 years ago
- Encryption and signing for a post quantum world☆17Apr 4, 2023Updated 3 years ago
- KTransformers one-click deployment script☆60Apr 18, 2025Updated last year
- ☆14Sep 4, 2024Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆28Jan 19, 2025Updated last year
- High-throughput logic analyzer for FPGA☆16Oct 8, 2020Updated 5 years ago
- Tries to UI development. Clone of https://www.perplexity.ai/☆11Sep 30, 2023Updated 2 years ago
- Starcraft 2 Replay in Rerun☆24Oct 26, 2025Updated 5 months ago
- A high-performance Python library for automating AdGuard filter list management. Create, deduplicate, sort, optimize, and validate ad-blo…☆18Apr 1, 2026Updated 2 weeks ago
- high level VHDL floating point library for synthesis in fpga☆18Dec 18, 2025Updated 4 months ago
- Awesome AI Benchmarks☆29Jan 16, 2026Updated 3 months ago
- Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parall…☆52Apr 12, 2026Updated last week
- Exploring Shared Virtual Memory Abstractions in OpenCL Tools for FPGAs☆18Dec 7, 2017Updated 8 years ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆19Jan 10, 2025Updated last year
- ☆24Mar 26, 2026Updated 3 weeks ago
- Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…☆20Mar 28, 2026Updated 3 weeks ago
- Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and in…☆18Nov 11, 2024Updated last year
- ☆11Nov 10, 2024Updated last year
- ☆17Dec 16, 2024Updated last year
- Framework for building transparent memory encryption and authentication solutions☆27Jun 19, 2018Updated 7 years ago
- ☆58Feb 18, 2025Updated last year
- Firecracker VM orchestration for Claude Code sessions☆25Mar 30, 2026Updated 2 weeks ago
- Downloads books from the amazon web reader☆30Oct 15, 2025Updated 6 months ago
- Ongoing research training transformer language models at scale, including: BERT & GPT-2☆20Oct 23, 2023Updated 2 years ago
- Measuring Thinking Efficiency in Reasoning Models - Research Repository☆39Dec 2, 2025Updated 4 months ago
- Rust Mini Game Framework☆23May 27, 2024Updated last year
- Encoding and decoding for ARF strings☆15Mar 10, 2025Updated last year
- LLM inference in C/C++☆21Apr 9, 2026Updated last week
- LLM training in simple, raw C/HIP for AMD GPUs☆62Sep 23, 2024Updated last year
- 1 Byte Currency ISO type for PostgreSQL☆20Mar 14, 2025Updated last year