triton for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60
☆47Dec 8, 2025Updated 5 months ago
Alternatives and similar repositories for triton-gfx906
Users that are interested in triton-gfx906 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- vLLM for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆414Feb 20, 2026Updated 3 months ago
- llama.cpp-gfx906☆130Mar 22, 2026Updated 2 months ago
- ML software (llama.cpp, ComfyUI, vLLM) builds for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆231Updated this week
- Proxy for OpenAI☆16Sep 2, 2025Updated 8 months ago
- FORK of VLLM for AMD MI25/50/60. A high-throughput and memory-efficient inference and serving engine for LLMs☆70May 4, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Smart OpenAI‑compatible proxy for llama.cpp: manages slots, saves/restores KV cache to disk, routes requests by prefix similarity, and pr…☆40Nov 14, 2025Updated 6 months ago
- Triton for AMD MI25/50/60. Development repository for the Triton language and compiler☆33Dec 15, 2025Updated 5 months ago
- Tries to UI development. Clone of https://www.perplexity.ai/☆11Sep 30, 2023Updated 2 years ago
- A collection of meticulously crafted technical indicators for TradingView's Pine Script 6, implemented with mathematical rigor versus for…☆37Mar 1, 2026Updated 2 months ago
- Mixed-precision quantization for LLMs. Every layer refracts into a different format based on its sensitivity. Native compressed-tensors e…☆72Updated this week
- CPU/GPU Implicit & Explicit Finite Element Solver for Large Strains☆24Feb 20, 2026Updated 3 months ago
- Cross platform media player with lossless support☆75Updated this week
- Qwen LLM in the mac menu bar <3☆27Mar 12, 2025Updated last year
- Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and in…☆18Nov 11, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Raptor is a modern, fast, and easy-to-use system for building disk images, bootable isos, containers and much more, from a simple, Docker…☆41Feb 10, 2026Updated 3 months ago
- Lsglang is a special extension of sglang that fully utilizes CPU and GPU computing resources with an efficient GPU parallel + NUMA parall…☆81Apr 22, 2026Updated last month
- Concurrent command / event bus in Go☆16Dec 3, 2017Updated 8 years ago
- LLM inference in C/C++☆21May 22, 2026Updated last week
- First-class state management via state machines☆66Jul 16, 2016Updated 9 years ago
- A wrapper to use AqBanking CLI from a PHP context☆13Dec 5, 2018Updated 7 years ago
- Lower Precision Floating Point Operations☆80Feb 22, 2026Updated 3 months ago
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆35Feb 12, 2025Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆120Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆11Jul 8, 2023Updated 2 years ago
- ☆68Feb 27, 2026Updated 3 months ago
- Autonomous AI Video Generation Tool☆31Mar 20, 2025Updated last year
- A truly open version of gpt-oss which shows the entire pre-training from scratch☆90Sep 4, 2025Updated 8 months ago
- ☆68May 2, 2026Updated 3 weeks ago
- ☆32May 20, 2026Updated last week
- ☆33Oct 29, 2023Updated 2 years ago
- OpenERP Client Library allows to easily interact with OpenERP in Python.☆31Feb 19, 2026Updated 3 months ago
- a terminal-based performance monitor for apple silicon☆17Jul 13, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- golang 注释实现类似java的注解机制。基于ast语法解析和monkey动态代理。目前实现@Transactional的demo