A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
☆45May 1, 2025Updated 11 months ago
Alternatives and similar repositories for ktransformers
Users that are interested in ktransformers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- handle gguf files☆13Aug 14, 2025Updated 8 months ago
- kun-chat is a lightweight AI conversation app based on Ollama/kun-chat 是一款基于 Ollama 的轻量级 AI 对话应用☆10Jul 16, 2025Updated 9 months ago
- vs code extension postman☆11Jun 22, 2023Updated 2 years ago
- Socks5 Proxy based on Websocket.☆15Jul 10, 2020Updated 5 years ago
- 🐸你的服务器除了用来吃灰外,还可以拿来续☆11Mar 25, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A basic JSON library in modern C++☆16Aug 23, 2021Updated 4 years ago
- A Flexible Framework for Experiencing Heterogeneous LLM Inference/Fine-tune Optimizations☆17,070Updated this week
- ☆13Oct 21, 2023Updated 2 years ago
- This Elgg plugin lets users preview MS Office files (doc, docx, xls, xlsx, ppt, pptx), Apple iWork pages, Adobe eps, and zip files using …☆12Aug 28, 2015Updated 10 years ago
- A benchmark of real-world DL kernel problems☆181Apr 15, 2026Updated 2 weeks ago
- ☆14Apr 1, 2026Updated 3 weeks ago
- 这是一个工具库☆14Feb 25, 2026Updated 2 months ago
- Continuous Benchmark for cache libraries written in golang.☆12Mar 26, 2023Updated 3 years ago
- lean's openwrt/lede for xiaomi router 4A (R4AC)☆12Oct 9, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Luthier, a GPU binary instrumentation tool for AMD GPUs☆28Apr 20, 2026Updated last week
- ☆11Dec 5, 2016Updated 9 years ago
- ☆20Dec 27, 2024Updated last year
- 凯路创新水表蓝牙SDK☆16May 26, 2017Updated 8 years ago
- Generate Linux Perf event tables for Apple Silicon☆17Dec 16, 2025Updated 4 months ago
- Stanford CS 110L : Safety in Systems Programming☆12Mar 8, 2021Updated 5 years ago
- Triton for OpenCL backend, and use mlir-translate to get source OpenCL code☆27Aug 27, 2025Updated 8 months ago
- 👀 网易云热评爬虫 - 直男福利☆13Aug 9, 2019Updated 6 years ago
- SIEVE is simpler than LRU☆15Apr 29, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Compiler plugin for performance analysis of HIP applications☆13Apr 7, 2025Updated last year
- ILAng documentation☆10Nov 2, 2025Updated 5 months ago
- ☆14Oct 30, 2024Updated last year
- Artifact Evaluation for SpecFS [FAST'26]☆30Dec 28, 2025Updated 4 months ago
- 小游戏:吃掉小涩图☆16Nov 3, 2024Updated last year
- RISC-V vector and tensor compute extensions for Vortex GPGPU acceleration for ML workloads. Optimized for transformer models, CNNs, and g…☆23Apr 25, 2025Updated last year
- eBPF tool to collect BOLT profile☆14Apr 9, 2026Updated 3 weeks ago
- ☆34Updated this week
- hdfs client impl with pure rust☆19Jan 9, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Large-scale Auto-Distributed Training/Inference Unified Framework | Memory-Compute-Control Decoupled Architecture | Multi-language SDK & …☆55Jan 30, 2026Updated 2 months ago
- Wrapper shells enabling designs generated by rocket-chip to map onto certain FPGA boards☆20Nov 27, 2024Updated last year
- A docker image for One Student One Chip's debug exam☆10Sep 22, 2023Updated 2 years ago
- An interface for works with gorilla mux☆14Feb 15, 2024Updated 2 years ago
- ☆15Dec 9, 2025Updated 4 months ago
- ☆15May 27, 2019Updated 6 years ago
- Shor's algorithm simulation using CUDA☆19Nov 10, 2019Updated 6 years ago