vLLM Kunlun (vllm-kunlun) is a community-maintained hardware plugin designed to seamlessly run vLLM on the Kunlun XPU.
☆390Mar 27, 2026Updated this week
Alternatives and similar repositories for vLLM-Kunlun
Users that are interested in vLLM-Kunlun are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Qianfan-VL: Domain-Enhanced Universal Vision-Language Models☆339Mar 18, 2026Updated last week
- Showcasing my 2025 USACO US Open dual perfect scores (1000/1000 in both Gold & Silver divisions) — one of only 8 U.S. high schoolers nati…☆472Feb 22, 2026Updated last month
- 从零实现语言模型的搭建、训练、部署☆82Feb 10, 2026Updated last month
- My personal website & professional portfolio on GitHub Pages. Showcases resume, peer-reviewed publications (IEEE AIAM 2025 & IJHSR), dual…☆276Mar 1, 2026Updated 3 weeks ago
- Portfolio of capstone projects from AI & Technology Honors, Data Science Honors, and AI Internship Honors. Showcases advanced Python work…☆285Feb 16, 2026Updated last month
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- ☆20Sep 29, 2025Updated 6 months ago
- The main purpose of runtime copilot is to assist with node runtime management tasks such as configuring registries, upgrading versions, i…☆12May 16, 2023Updated 2 years ago
- Karmada APIs☆15Mar 10, 2026Updated 2 weeks ago
- Wukong AI-CRM 15 is a comprehensive customer relationship management system that supports the entire business process from lead acquisiti…☆505Dec 29, 2025Updated 3 months ago
- API Extensions for core KubeVela.☆14Feb 1, 2026Updated last month
- KubeSphere 3.2.1 快速入门☆11Jun 15, 2022Updated 3 years ago
- kubernetes client to☆11May 27, 2022Updated 3 years ago
- A fully modular, framework-agnostic, easy-to-extend SDK for building complex X402 payment integrations.☆52Mar 19, 2026Updated last week
- A light implementation of megatron for research and study☆490Dec 22, 2025Updated 3 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆480Aug 8, 2025Updated 7 months ago
- A simple script to plot the Roofline model for given HW platforms and applications☆10Mar 17, 2026Updated last week
- Let CI Robot automatically execute commands for your PR/issue in your Github repository, hosting on Github Action does not require your s…☆13Updated this week
- ☆18Jun 7, 2022Updated 3 years ago
- MathLens 是一个专注于数学题目视频讲解的 Agent Skill。你只需粘贴一道数学题(图片或文字),它就能自动完成从题目分析、可视化讲解、配音脚本到 Manim 动画视频的全流程制作。单条视频1-10 分钟,成本 0.2-1 元以内。☆305Mar 10, 2026Updated 2 weeks ago
- Typecript version of https://github.com/openai/symphony☆557Mar 13, 2026Updated 2 weeks ago
- ArcticDB-backed time series cache with incremental updates — fetch once, upsert the gap.☆57Mar 22, 2026Updated last week
- ☆15May 28, 2022Updated 3 years ago
- 通过阅读代码,自动生成文档☆16Apr 27, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Multi-model DAG-driven parallel AI film generation — parallel speedup scales with scene independence; Generate film scenes simultaneously…☆155Updated this week
- Community maintained hardware plugin for vLLM on MetaX GPU☆120Updated this week
- Let AI agents run experiments in any repo while you sleep.☆318Mar 20, 2026Updated last week
- 使用ONNXRuntime部署YOLOV7人头检测,包含C++和Python两个版本的程序☆30Nov 5, 2022Updated 3 years ago
- [Moved to https://github.com/kubernetes-sigs/kwok] fake-k8s is a tool for running Fake Kubernetes clusters, It can be used as an alternat…☆19Jan 6, 2023Updated 3 years ago
- Comments and Feedbacks☆11Dec 25, 2025Updated 3 months ago
- 这里将paddle中的ocr等模型转为onnx格式,并利用java版深度框架djl加载这些onnx模型进行推理预测尝试。☆13Nov 15, 2022Updated 3 years ago
- 函数工作流可视化编排解决方案☆11Nov 1, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆16Feb 18, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A lightweight operating system abstraction layer for agents.☆16Dec 26, 2025Updated 3 months ago
- a repo store the new tech stack's tutorial and usage☆11May 25, 2022Updated 3 years ago
- A lightweight client-server system for real-time audio processing with voice activity detection (VAD) and automatic speech recognition (A…☆460Jan 4, 2026Updated 2 months ago
- Deduplication over dis-aggregated memory for Serverless Computing☆14Mar 21, 2022Updated 4 years ago
- A rust-version of NVIDIA BlueField DOCA kit.☆14Jun 11, 2023Updated 2 years ago
- ☆20Sep 29, 2025Updated 6 months ago
- UpgradeLink offering one-stop application upgrade solutions for developers and enterprises. 为开发者和企业提供一站式应用升级解决方案。☆481Mar 16, 2026Updated last week