Agent skills for vLLM
☆59Apr 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for vllm-skills
Users who are interested in vllm-skills are comparing it to the libraries listed below.
- Augmented Dickey-Fuller implementation in Go☆12Mar 15, 2019Updated 7 years ago
- Nabla Containers blog☆12May 26, 2021Updated 4 years ago
- Python library to add support for embedding natural code in Python with shared program state.☆29Jan 20, 2026Updated 2 months ago
- ☆10Apr 7, 2020Updated 6 years ago
- NVIDIA Networking NIC Configuration Operator For Kubernetes☆16Apr 12, 2026Updated last week
- An asynchronous streaming data management module for efficient post-training.☆49Updated this week
- Memory Topology for GPUs☆19Mar 26, 2026Updated 3 weeks ago
- AC No Code is the lazy person's best way to get code accepted (AC) on an online judge (OJ): Write nothing; submit nowhere.☆10May 18, 2020Updated 5 years ago
- See vLLM official support: https://github.com/vllm-project/vllm-ascend☆11Feb 5, 2025Updated last year
- Repository for out-of-tree scheduler plugins based on scheduler framework.☆12Mar 6, 2026Updated last month
- ☆24Updated this week
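- Easy Scheduler is a distributed workflow task scheduling system that mainly addresses the tangled dependencies in data-development ETL and the inability to monitor task health at a glance. Easy Scheduler assembles Tasks as a DAG flow, allows real-time monitoring of task run status, and supports retry, recovery from failure at a specified node, pause, and Kil…☆10Apr 9, 2019Updated 7 years ago
- Spatial Transformer Network (STN) provides attention to a particular region in an image by transforming the input image. T…☆15Dec 21, 2020Updated 5 years ago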
- turboquant-based compression engine for LLM KV cache☆57Apr 3, 2026Updated 2 weeks ago
- Autoscaling components for Kubernetes☆21Updated this week
- Demos of many Rosetta applications☆25Jun 10, 2025Updated 10 months ago
- SFC controller: extension to the default scheduler (Kube-Scheduler) in Kubernetes to enable scheduling in terms of latency and bandwidth☆19Jul 3, 2020Updated 5 years ago
- ☆79Mar 22, 2026Updated 3 weeks ago
- Bash script to control clock speed, memory speed, power limit, user defined automatic fan control, LED brightness and power state for Nvi…☆24May 29, 2021Updated 4 years ago
- VR-EXP: An Experimentation Platform for Adaptive Virtual Reality Video Streaming☆15May 19, 2022Updated 3 years ago
- Container Level Energy-efficient VPA Recommender☆27Mar 19, 2026Updated last month
- Prototype of MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism☆29Apr 4, 2025Updated last year
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling☆194Updated this week
- S.A.S.S.A.F.R.A.S. : a simple automatic scholar sorter appropriate for researchers and scientists☆20Oct 1, 2025Updated 6 months ago
- The Kubernetes operator for InfluxDB and the TICK stack.☆27Mar 30, 2022Updated 4 years ago
- Open source version of DOCA GPUNetIO and DOCA Verbs libraries (limited features) to enable GDAKI technology on RDMA (IB and RoCE)☆40Updated this week
- Intel® SHMEM - Device initiated shared memory based communication library☆32Nov 12, 2025Updated 5 months ago
- A small experiment on assigning a process's threads to a specific CPU and then blocking it with a high-priority thread☆33Sep 24, 2025Updated 6 months ago
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning☆68Oct 31, 2025Updated 5 months ago
- Accelerating MoE with IO and Tile-aware Optimizations☆630Apr 1, 2026Updated 2 weeks ago
- IELTS exam preparation☆15Oct 24, 2024Updated last year
- High-performance LLM operator library built on TileLang.☆102Updated this week
- An Operator for running the Vertical Pod Autoscaler on OpenShift☆33Mar 17, 2026Updated last month
- llm-d benchmark scripts and tooling☆55Apr 11, 2026Updated last week
- ☆33Jun 11, 2018Updated 7 years ago
- An experimental communicating attention kernel based on DeepEP.☆35Jul 29, 2025Updated 8 months ago
- ☆45May 4, 2025Updated 11 months ago
- ☆10Aug 16, 2019Updated 6 years ago
- ☆13May 17, 2020Updated 5 years ago