Agent skills for vLLM
☆67Apr 3, 2026Updated last month
Alternatives and similar repositories for vllm-skills
Users that are interested in vllm-skills are comparing it to the libraries listed below.
- Augmented Dickey-Fuller implementation in Go☆12Mar 15, 2019Updated 7 years ago
- Nabla Containers blog☆12May 26, 2021Updated 4 years ago
- Python library to add support for embedding natural code in Python with shared program state.☆30Jan 20, 2026Updated 3 months ago
- NVIDIA Networking NIC Configuration Operator For Kubernetes☆18May 3, 2026Updated last week
- An asynchronous streaming data management module for efficient post-training.☆65May 2, 2026Updated last week
- Memory Topology for GPUs☆19Apr 22, 2026Updated 2 weeks ago
- AC No Code is the lazy person's best way to get code accepted (AC) on an online judge (OJ): Write nothing; submit nowhere.☆10May 18, 2020Updated 5 years ago
- ☆24Updated this week
- Spatial Transformer Network (STN) provides attention to a particular region in an image by applying a transformation to the input image. T…☆15Dec 21, 2020Updated 5 years ago
- turboquant-based compression engine for LLM KV cache☆58Apr 3, 2026Updated last month
- Demos of many Rosetta applications☆25Jun 10, 2025Updated 10 months ago
- The vLLM XPU kernels for Intel GPU☆40Apr 30, 2026Updated last week
- Genomics resources for the Long-Term Evolution Experiment with E. coli☆19Jul 13, 2025Updated 9 months ago
- Prototype MegaScale-Infer: Serving Mixture-of-Experts at Scale with Disaggregated Expert Parallelism☆30Apr 4, 2025Updated last year
- S.A.S.S.A.F.R.A.S. : a simple automatic scholar sorter appropriate for researchers and scientists☆20Oct 1, 2025Updated 7 months ago
- Kubernetes enhancements for Network Topology Aware Gang Scheduling & Autoscaling☆201May 3, 2026Updated last week
- POCO: Pareto-Optimal Controller Placement☆25May 4, 2018Updated 8 years ago
- Open source version of DOCA GPUNetIO and DOCA Verbs libraries (limited features) to enable GDAKI technology on RDMA (IB and RoCE)☆46May 1, 2026Updated last week
- Intel® SHMEM - Device initiated shared memory based communication library☆32Nov 12, 2025Updated 5 months ago
- Converting text-LMs into Visual Language Models☆63Jan 31, 2026Updated 3 months ago
- Hi 👋 memset0 here!☆18Updated this week
- ☆19Jan 29, 2026Updated 3 months ago
- A small experiment on assigning a process's threads to a specific CPU and then blocking it with a high-priority thread☆33Sep 24, 2025Updated 7 months ago
- ☆47Mar 15, 2025Updated last year
- [NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning☆68Oct 31, 2025Updated 6 months ago
- IELTS exam preparation☆15Oct 24, 2024Updated last year
- Accelerating MoE with IO and Tile-aware Optimizations☆664May 3, 2026Updated last week
- An experimental communicating attention kernel based on DeepEP.☆34Jul 29, 2025Updated 9 months ago
- ☆16Apr 14, 2026Updated 3 weeks ago
- ☆10Aug 16, 2019Updated 6 years ago
- ☆45May 4, 2025Updated last year
- ☆13May 17, 2020Updated 5 years ago
- Tool for obtaining information about PPL processes☆16Feb 12, 2024Updated 2 years ago
- Self-Loading Registration Free COM Functions☆11Nov 12, 2019Updated 6 years ago
- Flow - Modern C++ toolkit for async loops, logs, config, benchmarking, and more [See also `ipc` repo]☆14Updated this week
- Generate Go bindings for shared C libraries.☆18Jul 13, 2024Updated last year
- High-performance LLM operator library built on TileLang.☆118Updated this week
- A WebSocket protocol implementation with a multithreaded synchronization model in C++☆17Jul 23, 2017Updated 8 years ago
- ☆11Jan 8, 2022Updated 4 years ago