vLLM Daily Summarization of Merged PRs
☆50Apr 15, 2026Updated this week
Alternatives and similar repositories for vllm-daily
Users that are interested in vllm-daily are comparing it to the libraries listed below.
- ☆33Apr 19, 2025Updated last year
- Development containers for triton and triton-cpu☆27Updated this week
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆31Apr 22, 2025Updated 11 months ago
- ☆17Dec 8, 2023Updated 2 years ago
- Expert Specialization MoE Solution based on CUTLASS☆26Updated this week
- Hex encode & decode a string, right from your terminal.☆10Jan 5, 2023Updated 3 years ago
- Implements kernels with RISC-V Vector☆22Mar 24, 2023Updated 3 years ago
- Kernel sources for https://huggingface.co/kernels-community☆102Updated this week
- Triton-based Symmetric Memory operators and examples☆98Mar 28, 2026Updated 3 weeks ago
- Custom recipes for NVIDIA Nsight Systems post-collection analysis.☆16Nov 7, 2025Updated 5 months ago
- USTC Computational Physics A☆10Aug 16, 2021Updated 4 years ago
- JAX bindings for the flash-attention3 kernels☆22Jan 2, 2026Updated 3 months ago
- A reading group for system verification papers☆10Sep 28, 2023Updated 2 years ago
- A NoneBot2 plugin that provides a NoneBot1 compatibility layer☆10Sep 20, 2021Updated 4 years ago
- ☆18Apr 13, 2026Updated last week
- Official implementation for paper "FedGame: A Game-Theoretic Defense against Backdoor Attacks in Federated Learning" (NeurIPS 2023).☆13Oct 25, 2024Updated last year
- PyTorch distributed backend extension with compression support☆17Mar 24, 2025Updated last year
- ☆20Apr 18, 2024Updated 2 years ago
- Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruct…☆21Dec 9, 2025Updated 4 months ago
- Perplexity GPU Kernels☆567Nov 7, 2025Updated 5 months ago
- ☆10May 26, 2020Updated 5 years ago
- a collection of skills for vllm-omni☆53Apr 14, 2026Updated last week
- ☆16Nov 2, 2022Updated 3 years ago
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated last year
- Exploring how optimizations for GEMMs work☆30Feb 28, 2026Updated last month
- FoF Upload, but with TencentCloud COS☆14Nov 10, 2024Updated last year
- Community maintained hardware plugin for vLLM on AWS Neuron☆28Mar 20, 2026Updated last month
- Learn LaTeX online☆15Apr 1, 2022Updated 4 years ago
- ☆164Feb 15, 2025Updated last year
- Scala 3 Standard Library with bracket syntax.☆11Jul 10, 2021Updated 4 years ago
- Standalone commandline CLI tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- Discuz Q.☆16Dec 23, 2020Updated 5 years ago
- ☆15Apr 28, 2023Updated 2 years ago
- oneAPI - Data Parallel C++ course for students☆44Nov 4, 2024Updated last year
- ☆22May 5, 2025Updated 11 months ago
- Type-safe NBT parser with kotlinx.serialization, SNBT and JSON support.☆13Mar 4, 2023Updated 3 years ago
- You type with spaces☆11Oct 2, 2023Updated 2 years ago
- ☆19Nov 6, 2023Updated 2 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago