vLLM Daily Summarization of Merged PRs
☆51Jun 24, 2026Updated this week
Alternatives and similar repositories for vllm-daily
Users that are interested in vllm-daily are comparing it to the libraries listed below.
- ☆33Apr 19, 2025Updated last year
- Development containers for triton and triton-cpu☆28Updated this week
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆31Apr 22, 2025Updated last year
- Expert Specialization MoE Solution based on CUTLASS☆27Apr 14, 2026Updated 2 months ago
- Hex encode & decode a string, right from your terminal.☆10Jan 5, 2023Updated 3 years ago
- Implements kernels with RISC-V Vector☆22Mar 24, 2023Updated 3 years ago
- Empowering LLM Agents for Real-World Computer System Optimization☆18Sep 10, 2025Updated 9 months ago
- Custom recipes for NVIDIA Nsight Systems post-collection analysis.☆16Nov 7, 2025Updated 7 months ago
- JAX bindings for the flash-attention3 kernels☆24Jan 2, 2026Updated 5 months ago
- Personal knowledge library☆10Nov 9, 2017Updated 8 years ago
- ☆116Apr 19, 2024Updated 2 years ago
- VeriBetrKV OSDI'20 artifact☆13Sep 5, 2020Updated 5 years ago
- MESMERIC: A Software-based NVM Emulator Supporting Read/Write Asymmetric Latencies☆10Oct 1, 2020Updated 5 years ago
- Kernel sources for https://huggingface.co/kernels-community☆127Updated this week
- CS294-162; Machine Learning Systems Seminar☆32Apr 11, 2023Updated 3 years ago
- A reading group for system verification papers☆10Sep 28, 2023Updated 2 years ago
- The vLLM XPU kernels for Intel GPU☆49Updated this week
- 2023 China Open Source Annual Report☆16Mar 9, 2026Updated 3 months ago
- Official implementation for paper "FedGame: A Game-Theoretic Defense against Backdoor Attacks in Federated Learning" (NeurIPS 2023).☆13Oct 25, 2024Updated last year
- PyTorch distributed backend extension with compression support☆17Mar 24, 2025Updated last year
- Collection of scripts to build PyTorch and the domain libraries from source.☆14Jun 9, 2026Updated 2 weeks ago
- Perplexity GPU Kernels☆585Nov 7, 2025Updated 7 months ago
- ☆16Nov 2, 2022Updated 3 years ago
- Word segmentation of classical Chinese poetry, word-vector analysis, export to Excel, and word clouds☆10Jul 6, 2022Updated 3 years ago
- Exploring how optimizations for GEMMs work☆36Feb 28, 2026Updated 4 months ago
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆18Aug 21, 2023Updated 2 years ago
- High-performance embedded graph database for analytics and real-time transactions☆117Updated this week
- Learn LaTeX online☆15Apr 1, 2022Updated 4 years ago
- A simple demonstration of how PyTorch autograd works☆16Sep 23, 2021Updated 4 years ago
- ☆12Sep 2, 2021Updated 4 years ago
- Scala 3 Standard Library with bracket syntax.☆11Jul 10, 2021Updated 4 years ago
- ☆15Apr 28, 2023Updated 3 years ago
- Discuz Q.☆15Dec 23, 2020Updated 5 years ago
- ☆22May 5, 2025Updated last year
- Type-safe NBT parser with kotlinx.serialization, SNBT and JSON support.☆13Mar 4, 2023Updated 3 years ago
- ☆19Nov 6, 2023Updated 2 years ago
- ☆106Sep 9, 2024Updated last year
- A demo for hexo-theme-book.☆10Oct 10, 2020Updated 5 years ago
- Hung-Yi Lee Linear Algebra 2018 Fall Homework☆10May 5, 2019Updated 7 years ago