vLLM Daily Summarization of Merged PRs
☆48Mar 25, 2026Updated this week
Alternatives and similar repositories for vllm-daily
Users who are interested in vllm-daily are comparing it to the libraries listed below.
- ☆32Apr 19, 2025Updated 11 months ago
- Development containers for triton and triton-cpu☆27Updated this week
- The official implementation for the intra-stage fusion technique introduced in https://arxiv.org/abs/2409.13221☆31Apr 22, 2025Updated 11 months ago
- Kernel sources for https://huggingface.co/kernels-community☆83Updated this week
- Expert Specialization MoE Solution based on CUTLASS☆27Jan 19, 2026Updated 2 months ago
- Triton-based Symmetric Memory operators and examples☆94Jan 15, 2026Updated 2 months ago
- Custom recipes for NVIDIA Nsight Systems post-collection analysis.☆16Nov 7, 2025Updated 4 months ago
- USTC Computational Physics A☆10Aug 16, 2021Updated 4 years ago
- JAX bindings for the flash-attention3 kernels☆22Jan 2, 2026Updated 2 months ago
- VeriBetrKV OSDI'20 artifact☆13Sep 5, 2020Updated 5 years ago
- MESMERIC: A Software-based NVM Emulator Supporting Read/Write Asymmetric Latencies☆10Oct 1, 2020Updated 5 years ago
- Course assignment templates for the School of Physics, Peking University☆11Sep 30, 2022Updated 3 years ago
- CS294-162; Machine Learning Systems Seminar☆32Apr 11, 2023Updated 2 years ago
- A reading group for system verification papers☆10Sep 28, 2023Updated 2 years ago
- ☆17Updated this week
- A NoneBot2 plugin that provides a NoneBot1 compatibility layer☆10Sep 20, 2021Updated 4 years ago
- Official implementation for paper "FedGame: A Game-Theoretic Defense against Backdoor Attacks in Federated Learning" (NeurIPS 2023).☆13Oct 25, 2024Updated last year
- PyTorch distributed backend extension with compression support☆17Mar 24, 2025Updated last year
- Collection of scripts to build PyTorch and the domain libraries from source.☆14Feb 4, 2026Updated last month
- Persistent Memory Tool Box☆12Mar 4, 2024Updated 2 years ago
- Official implementation of the paper: [EMNLP 2025] RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruct…☆21Dec 9, 2025Updated 3 months ago
- Perplexity GPU Kernels☆564Nov 7, 2025Updated 4 months ago
- ☆10May 26, 2020Updated 5 years ago
- Word segmentation and word-vector analysis of classical Chinese poetry, with export to Excel and word clouds☆10Jul 6, 2022Updated 3 years ago
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated last year
- ☆16Nov 2, 2022Updated 3 years ago
- clustering algorithm implementation☆13Nov 3, 2025Updated 4 months ago
- Exploring how optimizations for GEMMs work☆29Feb 28, 2026Updated last month
- STREAMer: Benchmarking remote volatile and non-volatile memory bandwidth☆17Aug 21, 2023Updated 2 years ago
- FoF Upload, but with TencentCloud COS☆14Nov 10, 2024Updated last year
- Learn LaTeX online☆15Apr 1, 2022Updated 3 years ago
- A simple demonstration of how PyTorch autograd works☆16Sep 23, 2021Updated 4 years ago
- ☆163Feb 15, 2025Updated last year
- Scala 3 Standard Library with bracket syntax.☆11Jul 10, 2021Updated 4 years ago
- Standalone command-line tool for compiling Triton kernels☆20Sep 13, 2024Updated last year
- Discuz Q.☆16Dec 23, 2020Updated 5 years ago
- ☆15Apr 28, 2023Updated 2 years ago
- ☆22May 5, 2025Updated 10 months ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago