A high-throughput and memory-efficient inference and serving engine for LLMs
☆29May 12, 2025Updated 9 months ago
Alternatives and similar repositories for vllm
Users who are interested in vllm are comparing it to the libraries listed below
- Fastened CROWN: Tightened Neural Network Robustness Certificates☆10Feb 10, 2020Updated 6 years ago
- ncnn export & infer mobileclip☆19Aug 18, 2025Updated 6 months ago
- A multithreaded, high-concurrency server based on the select model, which also implements a memory pool and an object pool☆10Nov 4, 2019Updated 6 years ago
- PyTorch implementation of the SIESTA algorithm from our TMLR-2023 paper "SIESTA: Efficient Online Continual Learning with Sleep"☆13Oct 25, 2024Updated last year
- ☆11Oct 8, 2020Updated 5 years ago
- The Structure and Interpretation of Deep Networks Handbook☆14Dec 14, 2024Updated last year
- The official implementation of paper "Drop-Activation: Implicit Parameter Reduction and Harmonious Regularization".☆10May 30, 2019Updated 6 years ago
- An example of how the LIME algorithm can be used to provide real-world insight into the decision processes of a 'black-box' machine learn…☆15Feb 19, 2019Updated 7 years ago
- A WordPress module that handles installing plugins and themes.☆11Feb 19, 2026Updated 2 weeks ago
- ☆11Jun 2, 2021Updated 4 years ago
- Fully open reproduction of DeepSeek-R1☆12Mar 24, 2025Updated 11 months ago
- An Efficient Dataset Condensation Plugin and Its Application to Continual Learning. NeurIPS, 2023.☆12Nov 29, 2023Updated 2 years ago
- Unofficial docker wrapper for Qualcomm SNPE(Snapdragon Neural Processing Engine) SDK☆11Mar 3, 2022Updated 4 years ago
- GPU methods for alpha matting, including cutting edge research algorithms by Philip G. Lee.☆12Jan 8, 2014Updated 12 years ago
- official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation☆13Apr 15, 2024Updated last year
- Public Docker Images Collection☆11Jan 10, 2026Updated last month
- This is the official code for the paper "Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaborati…☆12Aug 13, 2024Updated last year
- Code for "Can We Characterize Tasks Without Labels or Features?" (CVPR 2021)☆11Aug 31, 2021Updated 4 years ago
- hestiacp-nginx-nextjs☆12Oct 14, 2025Updated 4 months ago
- Weighted Nonlocal Total Variation in Image Processing☆10Jul 11, 2023Updated 2 years ago
- A set of tools to help migration to WordPress.☆18Feb 25, 2026Updated last week
- Real-time AI video segmentation of USB camera and streaming over HTTP☆12Apr 23, 2025Updated 10 months ago
- ☆10Dec 12, 2020Updated 5 years ago
- Approximate the product between infinite functional objects on a manifold -- i.e. belief products☆12Feb 18, 2026Updated 2 weeks ago
- ☆13Nov 26, 2023Updated 2 years ago
- A straightforward implementation of EGBM-based Generalized Additive Model☆14Oct 15, 2020Updated 5 years ago
- Linear Mode Connectivity in Multitask and Continual Learning: https://arxiv.org/abs/2010.04495☆12Oct 12, 2020Updated 5 years ago
- A repository to store my cuda codes, including some common-used kernels.☆12Sep 19, 2021Updated 4 years ago
- ☆10Mar 24, 2024Updated last year
- Code repo for the paper "Semantic Correspondence via 2D-3D-2D Cycle"☆12Jan 28, 2021Updated 5 years ago
- Mcity Data Engine☆21Feb 4, 2026Updated last month
- Matlab implementation of Poisson image editing☆14Feb 25, 2023Updated 3 years ago
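- WanJuan-CC is a high-quality dataset built on CommonCrawl and obtained through data extraction, rule-based cleaning, deduplication, safety filtering, and quality cleaning.☆14Apr 18, 2024Updated last year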
- We introduce a model of lifelong learning, based on a Network of Experts. New tasks / experts are learned and added to the model sequenti…☆11Aug 8, 2017Updated 8 years ago
- ☆17Jun 26, 2021Updated 4 years ago
- 📍 Official repository of paper "ProtoCLIP: Prototypical Contrastive Language Image Pretraining" (IEEE TNNLS 2023)☆55Nov 8, 2023Updated 2 years ago
- Implementation of model-free computational optics to a wide range of tasks.☆19Oct 21, 2024Updated last year
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 2 years ago
- The official implementation of ADDP (ICLR 2024)☆12Mar 27, 2024Updated last year