LiuXiaoxuanPKU / vllmView external linksLinks
A high-throughput and memory-efficient inference and serving engine for LLMs
☆11Sep 4, 2025Updated 5 months ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below
Sorting:
- ☆12Dec 19, 2024Updated last year
- The largest open corpus of .docx files for document processing research☆45Jan 22, 2026Updated 3 weeks ago
- ☆13May 3, 2023Updated 2 years ago
- This Repo Contains Script To Fine Tune Open Source Models Using Unsloth by using UI with simple click and progress☆11Oct 3, 2024Updated last year
- Frontend (and soon also midleware and backend) for a new, opensource image generation platform.☆14Nov 5, 2022Updated 3 years ago
- [EMNLP 2022] RLET: A Reinforcement Learning Based Approach for Explainable QA with Entailment Trees☆11Jul 15, 2023Updated 2 years ago
- A sample app to debug and validate cellular modems on balena devices☆13Jun 5, 2019Updated 6 years ago
- Generic build server☆64May 25, 2014Updated 11 years ago
- notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and produc…☆10Dec 25, 2024Updated last year
- Code for data reduction and analysis of Galaxy Zoo 2☆14May 20, 2016Updated 9 years ago
- ☆11Oct 11, 2023Updated 2 years ago
- ☆16Jul 29, 2025Updated 6 months ago
- React 0.13 with ES6, Immutable.js and Flux, Isomorphic as well☆11Mar 10, 2015Updated 10 years ago
- ☆12Updated this week
- Website to view yale clubs and events☆15Feb 1, 2026Updated 2 weeks ago
- ☆14Jun 28, 2023Updated 2 years ago
- Yet another isomorphic react boilerplate. This one does not require node on server.☆10Apr 4, 2017Updated 8 years ago
- NodeJS SDK for the Sellix Developers API (developers.sellix.io). Quickly get started and create products, payments and more using NodeJS.☆11Jan 5, 2024Updated 2 years ago
- Implementation of IntelliJ IDEA code completion plugin using a local LLM.☆17Feb 9, 2026Updated last week
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 2 years ago
- ☆10Feb 3, 2025Updated last year
- 🚀 Automated deployment stack for AMD MI300 GPUs with optimized ML/DL frameworks and HPC-ready configurations☆12Nov 30, 2024Updated last year
- This is a cog implementation of StableLM☆17Jun 6, 2023Updated 2 years ago
- Unofficial package to easily interact with the Kits.AI API☆10Mar 1, 2024Updated last year
- A simple one file python script that executes AI processes defined in YML.☆14Mar 26, 2023Updated 2 years ago
- Caching for Graphql Resolvers☆19Nov 21, 2019Updated 6 years ago
- Port of Facebook's LLaMA model in C/C++☆12Updated this week
- [Oral; Neurips OPT2024 ] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆14Mar 18, 2025Updated 10 months ago
- a temporal graph analytics library based on Flink Stateful Functions☆11Jun 8, 2023Updated 2 years ago
- A package to interact with poe.com☆12Mar 16, 2023Updated 2 years ago
- ☆19Oct 2, 2024Updated last year
- arXiv-Chat: An AI research assistant and Discord bot☆13Jul 16, 2023Updated 2 years ago
- ☆13Oct 25, 2024Updated last year
- ☆18Dec 20, 2025Updated last month
- Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?☆19Mar 9, 2025Updated 11 months ago
- A serverless application that uses AnimateDiff to run a Text-to-Video task on RunPod.☆19Mar 8, 2024Updated last year
- [ACL 2025] Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models☆34Nov 4, 2025Updated 3 months ago
- 针对qwen微调模型进行数据预处理☆13Jan 8, 2024Updated 2 years ago
- ☆15Nov 7, 2022Updated 3 years ago