A high-throughput and memory-efficient inference and serving engine for LLMs
☆18Nov 24, 2023Updated 2 years ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below.
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated 2 years ago
- the benchmark for finance☆11Jul 4, 2023Updated 2 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model☆18Aug 2, 2021Updated 4 years ago
- this is based on the paper Chain-of-Retrieval Augmented Generation☆15Mar 29, 2025Updated last year
- ☆21May 22, 2023Updated 3 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated 2 years ago
- LangChain DeepResearch: Autonomous recursive research powered by any LLM☆19Mar 19, 2025Updated last year
- ☆43May 9, 2024Updated 2 years ago
- Accelerates vector generation by using an ONNX model☆18Jan 23, 2024Updated 2 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Sep 9, 2022Updated 3 years ago
- ☆11Mar 12, 2021Updated 5 years ago
- This repository is for the "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation".☆17Nov 18, 2025Updated 6 months ago
- Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629☆23Oct 14, 2025Updated 7 months ago
- ☆10Aug 15, 2023Updated 2 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆11Jan 29, 2019Updated 7 years ago
- Unconditional Geomodeling related work (codes, data, and results)☆18Jan 4, 2023Updated 3 years ago
- 📰 Named entity recognition (NER) and entity linking (EL) on a dataset of patents☆16Jun 5, 2022Updated 3 years ago
- code for "Fine-grained Entity Typing via Label Reasoning" EMNLP2021☆13May 27, 2022Updated 4 years ago
- An implementation for MLLM oversensitivity evaluation☆18Nov 16, 2024Updated last year
- PDF table extraction☆10Dec 14, 2021Updated 4 years ago
- A high-performance CUDA port of the CBOW model of word2vec☆17Sep 14, 2014Updated 11 years ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆76Jun 25, 2025Updated 11 months ago
- Contextual Retrieval solves this problem by prepending chunk-specific explanatory context to each chunk before embedding (“Contextual Emb…☆28Sep 29, 2024Updated last year
- ☆17Nov 3, 2024Updated last year
- ☆12Apr 29, 2024Updated 2 years ago
- Server for ZKML☆22Mar 5, 2023Updated 3 years ago
- ☆14Jul 11, 2024Updated last year
- implementation of http://arxiv.org/pdf/1511.06391v4.pdf in keras☆13Oct 3, 2016Updated 9 years ago
- ☆23Feb 8, 2025Updated last year
- A SOCKS5 proxy tool for bypassing the firewall, written in Java☆13Mar 13, 2015Updated 11 years ago
- [ICLR 2026] The official implementation of the paper “Anchored Supervised Fine-Tuning”☆41May 8, 2026Updated 3 weeks ago
- The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational Autoencoders…☆16Sep 27, 2021Updated 4 years ago
- The second repository (GeoModeling_Conditional_ProGAN) is used for conditioning to well data and global features. This repository is for …☆24Jan 4, 2023Updated 3 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- A new algorithm that formulates jailbreaking as a reasoning problem.☆26Jul 2, 2025Updated 10 months ago
- Unofficial PyTorch implementation of the paper "Multi-Label Image Recognition with Graph Convolutional Networks"☆10Feb 19, 2023Updated 3 years ago
- ☆21Jan 4, 2023Updated 3 years ago
- 2022 WAIC Hackathon, Ant Fortune track: AntSQL large-scale financial semantic parsing Chinese Text-to-SQL challenge. A newcomer's code, hehe☆14Mar 11, 2023Updated 3 years ago