A high-throughput and memory-efficient inference and serving engine for LLMs
☆18Nov 24, 2023Updated 2 years ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below.
- ICE-PIXIU: A Cross-Language Financial Megamodeling Framework☆19Dec 4, 2024Updated last year
- the benchmark for finance☆11Jul 4, 2023Updated 2 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model☆18Aug 2, 2021Updated 4 years ago
- this is based on the paper Chain-of-Retrieval Augmented Generation☆14Mar 29, 2025Updated last year
- Unsupervised tableQA and databaseQA on Chinese finance question and tabular data☆13Apr 20, 2023Updated 3 years ago
- ☆21May 22, 2023Updated 2 years ago
- ☆43May 9, 2024Updated last year
- Accelerate vector generation by using an ONNX model☆18Jan 23, 2024Updated 2 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Sep 9, 2022Updated 3 years ago
- ☆11Mar 12, 2021Updated 5 years ago
- This repository is for the "LLM-Aligned Geographic Item Tokenization for Local-Life Recommendation".☆17Nov 18, 2025Updated 5 months ago
- Official Rust weekly newsletter (Simplified Chinese edition)☆15Jul 15, 2021Updated 4 years ago
- Unconditional Geomodeling related work (codes, data, and results)☆17Jan 4, 2023Updated 3 years ago
- Fine-tuning the Qwen1.5 large model with LoRA via the PEFT framework, implementing text classification on the HC3-Chinese dataset.☆12Jun 29, 2024Updated last year
- 📰 Named entity recognition (NER) and entity linking (EL) on a dataset of patents☆16Jun 5, 2022Updated 3 years ago
- code for "Fine-grained Entity Typing via Label Reasoning" EMNLP2021☆13May 27, 2022Updated 3 years ago
- PDF table extraction☆10Dec 14, 2021Updated 4 years ago
- ☆12Jun 19, 2025Updated 10 months ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆17Nov 3, 2024Updated last year
- Contextual Retrieval solves this problem by prepending chunk-specific explanatory context to each chunk before embedding (“Contextual Emb…☆28Sep 29, 2024Updated last year
- Zero-shot entity linking with less data☆15Aug 1, 2022Updated 3 years ago
- [ICLR 2026] The official implementation of the paper “Anchored Supervised Fine-Tuning”☆37Feb 12, 2026Updated 2 months ago
- ☆12Apr 29, 2024Updated last year
- implementation of http://arxiv.org/pdf/1511.06391v4.pdf in keras☆13Oct 3, 2016Updated 9 years ago
- A SOCKS5 proxy tool for bypassing the firewall, written in Java☆13Mar 13, 2015Updated 11 years ago
- Code for paper "Open Relation and Event Type Discovery with Type Abstraction". EMNLP 22'☆16Nov 30, 2022Updated 3 years ago
- The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational Autoencoders…☆16Sep 27, 2021Updated 4 years ago
- ☆12Dec 8, 2022Updated 3 years ago
- The second repository (GeoModeling_Conditional_ProGAN) is used for conditioning to well data and global features. This repository is for …☆23Jan 4, 2023Updated 3 years ago
- Chain of Agents implementation in Python and Swift☆29Jan 15, 2026Updated 3 months ago
- Unofficial PyTorch implementation of the paper "Multi-Label Image Recognition with Graph Convolutional Networks"☆10Feb 19, 2023Updated 3 years ago
- 2022 WAIC Hackathon, Ant Fortune track: AntSQL large-scale financial semantic parsing Chinese Text-to-SQL challenge; a newcomer's code, hehe☆14Mar 11, 2023Updated 3 years ago
- Using tensorflow to create a recommendation engine with DNN☆16Jul 30, 2016Updated 9 years ago
- Code for paper "Parameter Efficient Multi-task Model Fusion with Partial Linearization"☆25Sep 13, 2024Updated last year
- The source code of paper "Gradient Imitation Reinforcement Learning for Low Resource Relation Extraction"☆24Oct 28, 2021Updated 4 years ago
- ☆31Sep 12, 2025Updated 7 months ago
- Data and code for the paper "The Moral Integrity Corpus: A Benchmark for Ethical Dialogue Systems"☆21Jul 18, 2023Updated 2 years ago