A high-throughput and memory-efficient inference and serving engine for LLMs
☆18Nov 24, 2023Updated 2 years ago
Alternatives and similar repositories for vllm
Users that are interested in vllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Shaping Language Models with Cognitive Insights☆15Feb 29, 2024Updated 2 years ago
- ICE-PIXIU:A Cross-Language Financial Megamodeling Framework☆19Dec 4, 2024Updated last year
- the benchmark for finance☆11Jul 4, 2023Updated 2 years ago
- This is AlpaGasus2-QLoRA based on LLaMA2 with AlpaGasus mechanism using QLoRA!☆15Nov 22, 2023Updated 2 years ago
- Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model☆18Aug 2, 2021Updated 4 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Just awful. | The counter part of awesome.☆10Apr 25, 2018Updated 8 years ago
- ☆13Jun 11, 2024Updated last year
- Unsupervised tableQA and databaseQA on chinese finance question and tabular data☆13Apr 20, 2023Updated 3 years ago
- ☆21May 22, 2023Updated 2 years ago
- A Model Agnostic function to directly remove specified layers from the LLM☆10May 23, 2024Updated last year
- LangChain DeepResearch: Autonomous recursive research powered by any LLM☆19Mar 19, 2025Updated last year
- ☆43May 9, 2024Updated 2 years ago
- accelerate generating vector by using onnx model☆18Jan 23, 2024Updated 2 years ago
- A Multi-Format Transfer Learning Model for Event Argument Extraction via Variational Information Bottleneck☆10Sep 9, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ChatYuan-7B☆13Jun 16, 2023Updated 2 years ago
- Research project on glyph-based Chinese character embedding. Preparing for EMNLP 2019☆11Mar 18, 2019Updated 7 years ago
- Keras library for building (Universal) Transformers, facilitating BERT and GPT models☆11Jan 29, 2019Updated 7 years ago
- Rust 官方周报(简体中文版)☆15Jul 15, 2021Updated 4 years ago
- Unconditional Geomodeling related work (codes, data, and results)☆18Jan 4, 2023Updated 3 years ago
- 📰 Named entitity recognition (NER) and Entity linking (EL) on the dataset of Patents☆16Jun 5, 2022Updated 3 years ago
- code for "Fine-grained Entity Typing via Label Reasoning" EMNLP2021☆13May 27, 2022Updated 3 years ago
- WPF编写的词向量可视化工具,比较word2vec, glove, fastText的不同☆31Mar 6, 2017Updated 9 years ago
- [ACL'25 Oral] What Happened in LLMs Layers when Trained for Fast vs. Slow Thinking: A Gradient Perspective☆76Jun 25, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆12Jun 19, 2025Updated 10 months ago
- Open-source Human Feedback Library☆11Oct 25, 2023Updated 2 years ago
- ☆17Nov 3, 2024Updated last year
- Zero-shot entity linking with less data☆15Aug 1, 2022Updated 3 years ago
- ☆12Apr 29, 2024Updated 2 years ago
- implementation of http://arxiv.org/pdf/1511.06391v4.pdf in keras☆13Oct 3, 2016Updated 9 years ago
- ☆22Feb 8, 2025Updated last year
- Code for paper "Open Relation and Event Type Discovery with Type Abstraction". EMNLP 22'☆16Nov 30, 2022Updated 3 years ago
- [ICLR 2026] The official implementation of the paper “Anchored Supervised Fine-Tuning”☆38Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Evaluation utilities based on SymPy.☆22Dec 12, 2024Updated last year
- ☆12Dec 8, 2022Updated 3 years ago
- A new algorithm that formulates jailbreaking as a reasoning problem.☆26Jul 2, 2025Updated 10 months ago
- The second repository (GeoModeling_Conditional_ProGAN) is used for conditioning to well data and global features. This repository is for …☆24Jan 4, 2023Updated 3 years ago
- Chain of Agents implementation in Python and Swift☆30Jan 15, 2026Updated 3 months ago
- 🔱 A naive tool for AssetBundles exploring.☆11Nov 22, 2022Updated 3 years ago
- Unofficial PyTorch implementation of the paper "Multi-Label Image Recognition with Graph Convolutional Networks"☆10Feb 19, 2023Updated 3 years ago