Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.
☆45Jul 16, 2024Updated last year
Alternatives and similar repositories for vllm-embedding
Users that are interested in vllm-embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- IJMLC: Open-TI: Open Traffic Intelligence with Augmented Language Model☆22Jul 30, 2025Updated 9 months ago
- ☆21Apr 24, 2026Updated 2 weeks ago
- A `tree` util enhanced with tokens, lines, and components. `pip install -U tree_plus`☆15Nov 24, 2025Updated 5 months ago
- Informative Conversational Query Rewriting☆39Jan 29, 2024Updated 2 years ago
- Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.☆40Apr 29, 2024Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Yet another coding assistant powered by LLM.☆16Sep 11, 2024Updated last year
- Playing with CSM☆22Mar 14, 2025Updated last year
- ACM Transactions on Information Systems (TOIS), the code and datasets for CKML.☆13Aug 31, 2023Updated 2 years ago
- Synthetic Hypertext and Homomorphic Catalogue☆15Dec 28, 2024Updated last year
- WhisperX Service love docker!☆18Aug 17, 2024Updated last year
- A BERT-based application for reusable text classification at scale☆37Jul 23, 2023Updated 2 years ago
- ☆14Mar 3, 2026Updated 2 months ago
- Layout Analysis Dataset with Segmonto (LADaS)☆25Jul 12, 2025Updated 9 months ago
- ☆21Feb 6, 2024Updated 2 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A simple AI agent controlling a simulation of a smart home☆13Jun 13, 2024Updated last year
- ☆18Dec 1, 2023Updated 2 years ago
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆15Jun 3, 2023Updated 2 years ago
- How to quickly serve an LLM using Fast API, Celery, and Redis☆17Aug 29, 2023Updated 2 years ago
- The best terminal chat client for your live streams☆19Jun 10, 2023Updated 2 years ago
- AWS Endpoint for Meta's MusicGen, with a Max4Live device to use it within Ableton Live☆17Jun 16, 2023Updated 2 years ago
- LinGPT, a GPT-4 webpage with just a single HTML file. 只有一个html文件的GPT4聊天网页,零门槛,10秒搞定。多Key轮询 Auto Key Rotation 支持代理平台/第三方Key Supports proxy…☆12Aug 28, 2023Updated 2 years ago
- ☆40Jul 26, 2024Updated last year
- A beautiful Astro theme based on Ghost Simply theme☆12Apr 23, 2026Updated 2 weeks ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Local transcription and speaker diarization with pyannote and parakeet☆27May 2, 2026Updated last week
- Python package to extract and analyse Canadian, United States and Indian real estate data from REALTOR.CA, REALTOR.COM and HOUSING.COM☆16Dec 21, 2025Updated 4 months ago
- A glowfic to epub converter.☆14Apr 11, 2026Updated 3 weeks ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆51May 19, 2025Updated 11 months ago
- FastAPI Microservices Architecture SDK - As Basis for multiple services in a platform/system☆12Oct 4, 2022Updated 3 years ago
- MediaWiki Categories Model☆13Feb 14, 2024Updated 2 years ago
- tuimorphic choose-your-own-adventure story game☆19Apr 30, 2026Updated last week
- ☆14Jul 25, 2023Updated 2 years ago
- A CLI tool to help you easily delete forked repositories.☆10Feb 16, 2026Updated 2 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- An unofficial MCP interface to interact with the PapersWithCode API☆22Jun 7, 2025Updated 11 months ago
- model UI experiments☆14Aug 20, 2024Updated last year
- one-click deepfake (face swap)☆10May 30, 2023Updated 2 years ago
- Code repository for the paper "The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Le…☆14Jan 16, 2025Updated last year
- ☆22Dec 18, 2025Updated 4 months ago
- gpt for bash: your wish is the command☆14Aug 8, 2023Updated 2 years ago
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated 2 years ago