Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.
☆45Jul 16, 2024Updated last year
Alternatives and similar repositories for vllm-embedding
Users that are interested in vllm-embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- IJMLC: Open-TI: Open Traffic Intelligence with Augmented Language Model☆22Jul 30, 2025Updated 10 months ago
- ☆23May 12, 2026Updated 2 weeks ago
- A `tree` util enhanced with tokens, lines, and components. `pip install -U tree_plus`☆15Nov 24, 2025Updated 6 months ago
- Open Source Text Embedding Models with OpenAI Compatible API☆168Jul 13, 2024Updated last year
- setup the env for vllm users☆16Oct 31, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 3 years ago
- WhisperX Service love docker!☆18Aug 17, 2024Updated last year
- BPE modification that implements removing of the intermediate tokens during tokenizer training.☆27Nov 25, 2024Updated last year
- ☆11Nov 5, 2021Updated 4 years ago
- Layout Analysis Dataset with Segmonto (LADaS)☆25Jul 12, 2025Updated 10 months ago
- WaterCooler is an open source, desktop GUI for interacting with ChatGPT, created with Tauri.☆31Dec 28, 2023Updated 2 years ago
- A simple AI agent controlling a simulation of a smart home☆13Jun 13, 2024Updated last year
- ☆18Dec 1, 2023Updated 2 years ago
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆16Jun 3, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- How to quickly serve an LLM using Fast API, Celery, and Redis☆17Aug 29, 2023Updated 2 years ago
- Official Webassemly (NodeJS) port of OpenFHE: Work in progress☆18May 23, 2025Updated last year
- Local search for NAS☆18Nov 3, 2020Updated 5 years ago
- Shen Zhou, Tieyun Qian: On the Strength of Sequence Labeling and Generative Models for Aspect Sentiment Triplet Extraction. Findings of A…☆12May 26, 2023Updated 3 years ago
- ALAS: Autonomous Learning Agent System☆17Aug 14, 2025Updated 9 months ago
- 🔬 ArXiv论文智能解读助手 - Arxiv-MCP-Server, 支持MCP协议的学术论文一键下载、解析、翻译为中文,并生成微信公众号文章格式☆42Jun 16, 2025Updated 11 months ago
- PyTorch Implementation of A Deep Learning System for Predicting Size and Fit in Fashion E-Commerce (RecSys'19)☆14Aug 23, 2021Updated 4 years ago
- Review econometrics concepts with code examples☆16Oct 23, 2022Updated 3 years ago
- A glowfic to epub converter.☆14Apr 11, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆10Jan 10, 2025Updated last year
- This repository contains the results and code for the MLPerf™ Inference v3.0 benchmark.☆19Jul 24, 2025Updated 10 months ago
- FastAPI Microservices Architecture SDK - As Basis for multiple services in a platform/system☆12Oct 4, 2022Updated 3 years ago
- A methodology designed to measure the contribution of the features to the predictive performance of any econometric or machine learning m…☆18Nov 28, 2024Updated last year
- ☆14Jul 25, 2023Updated 2 years ago
- A framework for writing Unstract Tools/Apps☆23Nov 5, 2025Updated 6 months ago
- An unofficial MCP interface to interact with the PapersWithCode API☆22Jun 7, 2025Updated 11 months ago
- ☆13Aug 10, 2023Updated 2 years ago
- ☆52Feb 19, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Python client and REST API for calling an Instruction-Tuned Chat-Style LLM☆16Mar 26, 2023Updated 3 years ago
- tuimorphic choose-your-own-adventure story game☆20Apr 30, 2026Updated last month
- ☆22Dec 18, 2025Updated 5 months ago
- gpt for bash: your wish is the command☆14Aug 8, 2023Updated 2 years ago
- Set of scripts to finetune LLMs☆38Mar 30, 2024Updated 2 years ago
- The best terminal chat client for your live streams☆19Jun 10, 2023Updated 2 years ago
- UI for testing prompts across various datasets locally☆13Nov 2, 2024Updated last year