Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.
☆45Jul 16, 2024Updated last year
Alternatives and similar repositories for vllm-embedding
Users that are interested in vllm-embedding are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- IJMLC: Open-TI: Open Traffic Intelligence with Augmented Language Model☆22Jul 30, 2025Updated 8 months ago
- ☆16Apr 8, 2025Updated last year
- realtime conversational dynamics☆19Mar 19, 2025Updated last year
- Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines.☆40Apr 29, 2024Updated last year
- Playing with CSM☆22Mar 14, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- [ACM TOIS] Multi-Behavior Recommendation with Personalized Directed Acyclic Behavior Graphs☆14Dec 6, 2024Updated last year
- A BERT-based application for reusable text classification at scale☆38Jul 23, 2023Updated 2 years ago
- Layout Analysis Dataset with Segmonto (LADaS)☆24Jul 12, 2025Updated 9 months ago
- ☆11Nov 5, 2021Updated 4 years ago
- ☆21Feb 6, 2024Updated 2 years ago
- A synthetic 24 hour traffic scenario for a 45 km section of the German highway A81 between Stuttgart Feuerbach - Heilbronn (Baden-Württem…☆12Oct 5, 2020Updated 5 years ago
- ☆10Mar 24, 2023Updated 3 years ago
- A GPU version implementation of Guided Filter, using CUDA C/C++, calculates 1080P images in 10ms on 4090☆10Jun 21, 2023Updated 2 years ago
- A simple AI agent controlling a simulation of a smart home☆13Jun 13, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- [NeurIPS 2024] Beyond Single Stationary Policies: Meta-Task Players as Naturally Superior Collaborators☆16Nov 15, 2024Updated last year
- To assess the longtext capabilities more comprehensively, we propose Needle-in-a-Haystack PLUS, which shifts the focus from simple fact r…☆13Mar 4, 2024Updated 2 years ago
- Traccar Linux Client☆13Aug 4, 2013Updated 12 years ago
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆15Jun 3, 2023Updated 2 years ago
- The best terminal chat client for your live streams☆19Jun 10, 2023Updated 2 years ago
- 🔥 AgentScale: A Scalable Microservices-based Agent Orchestration Framework☆27Jul 25, 2024Updated last year
- My portfolio☆10Jun 4, 2022Updated 3 years ago
- ☆40Jul 26, 2024Updated last year
- ☆16Aug 23, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆18Feb 18, 2025Updated last year
- 🔬 ArXiv论文智能解读助手 - Arxiv-MCP-Server, 支持MCP协议的学术论文一键下载、解析、翻译为中文,并生成微信公众号文章格式☆39Jun 16, 2025Updated 10 months ago
- Fantastic Dungeons - 7DRL 2016☆10Mar 12, 2016Updated 10 years ago
- Review econometrics concepts with code examples☆16Oct 23, 2022Updated 3 years ago
- A glowfic to epub converter.☆14Apr 11, 2026Updated last week
- ☆13Sep 12, 2024Updated last year
- ☆10Jan 10, 2025Updated last year
- FastAPI Microservices Architecture SDK - As Basis for multiple services in a platform/system☆12Oct 4, 2022Updated 3 years ago
- tuimorphic choose-your-own-adventure story game