Open Source Text Embedding Models with OpenAI Compatible API
☆167Jul 13, 2024Updated last year
Alternatives and similar repositories for open-text-embeddings
Users that are interested in open-text-embeddings are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- setup the env for vllm users☆16Oct 31, 2023Updated 2 years ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆45Jul 16, 2024Updated last year
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,752Mar 24, 2026Updated 3 weeks ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Jun 6, 2023Updated 2 years ago
- ☆10Nov 1, 2025Updated 5 months ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- A blazing fast inference solution for text embeddings models☆4,684Updated this week
- Your AI Powered Toolkit☆17May 19, 2024Updated last year
- Sentence Embedding as a Service☆15Jun 30, 2025Updated 9 months ago
- OpenAI compatible API for TensorRT LLM triton backend☆219Aug 1, 2024Updated last year
- Normalize text string☆12Nov 6, 2018Updated 7 years ago
- Node starter kit for semantic-search. Uses Mighty Inference Server with Qdrant vector search.☆15May 15, 2023Updated 2 years ago
- llm-inference is a platform for publishing and managing llm inference, providing a wide range of out-of-the-box features for model deploy…☆94May 17, 2024Updated last year
- vsftpd server providing FTP access to files from an Amazon S3 bucket☆18Oct 24, 2019Updated 6 years ago
- Conversion script adapting vicuna dataset into alpaca format for use with oobabooga's trainer☆13Jun 21, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- A fluent, scalable, and easy-to-use LLM data processing framework.☆28Jan 31, 2026Updated 2 months ago
- Repo for testing foundation models☆12Jan 19, 2024Updated 2 years ago
- ☆19Oct 18, 2025Updated 6 months ago
- ☆26Jul 13, 2024Updated last year
- A framework for evaluating the effectiveness of chain-of-thought reasoning in language models.☆19Feb 6, 2025Updated last year
- Cryptocurrency tax and tracking tools for the Beancount platform.☆12Mar 25, 2026Updated 3 weeks ago
- ☆145Aug 20, 2025Updated 7 months ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆132Jun 25, 2024Updated last year
- Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p…☆9,225Updated this week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)☆109Aug 18, 2025Updated 8 months ago
- ☆11Aug 20, 2025Updated 7 months ago
- Lightweight continuous batching OpenAI compatibility using HuggingFace Transformers include T5 and Whisper.☆29Mar 15, 2025Updated last year
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆51Oct 18, 2024Updated last year
- Open Source WizardCoder Dataset☆166Jul 12, 2023Updated 2 years ago
- Python bindings for llama.cpp☆68Feb 29, 2024Updated 2 years ago
- Generate single-file static responsive HTML page from Markdown with syntax-highlighting.☆16Apr 8, 2026Updated last week
- Imitate OpenAI with Local Models☆90Aug 27, 2024Updated last year
- Paste Word, get Markdown☆17Jul 30, 2024Updated last year
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆36Sep 6, 2024Updated last year
- Code implement reposity of Paper HiQA☆107Mar 2, 2025Updated last year
- Cython implementation of Moattar and Homayounpour's Voice Activity Detection (VAD) algorithm fast enough for real-time on an RPi 3.☆12Aug 18, 2018Updated 7 years ago
- Run `playwright.launchServer()` in docker☆15Feb 1, 2026Updated 2 months ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆91Feb 2, 2025Updated last year
- Have a natural voice conversation with an LLM☆264Jan 20, 2026Updated 2 months ago
- Very simple and customizable mock/echo server☆17Apr 8, 2026Updated last week