rag-wtf / open-text-embeddingsView external linksLinks
Open Source Text Embedding Models with OpenAI Compatible API
☆167Jul 13, 2024Updated last year
Alternatives and similar repositories for open-text-embeddings
Users that are interested in open-text-embeddings are comparing it to the libraries listed below
Sorting:
- fast-embeddings-api☆16Nov 23, 2023Updated 2 years ago
- Deployment a light and full OpenAI API for production with vLLM to support /v1/embeddings with all embeddings models.☆44Jul 16, 2024Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio☆38Jun 6, 2023Updated 2 years ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,661Feb 5, 2026Updated last week
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Oct 18, 2024Updated last year
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.☆11May 26, 2023Updated 2 years ago
- Node starter kit for semantic-search. Uses Mighty Inference Server with Qdrant vector search.☆15May 15, 2023Updated 2 years ago
- A blazing fast inference solution for text embeddings models☆4,476Feb 4, 2026Updated last week
- Langport is a language model inference service☆94Sep 9, 2024Updated last year
- Repo for paper "Tell Me More! Towards Implicit User Intention Understanding of Language Model Driven Agents"☆61Feb 20, 2024Updated last year
- Python bindings for llama.cpp☆68Feb 29, 2024Updated last year
- Voice activity engine benchmark framework☆21Jan 14, 2026Updated 3 weeks ago
- SLOP Detector and analyzer based on dictionary for shareGPT JSON and text☆82Feb 7, 2026Updated last week
- ☆19Sep 4, 2024Updated last year
- Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-p…☆9,053Updated this week
- Have a natural voice conversation with an LLM☆262Jan 20, 2026Updated 3 weeks ago
- Enable tool-use ability for any LLM model (DeepSeek V3/R1, etc.)☆58May 27, 2025Updated 8 months ago
- 🩹Editing large language models within 10 seconds⚡☆1,361Aug 13, 2023Updated 2 years ago
- Imitate OpenAI with Local Models☆89Aug 27, 2024Updated last year
- RestAI's Frontend☆22Sep 4, 2025Updated 5 months ago
- 如需体验textin文档解析,请点击https://cc.co/16YSIy☆22Jul 9, 2024Updated last year
- Merge Transformers language models by use of gradient parameters.☆214Aug 8, 2024Updated last year
- Conversational Retrieval Evaluation Dataset☆101Aug 19, 2025Updated 5 months ago
- Generate visual podcasts about novels using open source models☆25Feb 15, 2023Updated 2 years ago
- Retrieval and Retrieval-augmented LLMs☆11,280Dec 15, 2025Updated last month
- TheBloke's Dockerfiles☆308Mar 8, 2024Updated last year
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆104May 30, 2025Updated 8 months ago
- Interactive chat application leveraging OpenAI's GPT-4 for real-time conversation simulations. Built with Flask, this project showcases s…☆25Apr 2, 2024Updated last year
- Browser extensions for the Knowledge application☆33Jul 16, 2022Updated 3 years ago
- ☆32Jul 5, 2024Updated last year
- OpenChat: Advancing Open-source Language Models with Imperfect Data☆5,472Sep 13, 2024Updated last year
- A pipeline for LLM knowledge distillation☆112Apr 2, 2025Updated 10 months ago
- Train Llama Loras Easily☆31Aug 3, 2023Updated 2 years ago
- Complex RAG backend☆29Mar 28, 2024Updated last year
- An interface for llama.cpp, ChatGPT, Gemini, and Claude